dirkf
0402710227
[jsinterp] Fix regexp parsing and .replace[All] method
...
* For performance, make regexp object instantiation lazy
* Other small performance improvements
2023-03-07 01:24:21 +00:00
pukkandan
3e92c60fcd
[jsinterp] Handle Date
at epoch 0
...
See yt-dlp/yt_dlp#6400
2023-03-03 15:02:15 +00:00
dirkf
e67e52a8f8
[test] Support test-case with volatile ID (eg live show)
...
Signalled by regexp ID value, eg: `'id': r're:[\da-zA-Z_-]{8,}'`
2023-02-24 12:22:16 +00:00
pukkandan
1d3751c3fe
Escape URLs in sanitized_Request
, not sanitize_url
d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, sanitize_url
may not always receive an actual URL. Eg: When using youtube-dl "search query" --default-search ytsearch
, search query
gets escaped to search%20query
before being prefixed with ytsearch:
which is not the intended behavior. So the escaping is moved to sanitized_Request
instead.
2023-02-20 20:27:25 +00:00
dirkf
dd9aa74bee
[test] Avoid name TestIE which causes a pytest warning
...
See: 060ac76257
2023-02-14 16:36:40 +00:00
dirkf
42b098dd79
[InfoExtractor] Handle unquoted values in OpenGraph searches
2023-02-14 02:53:16 +00:00
teddy171
33db85c571
[feat]: Add support to external downloader aria2p ( #31500 )
...
* feat: add class Aria2pFD
* feat: create call_downloader function
* feat: a colorful download interface to aria2pFD
* feat: change value name
* Apply suggestions from code review
Co-authored-by: dirkf <fieldhouse@gmx.net>
* Typo in suggestion
* fix: remove unused value
* fix: add not function to return value(0 is normal); add total_seconds to download.eta(timedelta object); add waiting status when hook progress
* fix: remove unuse method ..utils.format_bytes
* fix: be up to flake8
* fix: be up to flake8
* Apply suggestions from code review
* [feat] test external downloader aria2p
* [feat] test external downloader aria2p
* [fix] test_external_downloader.py
* Apply suggestions from code review
Co-authored-by: dirkf <fieldhouse@gmx.net>
* Apply suggestions from code review
Co-authored-by: dirkf <fieldhouse@gmx.net>
* Update test/test_external_downloader.py
Co-authored-by: dirkf <fieldhouse@gmx.net>
* Update test/test_external_downloader.py
Co-authored-by: dirkf <fieldhouse@gmx.net>
* Update youtube_dl/downloader/external.py
Co-authored-by: dirkf <fieldhouse@gmx.net>
* refactoring code and fix bugs
* Apply suggestions from code review
* Rename test_external_downloader.py to test_downloader_external.py
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 03:54:51 +00:00
dirkf
bafb6dec72
[YouTube] Refresh compat/utils usage
...
* import parse_qs()
* import parse_qs in lazy_extractors (clears old TODO)
* clean up old compiled lazy_extractors for Py2
* use update_url()
2023-02-13 03:54:51 +00:00
dirkf
4e04f10499
[compat] Update test_compat
...
[skip ci]
2023-02-13 03:54:51 +00:00
dirkf
249f2b6316
[compat] Systematise compat_ naming
...
[skip ci]
2023-02-13 03:54:51 +00:00
dirkf
d6b14ba316
[test] Fix TestAgeRestriction
...
* age restriction may cause DownloadError
* update obsolete test URLs
[skip ci]
2023-02-13 03:54:51 +00:00
dirkf
295736c9cb
[jsinterp] Improve parsing
...
* support subset `... else if ...`
* support `while`
* add `RegExp` class
* generalise `new` support
* limited more debug strings
* matching test changes
2023-02-02 16:31:49 +00:00
pukkandan
14ef89a8da
Support if
statements
...
Fix for yt-dlp/yt_dlp#6131
Closes #31509
2023-02-02 13:12:46 +00:00
Andrei Lebedev
27ed77aabb
[utils] Backport traverse_obj (etc) from yt-dlp ( #31156 )
...
* Backport traverse_obj and closely related function from yt-dlp (code by pukkandan)
* Backport LazyList, variadic(), try_call (code by pukkandan)
* Recast using yt-dlp's newer traverse_obj() implementation and tests (code by grub4k)
* Add tests for Unicode case folding support matching Py3.5+ (requires f102e3d
)
* Improve/add tests for variadic, try_call, join_nonempty
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-11-03 10:09:37 +00:00
dirkf
087ddc2371
[compat] Add test for compat_casefold()
2022-11-01 22:47:02 +00:00
dirkf
65ccb0dd4e
[compat] Add test for compat_casefold()
2022-11-01 21:33:39 +00:00
Xie Yanbo
c91cbf6072
[netease] Get netease music download url through player api ( #31235 )
...
* remove unplayable song from test
* compatible with python 2
* using standard User_Agent, fix imports
* use hash instead of long description
* fix lint
* fix hash
2022-10-11 13:55:09 +01:00
dirkf
2ced5a7912
[test] Implement string "lambda x: condition(x)" as an expected value
...
Semantics equivalent to `assert condition(got)`
2022-10-11 00:05:17 +01:00
pukkandan
7009bb9f31
[jsinterp] Workaround operator associativity issue
...
* temporary fix for player 5a3b6271 [1]
1. https://github.com/yt-dlp/yt-dlp/issues/4635#issuecomment-1235384480
2022-09-03 00:53:56 +01:00
dirkf
218c423bc0
[cache] Add cache validation by program version, based on yt-dlp
2022-09-01 13:28:30 +01:00
dirkf
55c823634d
[jsinterp] Handle new YT players 113ca41c, c57c113c
...
* add NaN
* allow any white-space character for `after_op`
* align with yt-dlp f26af78a8ac11d9d617ed31ea5282cfaa5bcbcfa (charcodeAt and bitwise overflow)
* allow escaping in regex, fixing player c57c113c
2022-09-01 10:57:12 +01:00
dirkf
4c6fba3765
[jsinterp] Improve try/catch/finally support
2022-08-26 08:51:17 +01:00
dirkf
d619dd712f
[jsinterp] Fix bug in operator precedence
...
* from 164b03c486
* added tests
2022-08-25 12:16:10 +01:00
dirkf
46b8ae2f52
[jsinterp] Clean up and pull yt-dlp style
...
* add compat_re_Pattern
* improve compat_collections_chain_map
* use class JS_Undefined
* remove unused code
2022-08-19 15:34:33 +01:00
dirkf
538ec65ba7
[jsinterp] Handle regexp literals and throw/catch execution ( #31182 )
...
* based on f6ca640b12
, thanks pukkandan
* adds parse support for regexp flags
2022-08-19 11:45:04 +01:00
dirkf
b0a60ce203
[jsinterp] Improve JS language support ( #31175 )
...
* operator ??
* operator ?.
* operator **
* accurate operator functions
* `undefined` handling
* object literals {a: 1, "b": expr}
* more tests for weird JS comparisons: see https://github.com/ytdl-org/youtube-dl/issues/31173#issuecomment-1217854397 .
2022-08-17 14:22:02 +01:00
dirkf
d231b56717
[jsinterp] Overhaul JSInterp to handle new YT players 4c3f79c5, 324f67b9 ( #31170 )
...
* back-port from yt-dlp 8f53dc44a0cc1c2d98c35740b9293462c080f5d0, thanks pukkandan
* also support void, improve <</>> precedence, improve expressions in comma-list
* add more tests
2022-08-14 18:45:45 +01:00
dirkf
deee741fb1
[test, etc] Improve download test logs; also clean up some new flake8 issues ( #31153 )
...
* [test] Identify testcase errors better
* [test] Identify download errors better
* [extractor/minds] Linter
* [extractor/aes] Linter
2022-08-09 21:05:00 +01:00
dirkf
11665dd236
[test] Fix linter for 3aa94d7945
2022-06-15 18:28:19 +01:00
dirkf
811c480f7b
[YouTube] Support JSON3 subtitle format
...
* subtitle tests updated to match
2022-06-09 15:25:23 +01:00
dirkf
3aa94d7945
[test] Fix workable subtitle tests (except YT) and mark others as skip, broken
...
* broken tests need to be fixed when fixing the respective IE
2022-06-08 23:11:33 +01:00
dirkf
ef044be34b
[test] Skip not _WORKING IE in subtitle tests; use unittest.skipTest throughout
2022-06-08 15:52:21 +01:00
df
6508688e88
Make default upload_/release_date a compat_str
...
Ensures download tests pass in Python 2 as well as 3; also
add YoutubeDL tests for timestamp -> upload_date etc.
2022-02-26 10:29:42 +00:00
dirkf
73e1ab6125
[test:download] Only extract enough videos for playlist_mincount
2022-02-05 02:47:21 +00:00
dirkf
7a497f1405
Rework 2c2c2bd
with an actual Mix page and realistic playlist size
...
From 2c2c2bd348 (commitcomment-65953545)
2022-02-04 04:09:23 +00:00
dirkf
5add3f4373
Merge branch 'pukkandan-yt-searchurl' into yt-dl-master
...
Closes #27749
2022-02-04 03:50:32 +00:00
dirkf
1e677567cd
[YouTube] Fix n-sig for player e06dea74 ( #30582 )
...
From yt-dl commit 48416bc
2022-02-01 14:39:03 +00:00
dirkf
9d142109f4
Back-port test_youtube_signature.py from yt-dlp and fix JSInterp accordingly
2022-01-30 00:05:54 +00:00
df
e1eae16b56
Handle default in switch better
...
Add a1fc7ca074
Thanks coletdjnz
2022-01-30 00:05:54 +00:00
df
96f87aaa3b
Back-port JS interpreter upgrade from yt-dlp PR #1437
2022-01-30 00:05:54 +00:00
df
39ca35e765
Fix test_youtube_flat_playlist_extraction
2022-01-29 20:00:21 +00:00
df
d76d59d99d
Remove obsolete non-working test_youtube_toptracks
2022-01-29 20:00:21 +00:00
df
2c2c2bd348
Fix test_youtube_mix
2022-01-29 20:00:21 +00:00
df
46e0a729b2
Remove obsolete test_youtube_course
2022-01-29 20:00:21 +00:00
df
57044eaceb
Fix test_youtube_playlist_noplaylist
2022-01-29 20:00:21 +00:00
pukkandan
a3373da70c
Merge branch 'UP/youtube-dl' into dl/YoutubeSearchURLIE
2022-01-30 01:07:28 +05:30
pukkandan
ed99d68bdd
Add back YoutubeSearchURLIE
2022-01-30 00:41:47 +05:30
Vladislav Dyshakov
690a357cc7
Added allow_file_scheme feature
2021-10-17 17:21:25 +07:00
Sergey M․
c4a451bcdd
[test_execution] Add test for lazy extractors (refs #28780 )
2021-04-21 04:47:29 +07:00
Sergey M․
5ad69d3d0e
[test_youtube_misc] Move YoutubeIE.extract_id test into separate module
2021-04-21 04:45:13 +07:00
PrinceOfPuppers
70baa7bfae
[test_youtube_lists] Actualize youtube flat playlist test ( closes #28045 )
2021-02-17 04:58:54 +07:00
Remita Amine
99c68db0a8
[youtube] add support phone/tablet JS player( closes #26424 )
2021-02-08 09:20:28 +01:00
Remita Amine
b46483a6ec
[youtube/test_youtube_signature] fix test
2021-02-01 16:35:07 +01:00
Remita Amine
9c724601ba
[youtube] remove description chapters tests
...
video description no longer contain yt.www.watch.player.seekTo
function
2021-02-01 16:11:07 +01:00
Sergey M․
142c584063
Introduce --output-na-placeholder ( closes #27896 )
2021-01-22 23:54:29 +07:00
Sergey M․
d81a213cfb
[YoutubeDL] Raise syntax error for format selection expressions with multiple + operators ( closes #27803 )
2021-01-14 00:38:31 +07:00
nixxo
3a61e6d360
[rai] improve subtitles extraction ( #27705 )
...
closes #27698
2021-01-07 13:48:45 +00:00
Remita Amine
e88c9ef62a
[utils] add a function to clean podcast URLs
2021-01-04 01:14:25 +01:00
Remita Amine
9dd674e1d2
[utils] accept only supported protocols in url_or_none
2020-12-30 09:22:30 +01:00
Sergey M․
af1312bfc3
[youtube:tab] Extend _VALID_URL ( closes #27501 )
2020-12-26 19:59:57 +07:00
Sergey M․
03d3af9768
[test_InfoExtractor] PEP 8
2020-12-13 23:47:13 +07:00
Sergey M․
1727541315
[extractor/common] Improve JSON-LD interaction statistic extraction (refs #23306 )
2020-12-13 20:24:13 +07:00
Sergey M․
5a1fbbf8b7
[extractor/common] Fix inline HTML5 media tags processing and add test ( closes #27345 )
2020-12-09 00:05:21 +07:00
Sergey M․
191286265d
[youtube:tab] Fix feeds extraction ( closes #25695 , closes #26452 )
2020-11-24 00:10:25 +07:00
Josh Soref
71ddc222ad
Fix typos ( #27084 )
...
* spelling: authorization
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: brightcove
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: creation
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: exceeded
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: exception
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extension
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extracting
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: extraction
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: frontline
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: improve
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: length
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: listsubtitles
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: multimedia
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: obfuscated
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: partitioning
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: playlist
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: playlists
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: restriction
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: services
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: split
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: srmediathek
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: support
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: thumbnail
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: verification
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
* spelling: whitespaces
Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
2020-11-21 22:00:05 +07:00
Sergey M․
ab0eda99e1
[YoutubeDL] Fix --ignore-errors for playlists with generator-based entries of url_transparent ( closes #27064 )
2020-11-21 04:00:08 +07:00
Sergey M․
2864179293
[youtube] Improve extraction
...
+ Add support for --no-playlist (closes #27009 )
* Improve playlist and mix extraction (closes #26390 , closes #26509 , closes #26534 , closes #27011 )
+ Extract playlist uploader data
* Update tests
2020-11-18 03:34:08 +07:00
Sergey M․
fe07e788bf
[utils] Skip ! prefixed code in js_to_json
2020-11-17 01:30:43 +07:00
Sergey M․
2de2ca6659
[youtube] Rework extractors
...
WIP
2020-11-12 06:16:37 +07:00
Kevin O'Connor
4eda10499e
[utils] Don't attempt to coerce JS strings to numbers in js_to_json ( #26851 )
...
The current logic in `js_to_json` tries to rewrite octal/hex numbers to
decimal. However, when the logic actually happens the `"` or `'` have
already been trimmed off. This causes what were originally strings, that
happen to look like octal/hex numbers, to get rewritten to decimal and
returned as a number rather than a string.
In practive something like:
```js
{
"0x40": "foo",
"040": "bar",
}
```
would get rewritten as:
```json
{
64: "foo",
32: "bar
}
```
This is problematic since this isn't valid JSON as you cannot have
non-string keys.
2020-10-18 00:10:41 +07:00
Sergey M․
1d9bf655e6
[utils] Recognize wav mimetype ( closes #26463 )
2020-09-06 11:19:53 +07:00
Sergey M․
84213ea8d4
[youtube] Extract chapters from JSON ( closes #24819 )
2020-06-06 04:22:10 +07:00
Sergey M․
c380cc28c4
[utils] Improve cookie files support
...
+ Add support for UTF-8 in cookie files
* Skip malformed cookie file entries instead of crashing (invalid entry len, invalid expires at)
2020-05-05 04:21:25 +07:00
Sergey M․
e40c758c2a
[youtube] Improve player id extraction and add tests
2020-05-02 07:18:08 +07:00
Sergey M․
042b664933
Revert "[utils] Add support for cookies with spaces used instead of tabs"
...
According to [1] TABs must be used as separators between fields.
Files produces by some tools with spaces as separators are considered
malformed.
1. https://curl.haxx.se/docs/http-cookies.html
This reverts commit cff99c91d1
.
2020-03-10 04:53:51 +07:00
Sergey M․
cff99c91d1
[utils] Add support for cookies with spaces used instead of tabs
2020-03-08 18:01:32 +07:00
Sergey M․
ea17979d83
[test_subtitles] Remove obsolete test
2020-02-29 22:08:43 +07:00
Sergey M․
4e9e1e240d
[test_YoutubeDL] Add tests for #10591 ( closes #23873 )
2020-02-15 03:37:31 +07:00
Sergey M․
e0abaab293
[test_YoutubeDL] Fix get_ids
2020-02-15 03:37:25 +07:00
Sergey M․
42db58ec73
[utils] Improve str_to_int
2019-12-15 23:15:24 +07:00
Remita Amine
348c6bf1c1
[utils] handle int values passed to str_to_int
2019-11-29 17:39:18 +01:00
Sergey M․
1ced222120
[utils] Add generic caesar cipher and rot47
2019-11-27 02:26:42 +07:00
InfernalUnderling
9d30c2132a
[utils] Handle rd-suffixed day parts in unified_strdate ( #23199 )
2019-11-27 00:08:37 +07:00
Remita Amine
237513e801
[yahoo] restore support for cbs suffixed URLs
2019-10-31 07:38:53 +01:00
Sergey M․
824fa51165
[utils] Improve subtitles_filename ( closes #22753 )
2019-10-18 04:03:53 +07:00
Sergey M․
28cc2241e4
[utils] Restrict parse_codecs and add theora as known vcodec ( #21381 )
2019-06-14 01:56:17 +07:00
Sergey M․
53cd37bac5
[utils] Improve strip_or_none
2019-05-24 00:03:01 +07:00
Sergey M․
3089bc748c
Fix W504 and disable W503 ( closes #20863 )
2019-05-11 03:57:40 +07:00
Jakub Wilk
fd35d8cdfd
[utils] Transliterate "þ" as "th" ( #20897 )
...
Despite visual similarity "þ" is unrelated to "p".
It is normally transliterated as "th":
$ echo þ-Þ | iconv -t ASCII//TRANSLIT
th-TH
2019-05-11 01:42:31 +07:00
Sergey M․
5e1271c56d
[utils] Improve int_or_none and float_or_none ( #20403 )
2019-03-23 01:08:54 +07:00
Sergey M․
d493f15c11
[extractor/common] Improve HTML5 entries extraction and add some realworld tests
2019-03-17 09:09:32 +07:00
Sergey M․
0dc41787af
[utils] Introduce parse_bitrate
2019-03-17 09:07:47 +07:00
Sergey M․
2e27421c70
[test_InfoExtractor] Add test for #20346
2019-03-15 01:20:24 +07:00
Sergey M․
067aa17edf
Start moving to ytdl-org
2019-03-11 04:00:54 +07:00
Sergey M․
fca9baf0da
[test] Fix test_compat_etree_Element
2019-03-06 02:46:26 +07:00
Sergey M․
399f76870d
[compat] Introduce compat_etree_Element
2019-03-06 01:18:52 +07:00
remitamine
e7e62441cd
[utils] strip #HttpOnly_ prefix from cookies files ( #20219 )
2019-03-03 19:23:59 +07:00
Ales Jirasek
22f5f5c6fc
[malltv] Add extractor ( closes #18058 )
2019-02-08 00:43:26 +07:00
Sergey M․
e118a8794f
[YoutubeDL] Fix typo in string negation implementation and add more tests ( closes #18961 )
2019-01-24 01:39:39 +07:00
Sergey M․
fad4ceb534
[utils] Fix urljoin for paths with non-http(s) schemes
2019-01-20 20:22:19 +07:00