dirkf
4143140adb
[YouPorn] Improve extraction
...
* use video ID from page
* detect unwatchable videos
* improve duration extraction
* fix count extraction and support large values
* detect and remnove SEO spam boilerplate description
2024-03-10 12:53:57 +00:00
dirkf
e1a57763d7
[YouPorn] Incorporate yt-dlp PR 8827
...
* from https://github.com/yt-dlp/yt-dlp/pull/8827
* extract from webpage instead of broken API URL
* thx The-MAGI
2024-03-10 12:53:57 +00:00
hatsomatt
820fae3b3a
[Videa] Fix extraction
...
* update API URL
* from https://github.com/yt-dlp/yt-dlp/pull/8003
* thanks to the authors!
Closes yt-dlp/7427
Authored by: hatsomatt, aky-01
2024-03-08 13:14:52 +00:00
dirkf
aef24d97e9
[Videa] Align with yt-dlp
2024-03-08 13:14:52 +00:00
dirkf
f7b30e3f73
[XFileShare] Update extractor for 2024
...
* simplify aa_decode()
* review and update supported sites and tests
* in above, include FileMoon.sx, and remove separate module
* incorporate changes from yt-dlp
* allow for decoding multiple scripts (eg, FileMoon)
* use new JWPlayer extraction
2024-03-08 13:03:42 +00:00
dirkf
f66372403f
[InfoExtractor] Rework and improve JWPlayer extraction
...
* use traverse_obj() and _search_json()
* support playlist `.load({**video1},{**video2}, ...)`
* support transform_source=... for _extract_jwplayer_data()
2024-03-08 13:03:42 +00:00
dirkf
7216fa2ac4
[InfoExtractor] Add _search_json()
...
* uses the error diagnostic to truncate the JSON string
* may be confused by non-C-Pythons
2024-03-08 13:03:42 +00:00
Aaron Tan
40bd5c1815
[caffeine.tv] Add new extractor ( #32514 )
...
* Add CaffeineTVIE info extractor to support site caffeine.tv
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2024-02-22 12:54:07 +00:00
dirkf
70f230f9cf
[GBNews]Add new extractor for GB News TV channel ( #29432 )
...
* Add extractor for GB News TV channel
* Support more GBNews URL formats
Allow alphanumeric and _ in place of `shows`, which redirect to site's preferred URL
* Update for 2024
2024-02-22 12:44:00 +00:00
dirkf
7687389f08
[Vbox7] Improve extraction, adding features from yt-dlp PR #9100
...
* changes from https://github.com/yt-dlp/yt-dlp/pull/9100 (thx
seproDev):
- attempt HLS extraction
- re-enable XFF
- test `view_count`, `duration` extraction
* improve commenting, error checks
2024-02-19 00:53:22 +00:00
dirkf
4416f82c80
[Vbox7IE] Sanitise ld+json containing unexpected characters
...
* based on PR #29680
* added hack to force invoking `transform_source`
* fixes #26218
2024-02-02 12:36:05 +00:00
dirkf
bdda6b81df
[Vbox7IE] Improve extraction
...
* DASH extraction no longer fails with new range support
* but always find combined formats if available
* suppress ineffective XFF geo-bypass (causes time-outs)
* adapted from https://github.com/ytdl-org/youtube-dl/pull/29680
* thx former GH user kikuyan
2024-02-02 12:36:05 +00:00
dirkf
1fd8f802b8
[InfoExtractor] Correctly resolve BaseURL in DASH manifest
...
Specs:
* ISO/IEC 23009-1:2012 section 5.6
* RFC 3986 section 5.
2024-02-02 12:36:05 +00:00
dirkf
4eaeb9b2c6
[InfoExtractor] Support byte range for DASH
...
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
* thx former GH user kikuyan
2024-02-02 12:36:05 +00:00
dirkf
c58b655a9e
[InfoExtractor] Support DASH subtitle extraction (yt-dlp back-port)
2024-02-02 12:36:05 +00:00
dirkf
dc512e3a8a
[YouTube] Fix like_count
extraction using likeButtonViewModel
...
* also fix various tests
* TODO: check against yt-dlp tests
2024-01-22 11:10:34 +00:00
dirkf
f8b0135850
[YouTube] Rework n-sig processing, realigning with yt-dlp
...
* apply n-sig before chunked fragments, fixes #32692
2024-01-22 11:10:34 +00:00
dirkf
640d39f03a
[InfoExtractor] Support some warning and ._downloader
shortcut methods from yt-dlp
2024-01-22 11:10:34 +00:00
Robotix
b1bbc1e502
[Epidemic Sound] Add new extractor ( #32628 )
...
* Add simple extractor
* Support separate tracks
* Use index as id instead of slug
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-12-06 01:17:57 +00:00
dirkf
55a442adae
[Imgur] Overhaul extractor module ( #32612 )
...
Revise extractors for new API and page formats
2023-12-05 20:02:30 +00:00
mimvahedi
c62936a5f2
[telewebion] Fix extraction ( #32634 )
...
* [telewebion] fix extraction
Resolves https://github.com/ytdl-org/youtube-dl/issues/5135#issuecomment-932952119
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-12-02 15:25:09 +00:00
ReenigneArcher
b7fca0fab3
[Youtube] Update consent cookie handling to match site
...
Apologies for force push!
[skip ci]
2023-11-29 21:43:02 +00:00
dirkf
31f50c8194
[S4C] Add thumbnail extraction, extract series as playlist
...
Based on https://github.com/yt-dlp/yt-dlp/pull/7776 : thx ifan-t, bashonly
2023-08-31 23:16:50 +01:00
dirkf
86e3cf5e58
[S4C] Add extractor for Sianel Pedwar Cymru
...
* from https://github.com/yt-dlp/yt-dlp/pull/7730 , thx ifan-t, bashonly
2023-08-04 22:54:12 +01:00
dirkf
a25e9f3c84
[compat] Use compat_open()
2023-07-25 13:19:43 +01:00
dirkf
b2ba24bb02
[InfoExtractor] Add _match_valid_url()
class method and refactor
...
* API compatible with yt-dlp
* also support Sequence of patterns in _VALID_URL
* one place to compile _VALID_URL
* TODO: remove existing extractor shims
2023-07-19 22:14:50 +01:00
dirkf
b2741f2654
[InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp
...
* add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386
thanks selfisekai
* add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921 ,
thanks Lesmiscore, pukkandan
* add tests for the above
* also fix HTML5 type recognition and tests, from
222a230871
,
thanks Lesmiscore
* update extractors in PR using above, fix tests.
2023-07-19 22:14:50 +01:00
dirkf
8465222041
[Clipchamp] Add new extractor back-ported from yt-dlp
2023-07-19 22:14:50 +01:00
dirkf
4339910df3
[DLF] Add site extractors back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6697 , thanks nick-cd
2023-07-19 22:14:50 +01:00
dirkf
eaaf4c6736
[Whyp] Add extractor back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6803 , thanks CoryTibbettsDev
2023-07-19 22:14:50 +01:00
dirkf
4566e6e53e
[GlobalPlayer] Add site extractors back-ported from yt-dlp
...
* from https://github.com/yt-dlp/yt-dlp/pull/6903 , thanks garret1317
2023-07-19 22:14:50 +01:00
dirkf
1e8ccdd2eb
[InfoExtractor] Support groups in _search_regex()
, etc
2023-07-19 22:14:50 +01:00
dirkf
fa7f0effbe
[YouTube] Avoid crash in author extraction
2023-06-22 23:14:21 +01:00
pukkandan
9112e668a5
[YouTube] Improve nsig function name extraction
...
Fixes player b7910ca8, using `,` vs `;`
See https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-1602231170
Co-authored-by: dirkf
2023-06-22 16:46:53 +01:00
dirkf
07af47960f
[YouTube] Improve fix for ae8ba2c
...
Thx: https://github.com/yt-dlp/yt-dlp/commit/01aba25
2023-06-18 00:52:18 +01:00
dirkf
ae8ba2c319
[YouTube] Fix KeyError QV
in signature extraction failed
...
* temporarily force missing global definition into sig JS
* improve test: thanks https://github.com/yt-dlp/yt-dlp/issues/7327#issuecomment-1595274615
* resolves #32314
2023-06-17 15:55:19 +01:00
dirkf
ee731f3d00
[ITV] Fix UA capitalisation in 384f632
2023-05-23 16:50:25 +01:00
dirkf
64d6dd64c8
[YouTube] Support Releases tab
2023-04-23 22:58:35 +01:00
dirkf
2da3fa04a6
[YouTube] Simplify signature patterns
2023-04-12 23:53:14 +01:00
pukkandan
3f6d2bd76f
[extractor/youtube] Bypass throttling for -f17
...
and related cleanup
Thanks @AudricV for the finding
Ref: yt-dlp/yt-dlp/commit/c9abebb
2023-03-19 02:29:00 +00:00
pukkandan
88f28f620b
[extractor/youtube] Construct fragment list lazily
...
Ref: yt-dlp/yt-dlp/commit/e389d17
See: yt-dlp/yt-dlp#6517
2023-03-19 02:29:00 +00:00
dirkf
6fece0a96b
[AENetworksBaseIE] Report missing show data instead of crash
2023-03-14 16:23:20 +00:00
pukkandan
3da17834a4
[Youtube] Construct dash formats with range
query
...
See yt-dlp/yt_dlp#6369
2023-03-03 15:02:15 +00:00
dirkf
f7ce98a21e
[YouTube] Support @owner format in uploader_id etc
...
* implement https://github.com/ytdl-org/youtube-dl/issues/31530#issuecomment-1435734719
* update affected tests
* misc clean-ups
2023-02-24 12:22:16 +00:00
pukkandan
1d3751c3fe
Escape URLs in sanitized_Request
, not sanitize_url
d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, sanitize_url
may not always receive an actual URL. Eg: When using youtube-dl "search query" --default-search ytsearch
, search query
gets escaped to search%20query
before being prefixed with ytsearch:
which is not the intended behavior. So the escaping is moved to sanitized_Request
instead.
2023-02-20 20:27:25 +00:00
df
6067451e43
[Vimeo] Fix e19ec52
for tween-age Pythons
...
* a check in older Pythons in the 2.7 and earlier, 3.3, 3.4 series caused "sre_constants.error: nothing to repeat"
* satisfy the check by avoiding nested qualifiers that can match empty string
Resolves #31597
2023-02-20 01:41:46 +00:00
dirkf
2dd6c6edd8
[YouTube] Avoid crash if uploader_id extraction fails
...
See #31530 .
2023-02-17 11:16:54 +00:00
dirkf
42b098dd79
[InfoExtractor] Handle unquoted values in OpenGraph searches
2023-02-14 02:53:16 +00:00
fonkap
6f8c2635a5
[StreamsbIE] Add extractor for streamsb.com (viewsb.com) ( #31517 )
...
* Add extractor for streamsb.com (viewsb.com)
* make data url using app.js version
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 03:54:51 +00:00
fonkap
de48105dd8
[KommunetvIE] Add extractor for kommunetv.no ( #31516 )
...
* Add extractor for kommunetv.no
* Using utils.update_url instead of regex
---------
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 03:54:51 +00:00