599 Commits

Author SHA1 Message Date
Sergey M․ 63d990d285
[generic] Add support for Video.js embeds 2017-09-28 00:37:30 +07:00
Sergey M․ 02d01e15f1
[generic] Fix infinite recursion for twitter:player URLs (closes #14339) 2017-09-26 21:47:18 +07:00
kayb94 9ce1ac4046 [generic] Fix support for multiple HTML5 videos on one page (closes #14080) 2017-09-23 05:49:48 +07:00
Sergey M․ 66c9fa36c1
[youtube] Separate methods for embeds extraction 2017-09-06 00:48:37 +07:00
Sergey M․ ff17be3ac9
[extractor/generic] Extract from LD-JSON last of all
Previous sources may contain several formats, e.g. http://tamasha.com/v/PgGZ
2017-08-27 03:31:40 +07:00
Yen Chi Hsuan 09747ba766
[liveleak] Support another liveleak embedding pattern (closes #13336) 2017-08-19 22:28:13 +08:00
Sergey M․ 5551d7714d
[generic] Convert redirect URLs to unicode strings (closes #13951) 2017-08-17 23:58:01 +07:00
Sergey M․ eb02940cc7
[generic] Add test for #13895 2017-08-13 01:11:27 +07:00
Sergey M․ 1663bd6e1c
[generic] Replace vzaar embed test 2017-08-11 22:02:00 +07:00
tetra-eder 41918eaa5c [generic] Add support for vzaar embeds 2017-08-11 22:00:39 +07:00
Grzegorz Ruciński 198d4cb40c [generic] Add support for another ooyala embed pattern (closes #13727) 2017-07-30 01:30:04 +07:00
Sergey M․ 95908ce453
[extractor/generic] PEP 8 2017-07-29 15:13:12 +07:00
Philipp Hagemeister 24e966e8da [megaphone] Add extractor 2017-07-28 12:13:19 +02:00
Sergey M․ bb176df3bb
[spiegel:article] Move test 2017-07-17 22:19:40 +07:00
Sergey M․ 3f59b0154a
[nexx:embed] Add extractor for iframe embeds 2017-07-16 04:32:37 +07:00
Sergey M․ 4e826cd9ae
[nexx] Add extractor (closes #10807, closes #13465) 2017-07-15 07:02:57 +07:00
Robin Neatherway 2583c0b54e Fix bugs caused by typos 2017-07-14 23:08:32 +07:00
Sergey M․ 71a1db8919
[dailymail] Add support for embeds 2017-07-09 20:06:24 +07:00
Sergey M․ 73cf76a93f
[joj] Rewrite and add support for generic embeds (closes #13268) 2017-07-09 19:17:54 +07:00
Sergey M․ 4328ddf82b
[extractor/common] Add support for AMP tags in _parse_html5_media_entries 2017-07-09 16:29:52 +07:00
Sergey M․ 250b042c7e
[generic] Add tests for #13557 2017-07-09 16:02:38 +07:00
Sergey M․ 665e945246
[eagleplatform] Add support for referrer protected videos (closes #13557) 2017-07-09 15:57:58 +07:00
Sergey M․ b311b0ead2
[generic] Extract more generic metadata (closes #13527) 2017-06-30 21:42:04 +07:00
Sergey M․ fd1c5fba6b
[facebook] Add test for plugin video embed (#13493) 2017-06-27 22:38:59 +07:00
Sergey M․ 0646e34c7d
[facebook] Add support for plugin video embeds and multiple embeds (closes #13493) 2017-06-27 22:38:54 +07:00
Sergey M․ 4e3637034c
[extractor/generic] Ensure format id is unicode string 2017-06-10 23:56:20 +07:00
Sergey M․ 2b8e6a68f8
[extractor/generic] Add test for mediaset embed 2017-05-14 06:40:19 +07:00
Sergey M․ 5d29af3d15
[extractor/generic] Add support for mediaset embeds 2017-05-14 06:29:16 +07:00
Sergey M․ 946826eec7
[extractor/generic] Remove duplicate limelight code 2017-05-14 06:17:34 +07:00
Sergey M․ 58bb440283
[extractor/generic] Extract wistia embed code into separate method 2017-05-13 21:51:58 +07:00
Remita Amine 6c52477f59 [condenast] improve embed support 2017-05-12 16:37:09 +01:00
Sergey M․ 9edf47df7b
[brightcove] Allow whitespace around attribute names in embedded code 2017-05-01 01:03:47 +07:00
John Hawkinson 557194591a
[washingtonpost] Add support for embeds (closes #12699) 2017-04-29 23:07:26 +07:00
Sergey M․ 7986c3abcd
[anvato] Improve extraction (closes #12913)
* Promote to regular shortcut based extractor
* Add mcp to access key mapping table
* Add support for embeds extraction
* Add support for anvato embeds in generic extractor
2017-04-29 19:49:04 +07:00
Yen Chi Hsuan 06d0ad9a4e
[brightcove] Support URLs with bcpid instead of playerID
Fixes #12482
2017-04-18 23:04:22 +08:00
Sergey M․ ab87c26009
[extractor/generic] Add test for #12761 2017-04-17 00:36:59 +07:00
Sergey M․ e5d39886ec
[limelight] Improve embeds extraction (closes #12761)
* Move extraction code to extractor
* Add extraction for LimelightEmbeddedPlayerFlash embeds
* Extract multiple videos
2017-04-17 00:23:16 +07:00
Sergey M․ 413c1f8e2f
[extractor/generic] Extract RSS entries as url_transparent (#11163) 2017-04-16 00:57:32 +07:00
Sergey M․ 0254f93b08
[brightcove] Validate ambiguous embeds' URLs (closes #9163, closes #12005, closes #12178, closes #12480) 2017-04-15 20:31:49 +07:00
John Hawkinson 40158f55c9
[brightcove] Relax video tag embeds extraction
BrightcoveNewIE.extract_urls(): Handle player data when it is all in
attributes of the <video> tag, and also the id is
data-brightcove-video-id not data-video-id.

Add test to generic extractor, note HEAD gives 404, no checksum computed(?).
2017-04-15 20:31:43 +07:00
Remita Amine 0ee79a378a [generic] exclude urls with xml ext from valid video urls(closes #10768)(closes #11654) 2017-04-12 16:22:08 +01:00
John Hawkinson 61568e50cf [generic] Add support for <object> youtube embeds (closes #12637) 2017-04-11 00:04:32 +07:00
John Hawkinson a01825a541 [kaltura] Add support for iframe embeds 2017-04-08 16:40:57 +07:00
Remita Amine 3e943cfe09 [generic] pass base_url to _parse_jwplayer_data 2017-03-31 14:54:06 +01:00
Random User 4f06c1c9fc Merge branch 'master' of github.com-rndusr:rg3/youtube-dl into fix/str-item-assignment 2017-03-25 21:36:59 +01:00
Random User c73e330e7a _find_jwplayer_data() returns dict or None
This simplifies code for callers of `_find_jwplayer_data()` which no longer have
to run `_parse_json()` on the return value.

It also makes sure that `_find_jwplayer_data()` returns either a `dict` or
`None` and nothing else.
2017-03-25 19:38:30 +01:00
Random User 03486dbb01 Add test for JWPlayer where config is passed as variable 2017-03-25 19:37:45 +01:00
John Hawkinson e8e4cc5a6a
[generic] Replace LazyYT test with skiplagged
discourse.ubuntu.com has gone away, replace with skiplagged.com.
Be nice to have a non-frontpage URL that might be more stable,
though I don't have one. Maybe this should move to html
in test/test_InfoExtractor.py?
2017-03-25 19:53:32 +07:00
Sergey M․ 54b960f340
[generic] Do not follow redirects to the same URL 2017-03-24 00:45:24 +07:00
John Hawkinson 97952bdb78 [generic] Add test for Senate ISVP iframe embed 2017-03-22 01:12:14 +08:00
John Hawkinson 46b18f2349 [BostonGlobe] New. Nonstandard version of Brightcove.
Has a "data-brightcove-video-id" instead of a "data-video-id," otherwise
pretty much just Brightcove. Except the Globe isn't all Brightcove
videos, so fallback to Generic, too.

Also, abstract playlist_from_matches() from generic.py to common.py, and use
it here.

History of these changes can be found in
51170427d4b1143572a498dedaee61863a5b2c5b.
2017-03-19 20:40:31 +08:00
Sergey M․ a309684285
[extractor/generic] Add forgotten return for jwplayer formats 2017-03-16 03:28:01 +07:00
Sergey M․ c2ee861c6d
[extractor/generic] Make title optional for jwplayer embeds (closes #12410) 2017-03-10 23:16:53 +07:00
Sergey M․ b68a812ea8
[extractor/generic] Add test for brightcove UUID-like videoPlayer 2017-03-07 23:00:21 +07:00
Sergey M․ eb3079b6ce
[generic] Add support for rutube embeds 2017-03-04 00:46:33 +07:00
Yen Chi Hsuan fef51645d6
[theplatform] Recognize URLs with whitespaces (closes #12044) 2017-02-17 23:13:51 +08:00
Yen Chi Hsuan a4a554a793
[generic] Try parsing JWPlayer embedded videos (closes #12030) 2017-02-16 23:44:03 +08:00
Remita Amine 454e5cdb17 [limelight] add support for referer protected videos 2017-02-13 14:29:05 +01:00
Remita Amine 9dad941853 [disney] improve extraction
- add support for more urls
- detect expired videos
- skip Adobe Flash Access protected videos

closes #4975
closes #11000
closes #11882
closes #11936
2017-02-13 11:43:20 +01:00
Sergey M․ 5620f840f6
[extractor/generic] Add test for #11993 and more metadata for rtmp 2017-02-06 23:31:58 +07:00
Sergey M․ b7a8c1bcfa
[extractor/generic] Improve rtmp support (closes #11993) 2017-02-06 23:23:40 +07:00
Sergey M․ 6ef3e65a7b
[videopress] Add extractor 2017-02-05 13:37:27 +07:00
Sergey M․ 17f8deeb48
[extractor/generic] Add support for openload embeds (closes #11536, closes #11812) 2017-01-25 23:27:22 +07:00
Yen Chi Hsuan 972efe60c3
[generic] Remove a dead test
The web page does not contain a video anymore

Ref: #2694, #2696
2017-01-20 22:27:13 +08:00
Yen Chi Hsuan d77ac73790
[ustream] Add UstreamIE._extract_url()
Ref: #11547
2017-01-20 22:27:13 +08:00
Sergey M․ b687c85eab
[extractor/generic] Add support for 20 minuten embeds (closes #11683, closes #11751) 2017-01-18 22:17:47 +07:00
Yen Chi Hsuan 16e2c8f771
[brightcove] Recognize another player ID
Closes #11688
2017-01-16 00:06:52 +08:00
Remita Amine 9837cb7507 [ooyala] add support for videos with embedToken(#11684) 2017-01-13 23:06:59 +01:00
Yen Chi Hsuan ec85ded83c
Fix "invalid escape sequences" error on Python 3.6 2017-01-06 00:58:56 +08:00
Sergey M․ 027e231295
[tunein] Add support for embeds (closes #11579) 2017-01-03 01:45:59 +07:00
Sergey M․ e186a9ec03
[videa] Add support for videa embeds 2016-12-31 22:05:32 +07:00
Sergey M․ 53a664edf4
[brightcove:legacy] Improve embeds detection (closes #11523) 2016-12-24 22:46:27 +07:00
Remita Amine b1c357975d [piksel] Add new extractor(closes #11246) 2016-12-20 12:35:03 +01:00
Yen Chi Hsuan 8ab7e6c4cc
[kaltura] Improve widget ID extraction (closes #11480) 2016-12-20 18:45:52 +08:00
felix 8953319916 [screenwavemedia] Remove extractor
Rewrite TeamFourStar and Normalboots extractors in terms of JWPlatform
2016-11-28 23:17:56 +07:00
Sergey M․ 83f1481baa
[extractor/generic] Add support for webcaster.pro embeds 2016-11-27 19:56:32 +07:00
Sergey M․ 8f8f182d0b
[extractor/generic] Improve limelight embeds support 2016-11-20 02:13:21 +07:00
Sergey M․ b52c9ef165
[extractor/generic] Improve support for pornhub embeds (closes #11100) 2016-11-06 21:52:00 +07:00
Sergey M․ e28ed498e6
[extractor/generic] Add support for redtube embeds (closes #11099) 2016-11-06 21:42:41 +07:00
Sergey M․ 37e7a71c6c
[extractor/generic] Add support for drtuber embeds (closes #11098) 2016-11-06 21:33:51 +07:00
Sergey M․ 4119a96ce5
[extractor/generic] Skip URLs we came from when delegating ISM extraction 2016-11-02 23:43:41 +07:00
Sergey M․ 26aae56690
[extractor/generic] Improve ISM extraction 2016-11-02 23:34:37 +07:00
Sergey M․ cc99a77ac1
[extractor/generic] Add support for ISM manifests 2016-11-02 03:01:13 +07:00
Sergey M․ 773017c648
[rentv] Move rentv test from generic extractor and add only matching tests 2016-10-26 19:52:43 +07:00
Yen Chi Hsuan 9dcd6fd3aa
[generic,commonprotocols] Move mms support from GenericIE
And use _generic_* helpers in those extractors
2016-10-07 19:24:22 +08:00
Yen Chi Hsuan 3d83a1ae92
[generic] Support direct MMS links (closes #10838) 2016-10-07 17:50:45 +08:00
Sergey M․ 09b9c45e24
[generic] Add support for multiple vimeo embeds (Closes #10862) 2016-10-06 23:22:52 +07:00
Déstin Reed dcdb292fdd
Unify coding cookie 2016-10-03 23:44:29 +07:00
Yen Chi Hsuan bd26441205
[utils] Fix xattr error handling 2016-10-02 03:03:41 +08:00
Sergey M․ 86f4d14f81
Refactor fragments interface and dash segments downloader
- Eliminate segment_urls and initialization_url
+ Introduce manifest_url (manifest may contain unfragmented data in this case url will be used for direct media URL and manifest_url for manifest itself correspondingly)
* Rewrite dashsegments downloader to use fragments data
* Improve generic mpd extraction
2016-09-17 20:35:22 +07:00
Remita Amine d05ef09d9d [mangomolo] fix domain regex 2016-09-17 08:11:01 +01:00
Remita Amine 7d273a387a [mangomolo] add support for Mangomolo embeds 2016-09-16 19:31:39 +01:00
Sergey M․ a5ff05df1a
[extractor/generic] Add vimeo embed that requires Referer passed 2016-09-12 21:49:31 +07:00
Remita Amine 93b8404599 [generic,vodplatform] improve embed regex 2016-08-29 07:57:20 +01:00
Yen Chi Hsuan 39efc6e3e0 [generic] Update some _TESTS 2016-08-28 15:46:11 +08:00
Sergey M․ fd3ec986a4
[generic] Fix dbtv test (Closes #10364) 2016-08-18 21:35:41 +07:00
Déstin Reed b0c8f2e9c8
[DBTV:generic] Add support for embeds 2016-08-18 21:29:27 +07:00
Sergey M․ 2a1321a272
[vbox7:generic] Add support for vbox7 embeds 2016-08-17 01:02:59 +07:00
Sergey M․ 082395d0a0
[extractor/generic] Add proper default to _search_json_ld call 2016-08-08 22:48:33 +07:00
Sergey M․ 395c74615c
Revert "[extractor/generic] Make _search_json_ld non fatal"
This reverts commit 958849275f.
2016-08-08 21:49:27 +07:00
Sergey M․ 958849275f
[extractor/generic] Make _search_json_ld non fatal 2016-08-07 19:04:22 +07:00
Remita Amine 217d5ae013 [vodplatform] Add new extractor 2016-08-04 09:39:06 +01:00
Sergey M․ db19df6ca0
[extractor/generic] Add test for #10179 2016-07-28 22:20:08 +07:00
Sergey M․ 94aae01548
[extractor/generic] Extract all soundcloud embeds (Closes #10179) 2016-07-28 22:15:15 +07:00
Yen Chi Hsuan 326fa4e6e5
[generic] Skip an invalid test 2016-07-26 13:16:04 +08:00
Yen Chi Hsuan 05d1e7aaa9
[generic] Fix an MTV test and another test that breaks nosetests 2016-07-26 11:11:36 +08:00
Sergey M․ 1979969f91
[extractor/generic] Add support for arkena embeds 2016-07-23 17:56:48 +07:00
Sergey M․ 371ddb14fe
[extractor/generic] Change twitter:player embeds priority to lowest (Closes #10090) 2016-07-16 15:59:43 +07:00
Yen Chi Hsuan c03adf90bd
[generic] Add the test. Closes #1638 2016-07-09 14:39:01 +08:00
Sergey M․ 6e6b70d65f
[extractor/generic] Properly comment out a test 2016-07-09 08:37:19 +07:00
Sergey M․ 0de168f7ed
[extractor/generic] Detect schema.org/VideoObject embeds 2016-07-09 03:29:07 +07:00
Yen Chi Hsuan ad213a1d74
[francetv] Recognize more Dailymotion embedded videos
Closes #9955
2016-07-06 23:37:54 +08:00
Remita Amine 81953d1ae5 [kaltura] add support for videos stored on custom kaltura servers (closes #5557) 2016-07-04 17:59:58 +01:00
Yen Chi Hsuan dbf0157a26 [generic] Add MD5 checksums 2016-07-02 21:58:07 +08:00
Yen Chi Hsuan 7deef1ba67 [generic] Support Wordpress "YouTube Video Importer" plugin
Closes #9938
2016-07-02 21:58:07 +08:00
Yen Chi Hsuan fd6ca38262 [facebook] Improve Facebook embedded detection
Related to #9938.

Another example comes from 9834872bf6.
2016-07-02 21:58:07 +08:00
Remita Amine 7dbeee7e22 [generic] make twitter:player extraction non fatal 2016-06-30 14:11:55 +01:00
Remita Amine 329179073b [generic] add generic support for twitter:player embeds 2016-06-30 12:01:30 +01:00
Sergey M․ 06a96da15b
[eagleplatform] Improve embed detection and extract in separate routine (Closes #9926) 2016-06-29 23:01:34 +07:00
Sergey M․ c287f2bc60
[extractor/generic] Use _extract_url for kaltura embeds (Closes #9922) 2016-06-27 22:45:26 +07:00
Sergey M․ 427cd050a3
[extractor/generic] Improve kaltura embed detection (Closes #9911) 2016-06-27 04:11:53 +07:00
Yen Chi Hsuan 7d52c052ef
[generic] Fix test_Generic_76
Broken: https://travis-ci.org/rg3/youtube-dl/jobs/140251658
2016-06-26 11:56:27 +08:00
Yen Chi Hsuan c5edd147d1
[generic] Remove an invalid test
Now handled by telewebion.py
2016-06-11 18:39:58 +08:00
Yen Chi Hsuan 1fa309da40
[generic] Update test_Generic_40
The original link now redirects to a YouTube user channel.
2016-06-10 16:39:31 +08:00
Yen Chi Hsuan cc4444662c [generic] Remove Vulture embed detection
Vulture.com videos are now hosted on YouTube, Vimeo, MTV, NBC News or Hulu.
Here's an example of Hulu:
http://www.vulture.com/2016/06/kimmel-interviews-mariah-carey-in-a-bathtub.html
2016-06-10 13:40:57 +08:00
Yen Chi Hsuan de3eb07ed6
[generic] Detect NBC News embeds 2016-06-10 13:32:59 +08:00
Sergey M․ 48a5eabc48
[extractor/generic] Add support for vessel embeds (Closes #7083) 2016-06-09 04:02:27 +07:00
Yen Chi Hsuan 811586ebcf
[generic] Update the UDNEmbed test case 2016-06-01 19:23:44 +08:00
Yen Chi Hsuan 28bab13348
[generic,viewlift] Move a test case to the specialized extractor 2016-06-01 19:18:01 +08:00
Yen Chi Hsuan 877032314f
[generic] Improve Kaltura detection
Closes #4004
2016-06-01 18:37:34 +08:00
Yen Chi Hsuan 78d3b3e213
[generic] Improve Livestream detection (closes #2234) 2016-05-23 01:40:11 +08:00
Yen Chi Hsuan 4a12077855
[generic] Eliminate duplicated video URLs (closes #6562) 2016-05-22 22:23:20 +08:00
Sergey M․ 4d8819d249
[extractor/generic] Add support for theplatform embeds (Closes #8636, closes #9476) 2016-05-22 06:52:39 +06:00
Sergey M․ 6c114b1210
[extractor/generic] Remove generic id and title from wistia extraction and update tests 2016-05-20 21:55:35 +06:00
Sergey M․ 7ded6545ed
[extractor/generic] Add test for wistia standard embed 2016-05-20 21:43:36 +06:00
Sergey M․ aa5957ac49
[extractor/generic] Add support for async wistia embeds (Closes #9549) 2016-05-20 21:33:31 +06:00
Sergey M․ 6f41b2bcf1
[extractor/generic] Improve 3qsdn embeds support (Closes #9453) 2016-05-14 23:58:25 +06:00
Sergey M․ 5d39176f6d
[extractor/generic:3qsdn] Add support for embeds 2016-05-14 23:40:34 +06:00
remitamine 67167920db [viewlift] replace SnagFilms extractors
- add support for other sites that use the same logic
- improve format extraction and sorting
2016-04-29 11:24:10 +01:00
Yen Chi Hsuan 4645432d7a
[eagleplatform] Checking direct HTTP links
Sometimes they fail with 404
2016-04-25 22:48:17 +08:00
Yen Chi Hsuan 0738187f9b
[ThePlatform] Fix tests failed since 79ba9140dc 2016-04-24 20:46:06 +08:00
Yen Chi Hsuan 949b6497cc
[generic] Unescape the video URL
Fixes #9279
2016-04-24 16:25:37 +08:00
Sergey M․ 11e60fcad8
[extractor/generic] Improve instagram embeds (Closes #9213) 2016-04-16 22:39:20 +06:00
Yen Chi Hsuan ef0e4e7bc0
[generic] Fix test_Generic_2
Now a HEAD request returns 400 Bad Request
2016-04-16 19:44:45 +08:00
Kacper Michajłow b8f67449ec [generic] Add support for LiveLeak embeds 2016-04-13 01:54:19 +02:00
Yen Chi Hsuan 134c207e3f
[arte.tv:embed] Extended support (#2620) 2016-04-11 19:32:27 +08:00
remitamine f9b1529af8 [generic] remove sbnation test(handled by VoxMediaIE) 2016-03-31 23:50:45 +01:00
Sergey M․ 19dbaeece3 Remove _sort_formats from _extract_*_formats methods
Now _sort_formats should be called explicitly.
_sort_formats has been added to all the necessary places in code.

Closes #8051
2016-03-27 07:03:08 +06:00
Yen Chi Hsuan d10fe8358c [generic] Add a test case for brightcove embed
Closes #8862
2016-03-26 18:30:43 +08:00
Yen Chi Hsuan 5a51775a58 [generic] Extract Instagram embeds (#8817) 2016-03-24 16:32:27 +08:00