iptvcrawl - A Scrapy project for crawling IPTV playlists

iptvcrawl

A Scrapy project for crawling IPTV playlists.

Dependencies

  • Python 3
  • Scrapy (pip install scrapy)

Usage

scrapy crawl ejatv

Output

The output playlist file is playlist.m3u. Note that this file is overwritten every time you run the spider.
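Because the playlist is overwritten on each run, you may want to keep a copy of the previous one first. The following is a minimal sketch, not part of this project: the helper name backup_playlist and the timestamped file name are assumptions made for illustration.

```python
import shutil
import time
from pathlib import Path


def backup_playlist(path="playlist.m3u"):
    """Copy an existing playlist to a timestamped file next to it.

    Hypothetical helper, not part of iptvcrawl. Returns the backup path,
    or None when there is no playlist to back up yet.
    """
    src = Path(path)
    if not src.exists():
        return None
    stamp = time.strftime("%Y%m%d-%H%M%S")
    # e.g. playlist.m3u -> playlist-20240101-120000.m3u
    dst = src.with_name(f"{src.stem}-{stamp}{src.suffix}")
    shutil.copy2(src, dst)
    return dst
```

Call backup_playlist() from the project directory before running scrapy crawl ejatv.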

Customize

You can customize the filter conditions by editing start_urls in ejatv.py.

For example, this URL:

https://eja.tv/?limit=0&country=jp&language=Chinese&category=&level=0&search=

selects channels from Japan whose language is Chinese, in any category.
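Instead of editing the query string by hand, the URL can be assembled from its parameters. This is a sketch using only the Python standard library; the function name build_start_url is hypothetical, and the parameter names are taken from the example URL above. It uses jp, the code listed for Japan in the Country list, and percent-encodes spaces as %20 to match the Language list.

```python
from urllib.parse import quote, urlencode

BASE_URL = "https://eja.tv/"


def build_start_url(country="", language="", category=""):
    """Build a filter URL; an empty value means "any" for that filter.

    Hypothetical helper, not part of iptvcrawl.
    """
    params = {
        "limit": 0,
        "country": country,
        "language": language,
        "category": category,
        "level": 0,
        "search": "",
    }
    # quote_via=quote encodes spaces as %20 rather than "+"
    return BASE_URL + "?" + urlencode(params, quote_via=quote)


print(build_start_url(country="jp", language="Chinese"))
```

The returned string can then be placed in the start_urls list in ejatv.py.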

The available parameter values are listed below, in the form URL value => display name:

Category

Animation => Animation
Auto => Auto
Business => Business
Classic => Classic
Comedy => Comedy
Cooking => Cooking
Culture => Culture
Documentary => Documentary
Education => Education
Entertainment => Entertainment
Family => Family
Fashion => Fashion
General => General
Kids => Kids
Legislative => Legislative
Lifestyle => Lifestyle
Local => Local
Movies => Movies
Music => Music
News => News
Outdoor => Outdoor
Relax => Relax
Religious => Religious
Science => Science
Series => Series
Shop => Shop
Sport => Sport
Sports => Sports
Travel => Travel
Weather => Weather
XXX => XXX
Youtube => Youtube
VOD => VOD

Language

Akan => Akan
Albanian => Albanian
Amharic => Amharic
Arabic => Arabic
Armenian => Armenian
Azerbaijani => Azerbaijani
Bosnian => Bosnian
Bulgarian => Bulgarian
Catalan => Catalan
Chinese => Chinese
Croatian => Croatian
Czech => Czech
Danish => Danish
Divehi => Divehi
Dutch => Dutch
English => English
Estonian => Estonian
Faroese => Faroese
Finnish => Finnish
French => French
Galician => Galician
Georgian => Georgian
German => German
Greek => Greek
Hebrew => Hebrew
Hindi => Hindi
Hungarian => Hungarian
Icelandic => Icelandic
Ignota => Ignota
Indonesian => Indonesian
Italian => Italian
Japanese => Japanese
Javanese => Javanese
Kannada => Kannada
Kazakh => Kazakh
Khmer => Khmer
Kinyarwanda => Kinyarwanda
Korean => Korean
Kurdish => Kurdish
Lao => Lao
Latvian => Latvian
Lithuanian => Lithuanian
Luxembourgish => Luxembourgish
Macedonian => Macedonian
Malay => Malay
Malayalam => Malayalam
Maltese => Maltese
Mandarin%20Chinese => Mandarin Chinese
Min%20Nan%20Chinese => Min Nan Chinese
Modern%20Greek => Modern Greek 
Montenegrin => Montenegrin
Music => Music
Māori => Māori
Norwegian%20Bokmål => Norwegian Bokmål
Persian => Persian
Polish => Polish
Portuguese => Portuguese
Punjabi => Punjabi
Pushto => Pushto
Romanian => Romanian
Russian => Russian
Serbian => Serbian
Sinhala => Sinhala
Slovak => Slovak
Slovenian => Slovenian
Somali => Somali
Spanish => Spanish
Sundanese => Sundanese
Swahili => Swahili
Swedish => Swedish
Tagalog => Tagalog
Tamil => Tamil
Telugu => Telugu
Thai => Thai
Turkish => Turkish
Ukrainian => Ukrainian
Urdu => Urdu
Uzbek => Uzbek
Vietnamese => Vietnamese
Western%20Frisian => Western Frisian
Yue%20Chinese => Yue Chinese

Country

af => Afghanistan
al => Albania
dz => Algeria
ar => Argentina
am => Armenia
aw => Aruba
au => Australia
at => Austria
az => Azerbaijan
bh => Bahrain
bb => Barbados
by => Belarus
be => Belgium
bo => Bolivia
ba => Bosnia and Herzegovina
br => Brazil
bg => Bulgaria
bf => Burkina Faso
kh => Cambodia
cm => Cameroon
ca => Canada
cl => Chile
cn => China
co => Colombia
cr => Costa Rica
hr => Croatia
cw => Curacao
cy => Cyprus
cz => Czechia
cd => Democratic Republic of the Congo
dk => Denmark
do => Dominican Republic
ec => Ecuador
eg => Egypt
sv => El Salvador
gq => Equatorial Guinea
ee => Estonia
et => Ethiopia
fo => Faroe Islands
fj => Fiji
fi => Finland
fr => France
ge => Georgia
de => Germany
gh => Ghana
gr => Greece
gt => Guatemala
gy => Guyana
ht => Haiti
hn => Honduras
hk => Hong Kong
hu => Hungary
is => Iceland
in => India
id => Indonesia
int => International
ir => Iran
iq => Iraq
ie => Ireland
il => Israel
it => Italy
jm => Jamaica
jp => Japan
jo => Jordan
kz => Kazakhstan
ke => Kenya
xk => Kosovo
kw => Kuwait
kg => Kyrgyzstan
la => Laos
lv => Latvia
lb => Lebanon
ly => Libya
li => Liechtenstein
lt => Lithuania
lu => Luxembourg
mo => Macao
my => Malaysia
mv => Maldives
mt => Malta
mx => Mexico
md => Moldova
mc => Monaco
me => Montenegro
ma => Morocco
mz => Mozambique
nl => Netherlands
an => Netherlands Antilles
nz => New Zealand
ni => Nicaragua
ng => Nigeria
mk => North Macedonia
no => Norway
om => Oman
pk => Pakistan
ps => Palestinian Territory
pa => Panama
py => Paraguay
pe => Peru
ph => Philippines
pl => Poland
pt => Portugal
pr => Puerto Rico
qa => Qatar
cg => Republic of the Congo
ro => Romania
ru => Russia
rw => Rwanda
sm => San Marino
sa => Saudi Arabia
sn => Senegal
rs => Serbia
sl => Sierra Leone
sg => Singapore
sk => Slovakia
si => Slovenia
so => Somalia
kr => South Korea
es => Spain
lk => Sri Lanka
sd => Sudan
se => Sweden
ch => Switzerland
sy => Syria
tw => Taiwan
tj => Tajikistan
tz => Tanzania
th => Thailand
tn => Tunisia
tr => Turkey
vi => U.S. Virgin Islands
ug => Uganda
ua => Ukraine
ae => United Arab Emirates
gb => United Kingdom
us => United States
uy => Uruguay
uz => Uzbekistan
va => Vatican
ve => Venezuela
vn => Vietnam
ye => Yemen
zm => Zambia

Owner

Zhijun