Scrap the 42 Intranet's elearning videos in a single click

Last update: Oct 27, 2022

Related tags

Web Crawling 42intra_scraper

Overview

42intra_scraper

Scrap the 42 Intranet's elearning videos in a single click.

Why you would want to use it ?

Adjust speed at your convenience. (The intra doesn't allow this)
Working in a remote location where internet is hit or miss ? Download what you need and you'll have it in your computer.
Have a friend that is freeze and can't access the intra's resources ? You can download the videos, compress them and send them via drive.

How to use it:

git clone [email protected]:Dovalich/42intra_scraper.git

pip3 install -r requirements.txt

python3 intra_scraper.py

And then all you have to do is follow the instructions that the program gives you, that is:

enter your 42 intranet username
enter your 42 intranet password
enter the elearning link you want to scrap for example https://elearning.intra.42.fr/tags/38/notions

Here's a short Tutorial gif:

How does it work ?

It's fairly simple.

The program makes a post request to the intranet using your logins (via the requests module).
Once logged-in, it recursively searches for any links that are in the middle of the page (the ones that contain videos).
Once it finds a video link, it downloads it based on the video quality you chose (SD or HD).

Note

As you can see in the code I don't store your user name and password. In fact I only use them once to login. But be careful when using these types of scripts. You should always read the source code before giving away sensitive information.

If you have feedback on the code please let me know! 👨‍🎓

And feel free to use it however you want.

Scrap the 42 Intranet's elearning videos in a single click

Related tags

Overview

42intra_scraper

Why you would want to use it ?

How to use it:

How does it work ?

Note

Owner

Noufel

此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

Pseudo API for Google Trends

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

Poolbooru gelscraper - a simple python script for scraping images off gelbooru pools.

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

自动完成每日体温上报（Github Actions）

Html Content / Article Extractor, web scrapping lib in Python

tweet random sand cat pictures

Proxy scraper. Format: IP | PORT | COUNTRY | TYPE

Lovely Scrapper

Scrape Twitter for Tweets

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

PS5 bot to find a console in france for chrismas 🎄🎅🏻 NOT FOR SCALPERS

联通手机营业厅自动做任务、签到、领流量、领积分等。

An automated, headless YouTube Watcher and Scraper

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

A Python package that scrapes Google News article data while remaining undetected by Google.

Discord webhook spammer with proxy support and proxy scraper

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸每日一句 + 毒鸡汤（从2月份稳定运行至今）

Scrap the 42 Intranet's elearning videos in a single click

Related tags

Overview

42intra_scraper

Why you would want to use it ?

How to use it:

How does it work ?

Note

Owner

Noufel

此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

Pseudo API for Google Trends

This is a simple website crawler which asks for a website link from the user to crawl and find specific data from the given website address.

Poolbooru gelscraper - a simple python script for scraping images off gelbooru pools.

Web-scraping - Program that scrapes a website for a collection of quotes, picks one at random and displays it

自动完成每日体温上报（Github Actions）

Html Content / Article Extractor, web scrapping lib in Python

tweet random sand cat pictures

Proxy scraper. Format: IP | PORT | COUNTRY | TYPE

Lovely Scrapper

Scrape Twitter for Tweets

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

PS5 bot to find a console in france for chrismas 🎄🎅🏻 NOT FOR SCALPERS

联通手机营业厅自动做任务、签到、领流量、领积分等。

An automated, headless YouTube Watcher and Scraper

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

A Python package that scrapes Google News article data while remaining undetected by Google.

Discord webhook spammer with proxy support and proxy scraper

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸 每日一句 + 毒鸡汤（从2月份稳定运行至今）

python+selenium实现的web端自动打卡 + 每日邮件发送 + 金山词霸每日一句 + 毒鸡汤（从2月份稳定运行至今）