A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

Last update: Dec 31, 2022

Overview

🕳️ CygnusX1

Code by Trong-Dat Ngo.

Overviews

🕳️ CygnusX1 is a multithreaded tool 🛠️ , used to search and download images from popular search engines 🔎 . It is straightforward to set up and run!

Key features

🥰 No knowledge is required to get up and to run.
🚀 Download image using customizable number of threads.
⛏️ Crawl all possible images (search results and recommendations).

Installation

This repository is tested on Python 3.6+ and PyTorch selenium 3.141.0+, as well as it works fine on macOS, Windows, Linux.

You should setup and run 🕳️ CygnusX1 in a virtual environment. If you're unfamiliar with Python virtual environments, check out the user guide here.

First, create a virtual environment with the version of Python you're going to use and activate it. (Can be omitted if you want to set up directly on the OS environment)

source venv/bin/activate

Then download 🕳️ CygnusX1 from Github:

git clone https://github.com/dat821168/CygnusX1.git

Finally install dependencies in requirements.txt:

pip install -r requirements.txt

Run

Use run.py to start the script:

python run.py  --keywords "keyword 1, keyword 2" --workers 8 --use_suggestions --headless

Argument details:

--keywords: Indicate the keywords/keyphrases you want to search. For multiple keywords, separate them with commas.
--out_dir: Path where to save results. Default = './IMAGES'.
--workers: The maximum number of workers used to crawl image. Default = 2.
--use_suggestions: Crawl search engine suggestions/recommendations. Default = False.
--headless: Hide browser during scraping. Default = False.

A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

Related tags

Overview

🕳️ CygnusX1

Overviews

Key features

Installation

Run

Future Releases

References

Owner

DatNgo

Linkedin webscraping - Linkedin web scraping with python

淘宝、天猫半价抢购，抢电视、抢茅台，干死黄牛党

UdemyBot - A Simple Udemy Free Courses Scrapper

Web scrapping tool written in python3, using regex, to get CVEs, Source and URLs.

京东云无线宝积分推送，支持查看多设备积分使用情况

A simplistic scraper made to download tons of random screenshots made by people.

Free-Game-Scraper is a useful script that allows you to track down free games and DLCs on many platforms.

A Web Scraper built with beautiful soup, that fetches udemy course information. Get udemy course information and convert it to json, csv or xml file

Using Python and Pushshift.io to Track stocks on the WallStreetBets subreddit

A python module to parse the Open Graph Protocol

Anonymously scrapes onlinesim.ru for new usable phone numbers.

A low-code tool that generates python crawler code based on curl or url

Binance Smart Chain Contract Scraper + Contract Evaluator

A web scraping pipeline project that retrieves TV and movie data from two sources, then transforms and stores data in a MySQL database.

A dead simple crawler to get books information from Douban.

Web3 Pancakeswap Sniper bot written in python3

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

Python Web Scrapper Project

Dex-scrapper - Hobby project for scrapping dex data on VeChain

Rottentomatoes, Goodreads and IMDB sites crawler. Semantic Web final project.