Audio media crawler for lbry.

Last update: Dec 03, 2022

Overview

Audio media crawler for lbry.

Requirements

Python 3.8
Poetry 1.1.7
Elasticsearch 7.14.0
Lbry-sdk 0.99.0

Development

This project uses Poetry for dependency management.

Install dependencies

Install all dependencies defined for the project. For more information, see the Poetry documentation.

poetry install

Tasks

Update hooks

Set up and update the pre-commit hooks. Run this once, right after the first poetry install.

poetry run task update-hooks

Format code

Format the code with Black. For more information, see the Black documentation.

poetry run task format

Commands

Basic usage

All commands are run through Poetry. For more information, see the Poetry documentation.

poetry run podcatcher <command>

Sync

Scan all audio streams to find music and podcast episodes, keeping Elasticsearch in sync.

poetry run podcatcher sync

Retry sync

Retry a failed sync from the last checkpoint. If no previous sync failed, this runs a normal sync.

poetry run podcatcher retry-sync
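
As a rough illustration of the checkpoint idea, the sketch below resumes a batch job from the last saved position and falls back to a full run when no checkpoint exists. The `checkpoint.json` layout, the `process` callback, and the `run_sync` and `load_checkpoint` helpers are hypothetical names chosen for this example; they are not taken from podcatcher's code, which may store its checkpoint differently.

```python
import json
from pathlib import Path
from typing import Callable, Sequence


def load_checkpoint(path: Path) -> int:
    """Return the index to resume from, or 0 when no checkpoint exists."""
    if not path.exists():
        return 0
    return int(json.loads(path.read_text())["next_index"])


def run_sync(
    items: Sequence[str],
    process: Callable[[str], None],
    checkpoint: Path,
) -> int:
    """Process items starting at the last checkpoint; return how many were processed."""
    start = load_checkpoint(checkpoint)
    done = 0
    for index in range(start, len(items)):
        # Record the position before processing, so a failure resumes at this item.
        checkpoint.write_text(json.dumps({"next_index": index}))
        process(items[index])
        done += 1
    # A completed run leaves no checkpoint, so the next retry is a normal sync.
    checkpoint.unlink(missing_ok=True)
    return done
```

If `process` raises partway through, the checkpoint file stays on disk and the next call to `run_sync` picks up at the item that failed.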

Cache sync

Skip the scan and sync existing cache data to Elasticsearch.

poetry run podcatcher cache-sync

Clear cache

Remove all files in the cache directory.

poetry run podcatcher clear-cache
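
For readers who want to see what clearing a cache directory amounts to, here is a minimal sketch using only the standard library. The `clear_cache` function and the directory passed to it are assumptions made for this example; podcatcher's actual cache location and cleanup logic are not described in this README.

```python
from pathlib import Path


def clear_cache(cache_dir: Path) -> int:
    """Delete every regular file under cache_dir; return the number removed."""
    removed = 0
    if not cache_dir.is_dir():
        # Nothing to clear when the directory does not exist.
        return removed
    # Materialize the listing first so deletion does not disturb iteration.
    for entry in list(cache_dir.rglob("*")):
        if entry.is_file():
            entry.unlink()
            removed += 1
    return removed
```

This version leaves empty subdirectories in place and only removes files, which matches the wording of the command description above.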

Drop

Remove all indices from Elasticsearch and all files from the cache directory.

poetry run podcatcher drop

Owner

Hound.fm
