Every web site provides APIs.

Last update: Jan 05, 2023

Overview

Toapi

Overview

Toapi give you the ability to make every web site provides APIs.

Version v2.0.0, Completely rewrote.

More elegant. More pythonic

v1.0.0 Documentation: http://www.toapi.org
Awesome: https://github.com/toapi/awesome-toapi
Organization: https://github.com/toapi

Features

Automatic converting HTML web site to API service.
Automatic caching every page of source site.
Automatic caching every request.
Support merging multiple web sites into one API service.

Get Started

Installation

$ pip install toapi
$ toapi -v
toapi, version 2.0.0

Usage

create app.py and copy the code:

from flask import request
from htmlparsing import Attr, Text
from toapi import Api, Item

api = Api()


@api.site('https://news.ycombinator.com')
@api.list('.athing')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Post(Item):
    url = Attr('.storylink', 'href')
    title = Text('.storylink')


@api.site('https://news.ycombinator.com')
@api.route('/posts?page={page}', '/news?p={page}')
@api.route('/posts', '/news?p=1')
class Page(Item):
    next_page = Attr('.morelink', 'href')

    def clean_next_page(self, value):
        return api.convert_string('/' + value, '/news?p={page}', request.host_url.strip('/') + '/posts?page={page}')


api.run(debug=True, host='0.0.0.0', port=5000)

run python app.py

then open your browser and visit http://127.0.0.1:5000/posts?page=1

you will get the result like:

{
  "Page": {
    "next_page": "http://127.0.0.1:5000/posts?page=2"
  }, 
  "Post": [
    {
      "title": "Mathematicians Crack the Cursed Curve", 
      "url": "https://www.quantamagazine.org/mathematicians-crack-the-cursed-curve-20171207/"
    }, 
    {
      "title": "Stuffing a Tesla Drivetrain into a 1981 Honda Accord", 
      "url": "https://jalopnik.com/this-glorious-madman-stuffed-a-p85-tesla-drivetrain-int-1823461909"
    }
  ]
}

Todo

Visualization. Create toapi project in a web page by drag and drop.

Contributing

Write code and test code and pull request.

Every web site provides APIs.

Related tags

Overview

Toapi

Overview

Features

Get Started

Installation

Usage

Todo

Contributing

Owner

Jiuli Gao

Export your data from Xiami

Every web site provides APIs.

Fast and robust date extraction from web pages, with Python or on the command-line

News, full-text, and article metadata extraction in Python 3. Advanced docs:

a small library for extracting rich content from urls

Module for automatic summarization of text documents and HTML pages.

Web Content Retrieval for Humans™

Web-Extractor - Simple Tool To Extract IP-Adress From Website

fast python port of arc90's readability tool, updated to match latest readability.js!

Pythonic HTML Parsing for Humans™

Github Actions采集RSS, 打造无广告内容优质的头版头条超赞宝藏页

Zotero2Readwise - A Python Library to retrieve annotations and notes from Zotero and upload them to your Readwise

Open clone of OpenAI's unreleased WebText dataset scraper.

Brownant is a web data extracting framework.

Convert HTML to Markdown-formatted text.

RSS feed generator website with user friendly interface

Combine XPath, CSS Selectors and JSONPath for Web data extracting.