h1stats - h1 Program Stats Scraper

This python3 script will call out to HackerOne's graphql API and scrape all currently active programs for information and stats on every h1 program. All programs and their stats get tabulated into a generated CSV file. From here you can compare and contrast all program stats to pick high fidelity targets. Furthermore, you can supply your h1 session cookie to the script to also compile in all private programs to the CSV.

Data Collected:

Program Name
Program URL
Program Type (Public or Private)
Clear Program (Yes/No)
Offers Bounties (Yes/No)
Max Critical (USD)
Max High (USD)
Max Medium (USD)
Max Low (USD)
Average Bounty Max (USD)
Average Bounty Min (USD)
Top Bounty Max (USD)
Top Bounty Min (USD)
Resolved Reports
Reports Received in 90 Days
Total Bounties Paid (USD)
Total Bounties Paid in 90 Days (USD)
Avg Time to First Response (Hours)
Avg Time to Triage (Hours)
Avg Time to Bounty (Hours)
Avg Time to Resolution (Hours)
Progam Age (Months)
Days Since Last Report

Usage

normal usage (public programs): python3 h1stats

authenticated usage (public and private programs): python3 h1stats [<Your HackerOne __Host-session Token>]

WARNING (Authenticated Usage)

THIS SCRIPT HANDLES YOUR H1 SESSION TOKEN WHICH CONTAINS YOUR HACKERONE PRIVATE DATA AND THE PRIVATE DATA OF YOUR HACKERONE PROGRAMS. BECAREFUL WHEN HANDLING THIS TOKEN. THE AUTHORS ARE NOT LIABLE FOR ANY MISUSE OF THIS SCRIPT OR YOUR HACKERONE SESSION TOKEN. PLEASE USE AT YOUR OWN RISK. DO NOT PUBLISH ANY CSVs WITH HACKERONE PRIVATE PROGRAM DATA.

For authenticated usage It is suggested that you assign your token into a variable once using export and pushing the env variable into the script's argument list (as shown in the examples).

Examples

Normal Flow (Public Only):

bash> python3 h1stats
  _     _ ____  _        _
 | |__ / / ___|| |_ __ _| |_ ___
 | '_ \| \___ \| __/ _` | __/ __|
 | | | | |___) | || (_| | |_\__ \
 |_| |_|_|____/ \__\__,_|\__|___/

                      defparam

[+] No session cookie specified
[+] Collecting public data...
[+] Please wait... (this may take several minutes)
[+] Collecting... (350 programs)
[+] Wrote all data to: h1stats-2021-4-24.csv
[+] Done!

Authenticated Flow (Public and Private):

bash> export H1CRED="JGH92kd9...b5e" # HackerOne session cookie
bash> python3 h1stats $H1CRED
  _     _ ____  _        _
 | |__ / / ___|| |_ __ _| |_ ___
 | '_ \| \___ \| __/ _` | __/ __|
 | | | | |___) | || (_| | |_\__ \
 |_| |_|_|____/ \__\__,_|\__|___/

                      defparam

[+] Using specified session cookie
[+] Collecting public and private data...
[+] Please wait... (this may take several minutes)
[+] Collecting... (400 programs)
[+] Wrote all data to: h1stats-PRIVATE-2021-4-24.csv
[+] Warning: this data contains private information under NDA, do not publish!
[+] Done!

a tool that compiles a csv of all h1 program stats

Related tags

Overview

h1stats - h1 Program Stats Scraper

Usage

WARNING (Authenticated Usage)

Examples

Owner

Evan

A data analysis using python and pandas to showcase trends in school performance.

:truck: Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Generate lookml for views from dbt models

INFO-H515 - Big Data Scalable Analytics

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Feature Detection Based Template Matching

Port of dplyr and other related R packages in python, using pipda.

Data cleaning tools for Business analysis

Data imputations library to preprocess datasets with missing data

Basis Set Format Converter

We're Team Arson and we're using the power of predictive modeling to combat wildfires.

Full automated data pipeline using docker images

.npy, .npz, .mtx converter.

Predictive Modeling & Analytics on Home Equity Line of Credit

Randomisation-based inference in Python based on data resampling and permutation.

Using Data Science with Machine Learning techniques (ETL pipeline and ML pipeline) to classify received messages after disasters.

Wafer Fault Detection - Wafer circleci with python

Retail-Sim is python package to easily create synthetic dataset of retaile store.

Supply a wrapper ``StockDataFrame`` based on the ``pandas.DataFrame`` with inline stock statistics/indicators support.

An Aspiring Drop-In Replacement for NumPy at Scale