dsignals

Utilities and information for the signals.numer.ai tournament

using eodhistoricaldata.com

eodhistoricaldata.com provides excellent historical price coverage for the signals universe. There are two main challenges with it:

Ticker mapping from bloomberg to eod tickers
Lack of coverage for Japan, Czech Republic and New Zealand

Building the ticker map

To build the mapping from bloomberg_ticker to eodhd, use:

python build_eodhd_map.py

This will retrieve:

live_universe (a small 40 KB file just listing the ~5,340 tickers in current round)
historical_targets (a large 150 MB file, and extract ~13,370 unique historical tickers)
the bloomberg to yahoo map courtesy of Liam @ numerai

And follow the conversion logic in the python code and manual overrides in db/eod-overrides.csv to build eodhd-map.csv in the following format:

bloomberg_ticker	yahoo	data_provider	signals_ticker
MONY LN	MONY.L	eodhd	MONY.LSE
ANIM3 BZ	ANIM3.SA	eodhd	ANIM3.SA
CAO US		eodhd	CAO.US
7013 JP	7013.T	yahoo	7013.T

Download quotes from the correct data_provider

First find EODHD_TOKEN = "put_your_token_here" in the download_quotes.py file and insert your eodhd api token. Then running:

python download_quotes.py

will download each quote from the appropriate source (eodhd or yahoo) saving each ticker to a separate pickle file under ./data/ticker_bin. As of October 2021, this results in 10,900+ ticker histories.

How you can help

Some amount of experimentation is needed with Korean tickers (KO vs KQ extension) to get better fills for ~50 tickers.
Bloomberg Singapore ticker prefixes are very different than the yahoo or eodhd tickers. We are extracting the live universe prefixes from numerai yahoo map, but historical Singapore tickers would need to be manually mapped if anyone is up for the challenge.
The rest of the tickers seem to work well -- all feedback and advice is appreciated.

Utilities and information for the signals.numer.ai tournament

Related tags

Overview

dsignals

using eodhistoricaldata.com

Building the ticker map

Download quotes from the correct data_provider

How you can help

Owner

Degerhan Usluel

Transferable Unrestricted Attacks, which won 1st place in CVPR’21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet.

The official implementation of CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing

Good Classification Measures and How to Find Them

Based on the given clinical dataset, Predict whether the patient having Heart Disease or Not having Heart Disease

[WWW 2022] Zero-Shot Stance Detection via Contrastive Learning

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Soft actor-critic is a deep reinforcement learning framework for training maximum entropy policies in continuous domains.

List of awesome things around semantic segmentation 🎉

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF From a Single Image

Towhee is a flexible machine learning framework currently focused on computing deep learning embeddings over unstructured data.

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment

Official implementation of UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Implementation of the famous Image Manipulation\Forgery Detector "ManTraNet" in Pytorch

Locally Differentially Private Distributed Deep Learning via Knowledge Distillation (LDP-DL)