Machine Learning in Asset Management (by @firmai)

Overview

Machine Learning in Asset Management

If you like this type of content then visit ML Quant site below:

https://www.ml-quant.com/


Part One

Follow this link for SSRN paper.

If you feel like citing something you can use:

Snow, D (2020). Machine Learning in Asset Management—Part 1: Portfolio Construction—Trading Strategies. The Journal of Financial Data Science, Winter 2020, 2 (1) 10-23.

This is the first in a series of articles dealing with machine learning in asset management. Asset management can be broken into the following tasks: (1) portfolio construction, (2) risk management, (3) capital management, (4) infrastructure and deployment, and (5) sales and marketing. This article focuses on portfolio construction using machine learning. Historically, algorithmic trading could be more narrowly defined as the automation of sell-side trade execution, but since the introduction of more advanced algorithms, the definition has grown to include idea generation, alpha factor design, asset allocation, position sizing, and the testing of strategies. Machine learning, from the vantage of a decision-making tool, can help in all these areas.

Editors: Frank J. Fabozzi | Marcos Lopéz de Prado | Joseph Simonian

This paper investigates various machine learning trading and portfolio optimisation models and techniques. The notebooks to this paper are Python based. By last count there are about 15 distinct trading varieties and around 100 trading strategies. Code and data are made available where appropriate. The hope is that this paper will organically grow with future developments in machine learning and data processing techniques. All feedback, contributions and criticisms are highly encouraged. You can find my contact details on the website, FirmAI.

Trading Strategies


1. Tiny CTA
Resources:
See this paper and blog for further explanation.
Data, Code


2. Tiny RL
Resources:
See this paper and/or blog for further explanation.
Data, Code


3. Tiny VIX CMF
Resources:
Data, Code


4. Quantamental
Resources:
Web-scrapers, Data, Code, Interactive Report, Paper.


5. Earnings Surprise
Resources:
Code, Paper


6. Bankruptcy Prediction
Resources:
Data, Code, Paper


7. Filing Outcomes
Resources:
Data


8. Credit Rating Arbitrage
Resources:
Code


9. Factor Investing:
Resources:
Paper, Code, Data


10. Systematic Global Macro
Resources:
Data, Code


11. Mixture Models
Resources:
Data, Code


12. Evolutionary
Resources:
Code, Repo


13. Agent Strategy
Resources:
Code, Repo


14. Stacked Trading
Resources:
Code, Blog


15. Deep Trading
Resources:
Code, Repo


Part Two:

Snow, D (2020). Machine Learning in Asset Management—Part 2: Portfolio Construction—Weight Optimization. The Journal of Financial Data Science, Spring 2020, 2 (1) 10-23.

This is the second in a series of articles dealing with machine learning in asset management. This article focuses on portfolio weighting using machine learning. Following from the previous article (Snow 2020), which looked at trading strategies, this article identifies different weight optimization methods for supervised, unsupervised, and reinforcement learning frameworks. In total, seven submethods are summarized with the code made available for further exploration.

Weight Optimisation (JFDS)


1. Deep Portfolio
Resources:
Data, Code, Paper


2. Linear Regression
Resources:
Code, Paper


3. Bayesian Sentiment
Resources:
Code


4. PCA and Hierarchical
Resource:
Code


5. HRP
Resources:
Data, Code


6. Network Graph
Resources:
Code


7. RL Deep Deterministic
Resources:
Code

Weight Optimisation (SSRN)


1. Online Portfolio Selection (OLPS)
Resources:
Code

Other (SSRN)


1. GANVaR
Resources:
Code


All Data and Code


Top 1% SSRN paper downloads

All Time Top 10 Paper :

Applied Computing eJournal, CompSciRN: Algorithms, CompSciRN: Clustering, Banking & Financial Institutions eJournals, CompSciRN: Artificial Intelligence, Econometric Modeling: Capital Markets - Portfolio Theory eJournal, Machine Learning eJournal

Other Projects

Other FirmAI projects include AtsPy automating Python's best time series models, PandaPy a data structure solutions that has the speed of NumPy and the usability of Pandas (10x to 50x faster), FairPut a holistic approach to implement fair machine learning outputs at the individual and group level, PandasVault a package for advanced pandas functions and code snippets, and ICR an interactive and fully automated corporate report built with Python.

Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"

PyMAF This repository contains the code for the following paper: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop Hongwe

Hongwen Zhang 450 Dec 28, 2022
Robustness via Cross-Domain Ensembles

Robustness via Cross-Domain Ensembles [ICCV 2021, Oral] This repository contains tools for training and evaluating: Pretrained models Demo code Traini

Visual Intelligence & Learning Lab, Swiss Federal Institute of Technology (EPFL) 27 Dec 23, 2022
Weight estimation in CT by multi atlas techniques

maweight A Python package for multi-atlas based weight estimation for CT images, including segmentation by registration, feature extraction and model

György Kovács 0 Dec 24, 2021
A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

Solving the Ising model on a 2D lattice using the Metropolis Algorithm Introduction The Ising model is a simplified model of ferromagnetism, the pheno

Rohit Prabhu 5 Nov 13, 2022
This is my research project for the Irving Center for Cancer Dynamics/Azizi Lab, Columbia University.

bayesian_uncertainty This is my research project for the Irving Center for Cancer Dynamics/Azizi Lab, Columbia University. In this project I build a s

Max David Gupta 1 Feb 13, 2022
Semi-supervised Domain Adaptation via Minimax Entropy

Semi-supervised Domain Adaptation via Minimax Entropy (ICCV 2019) Install pip install -r requirements.txt The code is written for Pytorch 0.4.0, but s

Vision and Learning Group 243 Jan 09, 2023
atmaCup #11 の Public 4th / Pricvate 5th Solution のリポジトリです。

#11 atmaCup 2021-07-09 ~ 2020-07-21 に行われた #11 [初心者歓迎! / 画像編] atmaCup のリポジトリです。結果は Public 4th / Private 5th でした。 フレームワークは PyTorch で、実装は pytorch-image-m

Tawara 12 Apr 07, 2022
Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

DHF1K =========================================================================== Wenguan Wang, J. Shen, M.-M Cheng and A. Borji, Revisiting Video Sal

Wenguan Wang 126 Dec 03, 2022
A Jinja extension (compatible with Flask and other frameworks) to compile and/or compress your assets.

A Jinja extension (compatible with Flask and other frameworks) to compile and/or compress your assets.

Jayson Reis 94 Nov 21, 2022
Online-compatible Unsupervised Non-resonant Anomaly Detection Repository

Online-compatible Unsupervised Non-resonant Anomaly Detection Repository Repository containing all scripts used in the studies of Online-compatible Un

0 Nov 09, 2021
Neural Network to colorize grayscale images

#colornet Neural Network to colorize grayscale images Results Grayscale Prediction Ground Truth Eiji K used colornet for anime colorization Sources Au

Pavel Hanchar 3.6k Dec 24, 2022
Aquarius - Enabling Fast, Scalable, Data-Driven Virtual Network Functions

Aquarius Aquarius - Enabling Fast, Scalable, Data-Driven Virtual Network Functions NOTE: We are currently going through the open-source process requir

Zhiyuan YAO 0 Jun 02, 2022
High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm

LA-MCTS The code is based of paper Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search. Component LA-MCTS has thr

Meta Research 18 Oct 24, 2022
IndoNLI: A Natural Language Inference Dataset for Indonesian

IndoNLI: A Natural Language Inference Dataset for Indonesian This is a repository for data and code accompanying our EMNLP 2021 paper "IndoNLI: A Natu

15 Feb 10, 2022
내가 보려고 정리한 <프로그래밍 기초 Ⅰ> / organized for me

Programming-Basics 프로그래밍 기초 Ⅰ 아카이브 Do it! 점프 투 파이썬 주차 강의주제 비고 1주차 Syllabus 2주차 자료형 - 숫자형 3주차 자료형 - 문자열형 4주차 입력과 출력 5주차 제어문 - 조건문 if 6주차 제어문 - 반복문 whil

KIMMINSEO 1 Mar 07, 2022
Modular Gaussian Processes

Modular Gaussian Processes for Transfer Learning 🧩 Introduction This repository contains the implementation of our paper Modular Gaussian Processes f

Pablo Moreno-Muñoz 10 Mar 15, 2022
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.

Population-Based Bandits (PB2) Code for the Population-Based Bandits (PB2) Algorithm, from the paper Provably Efficient Online Hyperparameter Optimiza

Jack Parker-Holder 22 Nov 16, 2022
How to Leverage Multimodal EHR Data for Better Medical Predictions?

How to Leverage Multimodal EHR Data for Better Medical Predictions? This repository contains the code of the paper: How to Leverage Multimodal EHR Dat

13 Dec 13, 2022
Arabic Car License Recognition. A solution to the kaggle competition Machathon 3.0.

Transformers Arabic licence plate recognition 🚗 Solution to the kaggle competition Machathon 3.0. Ranked in the top 6️⃣ at the final evaluation phase

Noran Hany 17 Dec 04, 2022
The code of "Dependency Learning for Legal Judgment Prediction with a Unified Text-to-Text Transformer".

Code data_preprocess.py: preprocess data for Dependent-T5. parameters.py: define parameters of Dependent-T5. train_tools.py: traning and evaluation co

1 Apr 21, 2022