Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

Last update: Dec 17, 2022

Overview

Milano

(This is a research project, not an official NVIDIA product.)

Documentation

https://nvidia.github.io/Milano

Milano (Machine learning autotuner and network optimizer) is a tool for enabling machine learning researchers and practitioners to perform massive hyperparameters and architecture searches.

You can use it to:

Tune your model on a cloud backend of your choice
Benchmark Auto-ML algorithms (see how to add new search algorithm)

Your script can use any framework of your choice, for example, TensorFlow, PyTorch, Microsoft Cognitive Toolkit etc. or no framework at all. Milano only requires minimal changes to what your script accepts via command line and what it returns to stdout.

Currently supported backends:

Azkaban - on a single multi-GPU machine or server with Azkaban installed
AWS - Amazon cloud using GPU instances
SLURM - any cluster which is running SLURM

Prerequisites

Linux
Python 3
Ensure you have Python version 3.5 or later with packages listed in the requirements.txt file.
Backend with NVIDIA GPU

How to Get Started

Install all dependencies with the following command pip install -r requirements.txt.
Follow this mini-tutorial for local machine or this mini-tutorial for AWS

Visualize

We provide a script to convert the csv file output into two kinds of graphs:

Graphs of each hyperparameter with the benchmark (e.g. valid perplexity)
Color graphs that show the relationship between any two hyperparameters and the benchmark

To run the script, use:

python3 visualize.py --file [the name of the results csv file] 
                     --n [the number of samples to visualize]
                     --subplots [the number of subplots to show in a plot]
                     --max [the max value of benchmark you care about]

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

Related tags

Overview

Milano

Documentation

Prerequisites

How to Get Started

Visualize

Owner

NVIDIA Corporation

Code for our paper "MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction" published at ICCV 2021.

PyTorch evaluation code for Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

Official Implementation for Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations

TextureGAN in Pytorch

Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception, IROS 2021

DGCNN - Dynamic Graph CNN for Learning on Point Clouds

Gradient representations in ReLU networks as similarity functions

A modern pure-Python library for reading PDF files

Kaggle-titanic - A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

The first public PyTorch implementation of Attentive Recurrent Comparators

Official codes: Self-Supervised Learning by Estimating Twin Class Distribution

Pytorch implement of 'Unmixing based PAN guided fusion network for hyperspectral imagery'

Hyperbolic Image Segmentation, CVPR 2022

Hooks for VCOCO

Video Frame Interpolation with Transformer (CVPR2022)

This is our ARTS test set, an enriched test set to probe Aspect Robustness of ABSA.

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)