Scaling Vision with Sparse Mixture of Experts

This repository contains the code for training and fine-tuning Sparse MoE models for vision (V-MoE) on ImageNet-21k, reproducing the results presented in the paper:

Scaling Vision with Sparse Mixture of Experts, by Carlos Riquelme, Joan Puigcerver, Basil Mustafa, Maxim Neumann, Rodolphe Jenatton, André Susano Pinto, Daniel Keysers, and Neil Houlsby.

We will soon provide a Colab analysing one of the released models, as well as config files to train from scratch and to fine-tune checkpoints. Stay tuned.

Installation

Simply clone this repository.

The file requirements.txt lists the requirements that can be installed via PyPI. However, we recommend installing jax, flax and optax directly from GitHub, since we use some recent features that are not yet part of any release.
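As a quick sanity check after installation, the following minimal sketch reports which of the three libraries named above cannot be found in the current environment. The helper name `missing_packages` is hypothetical and not part of this repository; it relies only on the standard library's `importlib.util.find_spec`. It does not verify that the installed versions come from GitHub rather than from a release.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names that cannot be found.

    Hypothetical helper for illustration; not part of this repository.
    """
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Libraries that the text above recommends installing from GitHub.
    missing = missing_packages(["jax", "flax", "optax"])
    print("Missing packages:", missing or "none")
```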

In addition, you also have to clone the Vision Transformer repository, since we use some parts of it.

If you want to use RandAugment to train models (which we recommend if you train on ImageNet-21k or ILSVRC2012 from scratch), you must also clone the Cloud TPU repository, and name it cloud_tpu.

Checkpoints

We release checkpoints containing the weights of some models that we trained on ImageNet (either ILSVRC2012 or ImageNet-21k). Each checkpoint consists of an index file (with the .index extension) and one or more data files, called shards (with the extension .data-nnnnn-of-NNNNN). In the following list, we indicate only the prefix of each checkpoint. We recommend using gsutil to obtain the full list of files, download them, etc.

- V-MoE S/32, 8 experts on the last two odd blocks, trained from scratch on ILSVRC2012 with RandAugment: gs://vmoe_checkpoints/vmoe_s32_last2_ilsvrc2012_randaug_medium
- V-MoE B/16, 8 experts on every odd block, trained from scratch on ImageNet-21k with RandAugment: gs://vmoe_checkpoints/vmoe_b16_imagenet21k_randaug_strong
  - Fine-tuned on ILSVRC2012: gs://vmoe_checkpoints/vmoe_b16_imagenet21k_randaug_strong_ft_ilsvrc2012
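To illustrate how a checkpoint prefix relates to the files behind it, here is a minimal sketch that expands a prefix into the index file and shard names described above. The function name `checkpoint_files` is hypothetical, and the shard count passed in is an assumption made for the example: the actual number of shards per checkpoint is not stated here and should be read from a gsutil listing. The sketch also assumes that shards are numbered from zero with five-digit zero padding, following the .data-nnnnn-of-NNNNN pattern.

```python
def checkpoint_files(prefix, num_shards):
    """Expand a checkpoint prefix into its expected file names.

    Hypothetical helper for illustration. Assumes zero-based shard
    numbering with five-digit zero padding (.data-nnnnn-of-NNNNN).
    """
    if num_shards < 1:
        raise ValueError("A checkpoint needs at least one data shard.")
    # One index file per checkpoint.
    files = [f"{prefix}.index"]
    # One data file per shard.
    files += [
        f"{prefix}.data-{shard:05d}-of-{num_shards:05d}"
        for shard in range(num_shards)
    ]
    return files


if __name__ == "__main__":
    # The shard count (2) is made up for this example.
    prefix = "gs://vmoe_checkpoints/vmoe_s32_last2_ilsvrc2012_randaug_medium"
    for name in checkpoint_files(prefix, num_shards=2):
        print(name)
```

A listing produced this way is only a guess at what the bucket contains; the gsutil listing remains the authoritative source.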

Disclaimers

This is not an officially supported Google product.

Owner

Google Research
