Reproduces ResNet-V3 with pytorch

Last update: Dec 23, 2022

Overview

ResNeXt.pytorch

Reproduces ResNet-V3 (Aggregated Residual Transformations for Deep Neural Networks) with pytorch.

Download

git clone https://github.com/prlz77/resnext.pytorch
cd resnext.pytorch
# git checkout R4.0 or R3.0 for backwards compatibility (not recommended).

Usage

To train on Cifar-10 using 2 gpu:

python train.py ~/DATASETS/cifar.python cifar10 -s ./snapshots --log ./logs --ngpu 2 --learning_rate 0.05 -b 128

It should reach ~3.65% on Cifar-10, and ~17.77% on Cifar-100.

After train phase, you can check saved model.

Thanks to @AppleHolic we have now a test script:

To test on Cifar-10 using 2 gpu:

python test.py ~/DATASETS/cifar.python cifar10 --ngpu 2 --load ./snapshots/model.pytorch --test_bs 128

Configurations

From the original paper:

cardinality	base_width	parameters	Error cifar10	error cifar100	default
8	64	34.4M	3.65	17.77	x
16	64	68.1M	3.58	17.31

Update: widen_factor has been disentangled from base_width because it was confusing. Now widen factor is set to consant 4, and base_width is the same as in the original paper.

Trained models and curves

Link to trained models corresponding to the following curves:

Update: several commits have been pushed after training the models in Mega, so it is recommended to revert to e10c37d8cf7a958048bc0f58cd86c3e8ac4e707d

Other frameworks

torch (@facebookresearch). (Original) Cifar and Imagenet
caffe (@terrychenism). Imagenet
MXNet (@dmlc). Imagenet

Cite

@article{xie2016aggregated,
  title={Aggregated residual transformations for deep neural networks},
  author={Xie, Saining and Girshick, Ross and Doll{\'a}r, Piotr and Tu, Zhuowen and He, Kaiming},
  journal={arXiv preprint arXiv:1611.05431},
  year={2016}
}

Reproduces ResNet-V3 with pytorch

Related tags

Overview

ResNeXt.pytorch

Download

Usage

Configurations

Trained models and curves

Other frameworks

Cite

Owner

Pau Rodriguez

Official codebase for Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

[WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"

This is a code repository for the paper "Graph Auto-Encoders for Financial Clustering".

You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling

This project deals with the detection of skin lesions within the ISICs dataset using YOLOv3 Object Detection with Darknet.

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

Official Pytorch implementation of Meta Internal Learning

Anchor-free Oriented Proposal Generator for Object Detection

The code release of paper Low-Light Image Enhancement with Normalizing Flow

Code for "Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods."

Composing methods for ML training efficiency

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

Official Repository for the paper "Improving Baselines in the Wild".

A Tensorflow implementation of BicycleGAN.

Python package to add text to images, textures and different backgrounds

Implementation for the paper: Invertible Denoising Network: A Light Solution for Real Noise Removal (CVPR2021).

Create animations for the optimization trajectory of neural nets

Code for models used in Bashiri et al., "A Flow-based latent state generative model of neural population responses to natural images".

SAGE: Sensitivity-guided Adaptive Learning Rate for Transformers