EfficientNetV2-with-TPU

EfficientNet

EfficientNetV2 is a family of convolutional neural networks that trains faster and is more parameter-efficient than earlier models. To develop these models, the authors combine training-aware neural architecture search with scaling to jointly optimize training speed and parameter efficiency. The models were searched from a search space enriched with new operations such as Fused-MBConv.

Architecturally, the main differences are:

EfficientNetV2 makes extensive use of both MBConv and the newly added Fused-MBConv in the early layers.
EfficientNetV2 prefers smaller expansion ratios for MBConv, because smaller expansion ratios tend to have less memory access overhead.
EfficientNetV2 prefers smaller 3x3 kernels, but adds more layers to compensate for the reduced receptive field that results from the smaller kernel size.
EfficientNetV2 completely removes the last stride-1 stage of the original EfficientNet, perhaps because of its large parameter size and memory access overhead.
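
The first three differences above can be made concrete with some rough arithmetic. The sketch below counts only convolution weights and assumes the block keeps the same number of input and output channels; squeeze-and-excitation, batch normalization and biases are ignored. The function names and the example values (24 channels, expansion ratio 4) are illustrative assumptions, not taken from this repository.

```python
def mbconv_weights(channels, expansion, kernel=3):
    """Approximate conv weights in an MBConv block (SE, BN, biases ignored)."""
    expanded = channels * expansion
    expand_1x1 = channels * expanded          # 1x1 expansion conv
    depthwise = kernel * kernel * expanded    # depthwise kxk conv
    project_1x1 = expanded * channels         # 1x1 projection conv
    return expand_1x1 + depthwise + project_1x1


def fused_mbconv_weights(channels, expansion, kernel=3):
    """Approximate conv weights in a Fused-MBConv block.

    The 1x1 expansion conv and the depthwise conv are replaced
    by a single regular kxk conv.
    """
    expanded = channels * expansion
    fused_kxk = kernel * kernel * channels * expanded  # regular kxk conv
    project_1x1 = expanded * channels                  # 1x1 projection conv
    return fused_kxk + project_1x1


def stacked_receptive_field(kernel, num_layers):
    """Receptive field of num_layers stacked stride-1 kxk convolutions."""
    return 1 + num_layers * (kernel - 1)


if __name__ == "__main__":
    # Illustrative values only: 24 channels, expansion ratio 4.
    print("MBConv weights:      ", mbconv_weights(24, 4))
    print("Fused-MBConv weights:", fused_mbconv_weights(24, 4))
    # A smaller expansion ratio shrinks the expanded activations and weights.
    print("MBConv, expansion 6: ", mbconv_weights(24, 6))
    # Two stacked 3x3 layers cover the same area as one 5x5 layer.
    print("Two 3x3 layers:", stacked_receptive_field(3, 2))
    print("One 5x5 layer: ", stacked_receptive_field(5, 1))
```

Under these assumptions Fused-MBConv carries more weights than MBConv at the same width, which fits with it being used only in the early layers, where channel counts are still small.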

Note

Model	Input size	Val acc (%)	Top-5 acc (%)	Test acc (%)	Pretrained weights
EfficientNetV2B0	224	90.68	99.76	89.86	imagenet
EfficientNetV2B1	240	90.76	99.78	90.07	imagenet
EfficientNetV2B2	260	87.08	99.48	86.85	imagenet
EfficientNetV2B3	300	90.38	99.80	89.29	imagenet
EfficientNetV2T	320	92.80	99.86	92.53	imagenet
EfficientNetV2S	384	89.94	99.74	89.27	imagenet
EfficientNetV2M	480	91.86	99.70	90.53	imagenet
EfficientNetV2L	480	93.10	99.80	92.38	imagenet
EfficientNetV2XL	512	93.24	99.72	93.41	imagenet21K-ft1k

Train: 90% (45,000 images)
Validation: 10% (5,000 images)
Test: 10,000 images
Epochs = 25
WeightDecay = 1e-5
Batch size = 16 * 8 (strategy.num_replicas_in_sync)
Optimizer: AdaBelief with a learning rate scheduler (Triangular2CyclicalLearningRate) and Rectified = True (to prevent overshoot)
Resizing CIFAR-10 images is not recommended; I resized them only to see how well EfficientNetV2 learns CIFAR-10.
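
As a rough illustration of the training settings above, the following sketch works out the global batch size and reimplements the triangular2 cyclical learning rate rule in plain Python. It is not the code used in this repository: the learning rate bounds (`INITIAL_LR`, `MAXIMAL_LR`) and the cycle half-length (`STEP_SIZE`) are hypothetical values chosen for the example, and the steps-per-epoch figure assumes the final partial batch is dropped.

```python
import math

TRAIN_IMAGES = 45_000      # 90% of the CIFAR-10 training set
PER_REPLICA_BATCH = 16
NUM_REPLICAS = 8           # strategy.num_replicas_in_sync in the notes above
GLOBAL_BATCH = PER_REPLICA_BATCH * NUM_REPLICAS

# Assumption: the last partial batch is dropped.
STEPS_PER_EPOCH = TRAIN_IMAGES // GLOBAL_BATCH

# Hypothetical schedule settings, not taken from the repository.
INITIAL_LR = 1e-4
MAXIMAL_LR = 1e-2
STEP_SIZE = 2 * STEPS_PER_EPOCH   # half a cycle, in optimizer steps


def triangular2_lr(step, initial_lr, maximal_lr, step_size):
    """Triangular2 cyclical learning rate.

    The rate ramps linearly from initial_lr to a peak and back over
    2 * step_size steps; the peak's height above initial_lr is halved
    after every full cycle.
    """
    cycle = math.floor(1 + step / (2 * step_size))
    x = abs(step / step_size - 2 * cycle + 1)
    scale = 1 / (2 ** (cycle - 1))
    return initial_lr + (maximal_lr - initial_lr) * max(0.0, 1 - x) * scale


if __name__ == "__main__":
    print("global batch size:", GLOBAL_BATCH)
    print("steps per epoch:  ", STEPS_PER_EPOCH)
    for step in (0, STEP_SIZE, 2 * STEP_SIZE, 3 * STEP_SIZE):
        lr = triangular2_lr(step, INITIAL_LR, MAXIMAL_LR, STEP_SIZE)
        print(f"step {step:5d}: lr = {lr:.6f}")
```

Halving the peak each cycle lets early cycles explore with large steps while later cycles settle with smaller ones.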

EfficientNetV2-with-TPU - Cifar-10 case study

Owner

Sultan syach
