https://arxiv.org/abs/2102.11005

Last update: Dec 19, 2022

LogME

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

How to use

Feed the features f (extracted by the pre-trained model) and the labels y to the function; it returns a score that correlates well with transfer learning performance.

from LogME import LogME
score = LogME(f, y)

You can then use the score to quickly select a good pre-trained model: the larger the score, the better the expected transfer performance.
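The selection step can be sketched as follows. This is only an illustration under stated assumptions, not part of the LogME package: `rank_models`, `select_best_model`, `features_by_model`, and `score_fn` are hypothetical names, and `score_fn` stands for any callable that takes features and labels and returns a single number.

```python
from typing import Any, Callable, Dict, List, Tuple


def rank_models(
    features_by_model: Dict[str, Any],
    y: Any,
    score_fn: Callable[[Any, Any], float],
) -> List[Tuple[str, float]]:
    """Score every candidate model and sort from highest to lowest score.

    features_by_model maps a (hypothetical) model name to the features that
    model produced on the target data; y holds the target labels.
    """
    scored = [(name, float(score_fn(f, y))) for name, f in features_by_model.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored


def select_best_model(
    features_by_model: Dict[str, Any],
    y: Any,
    score_fn: Callable[[Any, Any], float],
) -> str:
    """Return the name of the candidate with the largest score."""
    if not features_by_model:
        raise ValueError("need at least one candidate model")
    return rank_models(features_by_model, y, score_fn)[0][0]
```

Assuming the package is installed and the features have already been extracted, passing `LogME` as `score_fn` would rank the candidates without fine-tuning any of them.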

Experimental results

We extensively validate the generality and superior performance of LogME on 14 pre-trained models and 17 downstream tasks, covering various pre-trained models (supervised pre-trained and unsupervised pre-trained), downstream tasks (classification and regression), and modalities (vision and language). Check the paper for all the results.
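One way to check how well a set of scores tracks transfer performance on your own models is a rank correlation between the scores and the fine-tuned accuracies. The sketch below uses plain Kendall's tau as an assumed example; this README does not say which correlation measure the paper reports, and the function name and arguments are hypothetical.

```python
from itertools import combinations
from typing import Sequence


def kendall_tau(scores: Sequence[float], accuracies: Sequence[float]) -> float:
    """Plain Kendall's tau-a: (concordant - discordant) / number of pairs.

    A value near 1 means models are ranked the same way by the score and by
    the measured accuracy; a value near -1 means the rankings are reversed.
    Tied pairs count as neither concordant nor discordant.
    """
    if len(scores) != len(accuracies):
        raise ValueError("scores and accuracies must have the same length")
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two models to compare rankings")

    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        product = (scores[i] - scores[j]) * (accuracies[i] - accuracies[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1

    return (concordant - discordant) / (n * (n - 1) / 2)
```

Here `scores` would hold one score per pre-trained model and `accuracies` the corresponding fine-tuned results, in the same order.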

Computer vision

9 datasets and 10 pre-trained models. LogME is a reasonably good indicator for transfer performance.

NLP

7 tasks and 4 pre-trained models. LogME is a good indicator for transfer performance.

Speedup

LogME provides a dramatic speedup for assessing pre-trained models. The speedup comes from two sources:

LogME does not need hyper-parameter tuning, whereas vanilla fine-tuning requires extensive hyper-parameter tuning.
We designed a fast algorithm to further speed up the computation of LogME.

Citation

If you find LogME useful, please cite the following paper:

@article{you_logme:_2021,
	title = {LogME: Practical Assessment of Pre-trained Models for Transfer Learning},
	author = {You, Kaichao and Liu, Yong and Long, Mingsheng and Wang, Jianmin},
	journal = {arxiv},
	volume = {abs/2102.11005},
	year = {2021},
	url = {https://arxiv.org/abs/2102.11005},
}

Contact

If you have any questions or want to use the code, please contact [email protected].


Owner

THUML: Machine Learning Group @ THSS
