A curated list of awesome Active Learning

Overview

Awesome Active Learning Awesome

๐Ÿคฉ A curated list of awesome Active Learning ! ๐Ÿคฉ

Background

(image source: Settles, Burr)

What is Active Learning?

Active learning is a special case of machine learning in which a learning algorithm can interactively query a oracle (or some other information source) to label new data points with the desired outputs.

(image source: Settles, Burr)

There are situations in which unlabeled data is abundant but manual labeling is expensive. In such a scenario, learning algorithms can actively query the oracle for labels. This type of iterative supervised learning is called active learning. Since the learner chooses the examples, the number of examples to learn a concept can often be much lower than the number required in normal supervised learning. With this approach, there is a risk that the algorithm is overwhelmed by uninformative examples. Recent developments are dedicated to multi-label active learning, hybrid active learning and active learning in a single-pass (on-line) context, combining concepts from the field of machine learning (e.g. conflict and ignorance) with adaptive, incremental learning policies in the field of online machine learning.

(source: Wikipedia)

Contributing

If you find the awesome paper/code/book/tutorial or have some suggestions, please feel free to pull requests or contact [email protected] to add papers using the following Markdown format:

Year | Paper Name | Conference | [Paper](link) | [Code](link) | Tags | Notes |

Thanks for your valuable contribution to the research community. ๐Ÿ˜ƒ

Table of Contents

Books

Surveys

Papers

Tags

Sur.: survey | Cri.: critics | Pool.: pool-based sampling | Str.: stream-based sampling | Syn.: membership query synthesize | Meta.: meta learning | SSL.: semi-supervised learning | RL.: reinforcement learning | FS.: few-shot learning | SS.: self-supervised |

Before 2017

Year Title Conf Paper Code Tags Notes
1994 Improving Generalization with Active Learning Machine Learning paper
2007 Discriminative Batch Mode Active Learning NIPS paper
2008 Active Learning with Direct Query Construction KDD paper
2008 An Analysis of Active Learning Strategies for Sequence Labeling Tasks EMNLP paper
2008 Hierarchical Sampling for Active Learning ICML paper
2010 Active Instance Sampling via Matrix Partition NIPS paper
2011 Ask Me Better Questions: Active Learning Queries Based on Rule Induction KDD paper
2011 Active Learning from Crowds ICML paper
2011 Bayesian Active Learning for Classification and Preference Learning CoRR paper
2011 Active Learning Using On-line Algorithms KDD paper
2012 Bayesian Optimal Active Search and Surveying ICML paper
2012 Batch Active Learning via Coordinated Matching ICML paper
2013 Active Learning for Multi-Objective Optimization ICML paper
2013 Active Learning for Probabilistic Hypotheses Usingthe Maximum Gibbs Error Criterion NIPS paper
2014 Active Semi-Supervised Learning Using Sampling Theory for Graph Signals KDD paper
2014 Beyond Disagreement-based Agnostic Active Learning NIPS paper
2016 Cost-Effective Active Learning for Deep Image Classification TCSVT paper
2016 Active Image Segmentation Propagation CVPR paper

2017

Title Conf Paper Code Tags Notes
Active Decision Boundary Annotation with Deep Generative Models ICCV paper
Active One-shot Learning CoRR paper code Str. RL. FS.
A Meta-Learning Approach to One-Step Active-Learning [email protected]/ECML paper Pool. Meta.
Generative Adversarial Active Learning arXiv paper Pool. Syn.
Active Learning from Peers NIPS paper
Learning Active Learning from Data NIPS paper code Pool.
Learning Algorithms for Active Learning ICML paper
Deep Bayesian Active Learning with Image Data ICML paper code Pool.

2018

Title Conf Paper Code Tags Notes
The Power of Ensembles for Active Learning in Image Classification CVPR paper
Adversarial Learning for Semi-Supervised Semantic Segmentation BMVC paper code Pool. SSL.
A Variance Maximization Criterion for Active Learning Pattern Recognition paper
Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning ICLR-WS paper Pool. Meta. RL.
Active Learning for Convolutional Neural Networks: A Core-Set Approach ICLR paper
Adversarial Active Learning for Sequence Labeling and Generation IJCAI paper
Meta-Learning for Batch Mode Active Learning ICLR-WS paper

2019

Title Conf Paper Code Tags Notes
ViewAL: Active Learning with Viewpoint Entropy for Semantic Segmentation CVPR paper Pool.
Bayesian Generative Active Deep Learning ICML paper code Pool. Semi.
Variational Adversarial Active Learning ICCV paper code Pool. SSL.
Integrating Bayesian and Discriminative Sparse Kernel Machines for Multi-class Active Learning NeurIPS paper
Active Learning via Membership Query Synthesisfor Semi-supervised Sentence Classification CoNLL paper
Discriminative Active Learning arXiv paper
Semantic Redundancies in Image-Classification Datasets: The 10% You Donโ€™t Need arXiv paper
Bayesian Batch Active Learning as Sparse Subset Approximation NIPS paper
Learning Loss for Active Learning CVPR paper code Pool.
Rapid Performance Gain through Active Model Reuse IJCAI paper
Parting with Illusions about Deep Active Learning arXiv paper Cri.
BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning NIPS paper

2020

Title Conf Paper Code Tags Notes
Reinforced active learning for image segmentation ICLR paper code Pool. RL.
[BADGE] Batch Active learning by Diverse Gradient Embeddings ICLR paper code Pool.
Adversarial Sampling for Active Learning WACV paper Pool.
Online Active Learning of Reject Option Classifiers AAAI paper
Deep Active Learning for Biased Datasets via Fisher Kernel Self-Supervision CVPR paper
Deep Reinforcement Active Learning for Medical Image Classification MICCAI paper Pool. RL.
State-Relabeling Adversarial Active Learning CVPR paper code Pool.
Towards Robust and Reproducible Active Learning Using Neural Networks arXiv paper Cri.
Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost ECCV paper Pool. SSL.

2021

Title Conf Paper Code Tags Notes
MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning arXiv paper Pool. Meta. RL.
Can Active Learning Preemptively Mitigate Fairness Issues ICLR-RAI paper code Pool. Thinking fairness issues
Sequential Graph Convolutional Network for Active Learning CVPR paper code Pool.
Task-Aware Variational Adversarial Active Learning CVPR paper code Pool.
Effective Evaluation of Deep Active Learning on Image Classification Tasks arXiv paper Cri.
Semi-Supervised Active Learning for Semi-Supervised Models: Exploit Adversarial Examples With Graph-Based Virtual Labels ICCV paper Pool. SSL.
Contrastive Coding for Active Learning under Class Distribution Mismatch ICCV paper code Pool. Defines a good question
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering ACL-IJCNLP paper code Pool. Thinking about outliers
LADA: Look-Ahead Data Acquisition via Augmentation for Active Learning NeurIPS paper Pool.
Multi-Anchor Active Domain Adaptation for Semantic Segmentation ICCV paper code Pool.
Active Learning for Lane Detection: A Knowledge Distillation Approach ICCV paper Pool.
Active Contrastive Learning of Audio-Visual Video Representations ICLR paper code Pool.
Multiple instance active learning for object detection CVPR paper code Pool.
SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency NeurIPS paper Robot exploration
Influence Selection for Active Learning ICCV paper code Pool.
Reducing Label Effort: Self-Supervised meets Active Learning arXiv paper Pool. SS. Cri. A meaningful attempt on the combination of SS & AL

Turtorials

Tools

Owner
BAI Fan
Deep Learning, Active Learning, Robotics, Artificial Intelligence.
BAI Fan
PURE: End-to-End Relation Extraction

PURE: End-to-End Relation Extraction This repository contains (PyTorch) code and pre-trained models for PURE (the Princeton University Relation Extrac

Princeton Natural Language Processing 657 Jan 09, 2023
Synthetic LiDAR sequential point cloud dataset with point-wise annotations

SynLiDAR dataset: Learning From Synthetic LiDAR Sequential Point Cloud This is official repository of the SynLiDAR dataset. For technical details, ple

78 Dec 27, 2022
An Exact Solver for Semi-supervised Minimum Sum-of-Squares Clustering

PC-SOS-SDP: an Exact Solver for Semi-supervised Minimum Sum-of-Squares Clustering PC-SOS-SDP is an exact algorithm based on the branch-and-bound techn

Antonio M. Sudoso 1 Nov 13, 2022
Cervix ROI Segmentation Using U-NET

Cervix ROI Segmentation Using U-NET Overview This code illustrate how to segment the ROI in cervical images using U-NET. The ROI here meant to include

Scotty Kwok 35 Sep 14, 2022
RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

RITA: a Study on Scaling Up Generative Protein Sequence Models RITA is a family of autoregressive protein models, developed by a collaboration of Ligh

LightOn 69 Dec 22, 2022
Unofficial implementation of Proxy Anchor Loss for Deep Metric Learning

Proxy Anchor Loss for Deep Metric Learning Unofficial pytorch, tensorflow and mxnet implementations of Proxy Anchor Loss for Deep Metric Learning. Not

Geonmo Gu 3 Jun 09, 2021
Boundary IoU API (Beta version)

Boundary IoU API (Beta version) Bowen Cheng, Ross Girshick, Piotr Dollรกr, Alexander C. Berg, Alexander Kirillov [arXiv] [Project] [BibTeX] This API is

Bowen Cheng 177 Dec 29, 2022
Toolchain to build Yoshi's Island from source code

Project-Y Toolchain to build Yoshi's Island (J) V1.0 from source code, by MrL314 Last updated: September 17, 2021 Setup To begin, download this toolch

MrL314 19 Apr 18, 2022
Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Skyformer This repository is the official implementation of Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr"om Method (NeurIPS 2021).

Qi Zeng 46 Sep 20, 2022
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

T-Zero This repository serves primarily as codebase and instructions for training, evaluation and inference of T0. T0 is the model developed in Multit

BigScience Workshop 253 Dec 27, 2022
load .txt to train YOLOX, same as Yolo others

YOLOX train your data you need generate data.txt like follow format (per line- one image). prepare one data.txt like this: img_path1 x1,y1,x2,y2,clas

LiMingf 18 Aug 18, 2022
Model Serving Made Easy

The easiest way to build Machine Learning APIs BentoML makes moving trained ML models to production easy: Package models trained with any ML framework

BentoML 4.4k Jan 08, 2023
The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

PlantStereo This is the official implementation code for the paper "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction".

Wang Qingyu 14 Nov 28, 2022
InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing Figure: High-quality facial attributes editing results with InterFaceGA

GenForce: May Generative Force Be with You 1.3k Jan 09, 2023
Code and data for "TURL: Table Understanding through Representation Learning"

TURL This Repo contains code and data for "TURL: Table Understanding through Representation Learning". Environment and Setup Data Pretraining Finetuni

SunLab-OSU 63 Nov 23, 2022
MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction The dataset contains 3 million attribute-value annotations across 1257 unique ca

Google Research Datasets 89 Jan 08, 2023
Share a benchmark that can easily apply reinforcement learning in Job-shop-scheduling

Gymjsp Gymjsp is an open source Python library, which uses the OpenAI Gym interface for easily instantiating and interacting with RL environments, and

134 Dec 08, 2022
Architecture Patterns with Python (TDD, DDD, EDM)

architecture-traning Architecture Patterns with Python (TDD, DDD, EDM) Chapter 5. ๋†’์€ ๊ธฐ์–ด๋น„์™€ ๋‚ฎ์€ ๊ธฐ์–ด๋น„์˜ TDD 5.2 ๋„๋ฉ”์ธ ๊ณ„์ธต ํ…Œ์ŠคํŠธ๋ฅผ ์„œ๋น„์Šค ๊ณ„์ธต์œผ๋กœ ์˜ฎ๊ฒจ์•ผ ํ•˜๋Š”๊ฐ€? ๋„๋ฉ”์ธ ๊ณ„์ธต ํ…Œ์ŠคํŠธ def

minsung sim 2 Mar 04, 2022
Tutorial for the PERFECTING FACTORY 5.0 WITH EDGE-POWERED AI workshop

Workshop Advantech Jetson Nano This tutorial has been designed for the PERFECTING FACTORY 5.0 WITH EDGE-POWERED AI workshop in collaboration with Adva

Edge Impulse 18 Nov 22, 2022
SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer

SlideGraph+: Whole Slide Image Level Graphs to Predict HER2 Status in Breast Cancer A novel graph neural network (GNN) based model (termed SlideGraph+

28 Dec 24, 2022