The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Last update: Dec 27, 2022

Related tags

Deep Learning bmvc2021

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be released soon.

Abstract:

Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine-grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval.
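The sentence-level matching idea in the abstract can be illustrated with a toy sketch. This is not the authors' model: the paper trains a textual similarity model, whereas the code below swaps in plain bag-of-words cosine similarity as a stand-in. All function names (`document_score`, `rank_documents`) and the example texts are hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter


def bag_of_words(sentence):
    """Lowercase a sentence and count its word tokens."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def split_sentences(text):
    """Naive sentence splitter on '.', '!' and '?'."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def document_score(description, document):
    """Average, over description sentences, of the best-matching document sentence."""
    desc_vecs = [bag_of_words(s) for s in split_sentences(description)]
    doc_vecs = [bag_of_words(s) for s in split_sentences(document)]
    if not desc_vecs or not doc_vecs:
        return 0.0
    best = [max(cosine(d, s) for s in doc_vecs) for d in desc_vecs]
    return sum(best) / len(best)


def rank_documents(description, documents):
    """Return document names sorted from best to worst match."""
    return sorted(
        documents,
        key=lambda name: document_score(description, documents[name]),
        reverse=True,
    )


# Hypothetical toy data, not taken from the paper's datasets.
documents = {
    "northern cardinal": "The male is bright red with a black face mask. "
                         "It has a short thick red bill.",
    "blue jay": "The plumage is blue with a white chest. "
                "It has a black collar around the neck.",
}
description = "This bird is red with a black face. It has a short bill."
ranking = rank_documents(description, documents)
print(ranking)
```

In this sketch each description sentence is scored against every sentence of a document, and only the best match per description sentence contributes to the document score, so a long encyclopedia entry is not penalized for containing sentences unrelated to appearance.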

Citation

@inproceedings{choudhury2021curious,
author = {Choudhury, Subhabrata and Laina, Iro and Rupprecht, Christian and Vedaldi, Andrea},
booktitle = {British Machine Vision Conference},
title = {The Curious Layperson: Fine-Grained Image Recognition without Expert Labels},
volume = {32},
year = {2021}
}

Owner

Subhabrata Choudhury
