The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Last update: Dec 27, 2022

Related tags

Deep Learning bmvc2021

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

^{Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine- grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval.}

Citation

@inproceedings{choudhury2021curious,
author = {Choudhury, Subhabrata and Laina, Iro and Rupprecht, Christian and Vedaldi, Andrea},
booktitle = {British Machine Vision Conference}
title = {The Curious Layperson: Fine-Grained Image Recognition without Expert Labels}
volume = {32},
year = {2021}
}

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Related tags

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

Citation

Owner

Subhabrata Choudhury

Official Pytorch implementation of Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

Cooperative Driving Dataset: a dataset for multi-agent driving scenarios

custom pytorch implementation of MoCo v3

Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper

Pre-trained model, code, and materials from the paper "Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation" (MICCAI 2019).

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls

Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization

[ICCV-2021] An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation

Multiple custom object count and detection using YOLOv3-Tiny method

Edge Restoration Quality Assessment

Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.

Final project code: Implementing BicycleGAN, for CIS680 FA21 at University of Pennsylvania

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

September-Assistant - Open-source Windows Voice Assistant

A Light CNN for Deep Face Representation with Noisy Labels

Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data

(AAAI2022) Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation

CRNN With PyTorch

The 3rd place solution for competition

Ranger deep learning optimizer rewrite to use newest components