CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Last update: Jan 07, 2023

Overview

CLIP (Contrastive Language–Image Pre-training)

Experiments (Evaluation)

Model	Dataset	Acc (%)
ViT-B/32 (Paper)	CIFAR100	65.1
ViT-B/32 (Our)	CIFAR100	61.71
ViT-B/32 (Paper	CIFAR10	91.3
ViT-B/32 (Our)	CIFAR10	88.8

Overview

Training

Work In Process

Usage

Evaluation

python evaluation.py --dataset CIFAR100 --cuda True

args
- dataset (str): CIFAR10, CIFAR100 (default: CIFAR100)
- num_workers (int): default: 0
- batch_size (int): default: 128
- cuda (bool): False
Training
- Prepare Data
  - Visual Genome Dataset link
  - Download (images, region descriptions)
- training
```
python main.py --base_dir ./ --cuda True
```

Reference

paper link
Author: Alec Radford, Jong Wook Kim, Chris Hallacy, Girish Sastry, Amanda Askell, Pamela Mishkin, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Jack Clark, Gretchen Krueger, Ilya Sutskever
OpenAI

Owner

Myeongjun Kim

Computer Vision Research using Deep Learning

GitHub Repository

Testability-Aware Low Power Controller Design with Evolutionary Learning, ITC2021

Testability-Aware Low Power Controller Design with Evolutionary Learning This repo contains the source code of Testability-Aware Low Power Controller

1 Dec 26, 2021

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

323 Jan 01, 2023

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

AST: Audio Spectrogram Transformer Introduction Citing Getting Started ESC-50 Recipe Speechcommands Recipe AudioSet Recipe Pretrained Models Contact I

603 Jan 07, 2023

EEGEyeNet is benchmark to evaluate ET prediction based on EEG measurements with an increasing level of difficulty

Introduction EEGEyeNet EEGEyeNet is a benchmark to evaluate ET prediction based on EEG measurements with an increasing level of difficulty. Overview T

23 Dec 22, 2022

This repository accompanies the ACM TOIS paper "What can I cook with these ingredients?" - Understanding cooking-related information needs in conversational search

In this repository you find data that has been gathered when conducting in-situ experiments in a conversational cooking setting. These data include tr

6 Sep 22, 2022

Problem-943.-ACMP - Problem 943. ACMP

Problem-943.-ACMP В "main.py" расположен вариант моего решения задачи 943 с серв

2 Aug 19, 2022

Pansharpening by convolutional neural networks in the full resolution framework

Z-PNN: Zoom Pansharpening Neural Network Pansharpening by convolutional neural networks in the full resolution framework is a deep learning method for

20 Nov 24, 2022

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

OCTIS : Optimizing and Comparing Topic Models is Simple! OCTIS (Optimizing and Comparing Topic models Is Simple) aims at training, analyzing and compa

478 Jan 01, 2023

Graph-based community clustering approach to extract protein domains from a predicted aligned error matrix

Using a predicted aligned error matrix corresponding to an AlphaFold2 model , returns a series of lists of residue indices, where each list corresponds to a set of residues clustering together into a

24 Nov 23, 2022

View model summaries in PyTorch!

torchinfo (formerly torch-summary) Torchinfo provides information complementary to what is provided by print(your_model) in PyTorch, similar to Tensor

1.5k Jan 05, 2023

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

KPAC: Kernel-Sharing Parallel Atrous Convolutional block This repository contains the official Tensorflow implementation of the following paper: Singl

50 Dec 29, 2022

Riemannian Convex Potential Maps

Modeling distributions on Riemannian manifolds is a crucial component in understanding non-Euclidean data that arises, e.g., in physics and geology. The budding approaches in this space are limited b

61 Nov 28, 2022

Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021

Contents AnonyGAN Installation Dataset Preparation Generating Images Using Pretrained Model Train and Test New Models Evaluation Acknowledgments Citat

10 May 24, 2022

Keras Model Implementation Walkthrough

17 Sep 27, 2022

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

Plant extraction workflow Source code for the plant extraction workflow introduced in the paper "Agricultural Plant Cataloging and Establishment of a

0 Apr 22, 2022

🗺 General purpose U-Network implemented in Keras for image segmentation

TF-Unet General purpose U-Network implemented in Keras for image segmentation Getting started • Training • Evaluation Getting started Looking for Jupy

2 Aug 31, 2022

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

DeCLIP Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm. Our paper is available in arxiv Updates ** Ou

470 Dec 30, 2022

[ICCV21] Code for RetrievalFuse: Neural 3D Scene Reconstruction with a Database

RetrievalFuse Paper | Project Page | Video RetrievalFuse: Neural 3D Scene Reconstruction with a Database Yawar Siddiqui, Justus Thies, Fangchang Ma, Q

75 Dec 22, 2022

A general and strong 3D object detection codebase that supports more methods, datasets and tools (debugging, recording and analysis).

ALLINONE-Det ALLINONE-Det is a general and strong 3D object detection codebase built on OpenPCDet, which supports more methods, datasets and tools (de

5 Nov 03, 2022

MAterial del programa Misión TIC 2022

Mision TIC 2022 Esta iniciativa, aparece como respuesta frente a los retos de la Cuarta Revolución Industrial, y tiene como objetivo la formación de 1

6 May 25, 2022

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Related tags

Overview

CLIP (Contrastive Language–Image Pre-training)

Experiments (Evaluation)

Overview

Training

Usage

Reference

Owner

Myeongjun Kim

Testability-Aware Low Power Controller Design with Evolutionary Learning, ITC2021

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

EEGEyeNet is benchmark to evaluate ET prediction based on EEG measurements with an increasing level of difficulty

This repository accompanies the ACM TOIS paper "What can I cook with these ingredients?" - Understanding cooking-related information needs in conversational search

Problem-943.-ACMP - Problem 943. ACMP

Pansharpening by convolutional neural networks in the full resolution framework

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Graph-based community clustering approach to extract protein domains from a predicted aligned error matrix

View model summaries in PyTorch!

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

Riemannian Convex Potential Maps

Official PyTorch implementation of the paper "Graph-based Generative Face Anonymisation with Pose Preservation" in ICIAP 2021

Keras Model Implementation Walkthrough

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

🗺 General purpose U-Network implemented in Keras for image segmentation

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

[ICCV21] Code for RetrievalFuse: Neural 3D Scene Reconstruction with a Database

A general and strong 3D object detection codebase that supports more methods, datasets and tools (debugging, recording and analysis).

MAterial del programa Misión TIC 2022