INTRODUCTION

This is a modification of the OpenAI-CLIP repo of moein-shariatnia(https://github.com/moein-shariatnia/OpenAI-CLIP).

The current training dataset supports flicker-8k or flicker-30k, and the image encoder supports Resnet50 or ViT(vit_base_patch16_384).

Text encoder supports only DistilBert like moein-shariatnia.

ENVIRONTMENT SETTING

$ virtualenv .venv --python=python3.6
$ source .venv/bin/activate
$ pip install -r requirements.txt

EXECUTTION

Pretrain

$ python3 pretrain.py

Inference

$ python3 inference.py --qeury={YOUR QUERY}

CAUTION

You must set(or check) some options in config.py before pretrain & inference

ex1) dataset("8k" or "30k"): Train dataset(flicker-8k or flicker-30k)

ex2) model_name("resnet50" or "vit_base_patch16_384"): Type of image encoder

ex3) pretrained(True or False): Decide whether to learn by loading pretrain versions of text encoder(DistilBert) and image encoder(resnet50 or ViT)

ex4) batch_size: Set according to the capacity of the machine

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

Related tags

Overview

INTRODUCTION

ENVIRONTMENT SETTING

EXECUTTION

CAUTION

Owner

Sangwon Beak

Translates basic English sentences into the Huna language (hoo-NAH)

End-to-end MLOps pipeline of a BERT model for emotion classification.

Tool which allow you to detect and translate text.

A simple Speech Emotion Recognition (SER) API created using Flask and running in a Docker container.

Code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Spam filtering made easy for you

The swas programming language

一个基于Nonebot2和go-cqhttp的娱乐性qq机器人

Smart discord chatbot integrated with Dialogflow

Weakly-supervised Text Classification Based on Keyword Graph

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Pre-Training with Whole Word Masking for Chinese BERT

Search with BERT vectors in Solr and Elasticsearch

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

Implementation of TF-IDF algorithm to find documents similarity with cosine similarity

Repository of the Code to Chatbots, developed in Python

State-of-the-art NLP through transformer models in a modular design and consistent APIs.

This is the Alpha of Nutte language, she is not complete yet / Essa é a Alpha da Nutte language, não está completa ainda

Yet Another Compiler Visualizer

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.