The code for the CVPR 2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".

Last update: Oct 21, 2022

Overview

Likert Scoring with Grade Decoupling for Long-term Action Assessment

This is the code for the CVPR 2022 paper "Likert Scoring with Grade Decoupling for Long-term Action Assessment".

Environments

GPU: RTX 2080 Ti
CUDA: 10.2
Python: 3.9.7
PyTorch: 1.10.1+cu102
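
To compare a local setup against the versions listed above, a small check along these lines may help. It is only a sketch: the helper name `report_environment` is made up for illustration and is not part of this repository, and PyTorch is treated as optional so the script still runs before it is installed.

```python
import platform


def report_environment():
    """Collect the Python, PyTorch and CUDA versions visible to this interpreter."""
    info = {"python": platform.python_version(), "torch": None, "cuda": None}
    try:
        import torch  # optional: only importable once PyTorch is installed
    except ImportError:
        return info
    info["torch"] = torch.__version__   # version string of the installed PyTorch build
    info["cuda"] = torch.version.cuda   # CUDA version PyTorch was built with, or None
    info["gpu_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in report_environment().items():
        print(f"{key}: {value}")
```

Other versions may well work; the list above only records the setup that was used.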

Features

The feature and label files of the Rhythmic Gymnastics dataset can be downloaded here.

Running

Before running, fill in or select the arguments enclosed in {}.

Training

CUDA_VISIBLE_DEVICES={device ID} python main.py --video-path {path of video features} --train-label-path {path of label file of training set} --test-label-path {path of label file of test set} --model-name {the name used to save model and log} --action-type {Ball/Clubs/Hoop/Ribbon} --lr 1e-2 --epoch {250/400/500/150} --n_decoder 2 --n_query 4 --alpha 1.0 --margin 1.0 --lr-decay cos --decay-rate 0.01 --dropout 0.3
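
Filling in the placeholders by hand for each action type is error-prone, so the training command can also be assembled programmatically. The sketch below is not part of the repository: `build_train_command` and the `EPOCHS` table are hypothetical names, the paths in the usage example are placeholders, and the pairing of epochs with action types assumes that {250/400/500/150} is listed in the same order as {Ball/Clubs/Hoop/Ribbon}.

```python
import shlex

# Assumption: the epoch options {250/400/500/150} line up positionally with
# the action types {Ball/Clubs/Hoop/Ribbon}; the README does not state this.
EPOCHS = {"Ball": 250, "Clubs": 400, "Hoop": 500, "Ribbon": 150}


def build_train_command(action_type, video_path, train_label_path,
                        test_label_path, model_name):
    """Return the training command as a list of arguments."""
    return [
        "python", "main.py",
        "--video-path", video_path,
        "--train-label-path", train_label_path,
        "--test-label-path", test_label_path,
        "--model-name", model_name,
        "--action-type", action_type,
        "--lr", "1e-2",
        "--epoch", str(EPOCHS[action_type]),
        "--n_decoder", "2",
        "--n_query", "4",
        "--alpha", "1.0",
        "--margin", "1.0",
        "--lr-decay", "cos",
        "--decay-rate", "0.01",
        "--dropout", "0.3",
    ]


if __name__ == "__main__":
    # Placeholder paths and model name; replace them with real locations.
    cmd = build_train_command("Ball", "path/to/features",
                              "path/to/train_labels", "path/to/test_labels",
                              "ball_run")
    print("CUDA_VISIBLE_DEVICES=0 " + shlex.join(cmd))
```

Printing the command rather than executing it makes it easy to inspect before starting a long run.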

Testing

CUDA_VISIBLE_DEVICES={device ID} python main.py --video-path {path of video features} --train-label-path {path of label file of training set} --test-label-path {path of label file of test set} --action-type {Ball/Clubs/Hoop/Ribbon} --n_decoder 2 --n_query 4 --dropout 0.3 --test --ckpt {the name of the used checkpoint}
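
When evaluation is launched from a script, the GPU can be selected through the process environment instead of a shell prefix. The following is a minimal sketch under that assumption: `build_test_command` is a hypothetical helper, not something shipped with this code, and it only prepares the arguments and environment without running anything.

```python
import os


def build_test_command(action_type, video_path, train_label_path,
                       test_label_path, ckpt, device_id=0):
    """Return (argv, env) for an evaluation run with the flags shown above."""
    argv = [
        "python", "main.py",
        "--video-path", video_path,
        "--train-label-path", train_label_path,
        "--test-label-path", test_label_path,
        "--action-type", action_type,
        "--n_decoder", "2",
        "--n_query", "4",
        "--dropout", "0.3",
        "--test",
        "--ckpt", ckpt,
    ]
    # Copy the current environment and restrict the run to one GPU.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(device_id))
    return argv, env
```

The returned pair can be passed to `subprocess.run(argv, env=env, check=True)` from the repository root once the placeholder arguments point at real files.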
