Team nan solution repository for FPT data-centric competition. Data augmentation, Albumentation, Mosaic, Visualization, KNN application

Last update: Oct 30, 2022

Overview

FPT data centric competition

Introduction

Deep Learning models have become exceedingly developed and popular in recent years. On the other hand, data processing techniques have not been equally developed compared to models

In this competition, participants are provided with a dataset. The goal is to use processing techniques on that dataset to ensure that model achieves the best performance after training.

Following Reinforcement Learning Competition 2021 success, DataComp is a brand new competition with a new approach for researchers. Besides that, DataComp was created to contribute to the prevention of Covid-19 pandemic, using face mask recognition model.

Competition link: https://datacomp.io/gioi-thieu

Our performance

Achieve top 20/400 teams (5% highest team) having the highest score validated on the private test dataset
Our [email protected] score on private test: 0.545
Team name: "nan"
Leaderboard link: https://datacomp.io/bang-xep-hang-cuoi-cung

Methods

We tried many different data augmentation from the basic types such as rotation, shearing, ... to some quite advance techniques such as mosaic, random safe crop,... The library that we're using albumentation Consequently, the combination of these below technqiues result to the final highest score in our case:

Train dataset -> 934 images after relabeled to make sure the correctness is more than 99%
Validation dataset -> 154 images (design an as much as general set by ultilizing KNN technique which is explained below!)
toGray augmentation -> 100 images
CutOut + HorizontalFlip (p=0.5) -> 400 images
Filter only incorrect-mask label images + HorizontalFlip (p=0.7) -> 200 images
Mosaic augmentation -> 451 images (Note: after do the mosaic augmentation, it's crucial to check the set again to exclude all images having poor-quality bboxes at the edge of each image)
Rotation + Shear (prob 50/50) -> 600 images
- Rotation + Shear (prob 50/50) with no-mask & mask only -> 200 images
- Remaining images augmented normally -> 400 images
B.c model perform poorly with images having people appeared behide the door. Therefore, filter & augment specificailly those images in training dataset -> 100 images

--> TOTAL 2939 augmentation images to submit (training + validation)

KNN ultilization

Briefly instroduce about KNN
The application of KNN in our solution

Used to construct as general as possible validation dataset
Categorize type of images in training set to faster filter images with specific feature, characteristic (Ex: Img having people behide doors, img having people wearing different types of masks)

Team nan solution repository for FPT data-centric competition. Data augmentation, Albumentation, Mosaic, Visualization, KNN application

Related tags

Overview

FPT data centric competition

Introduction

Our performance

Methods

KNN ultilization

Owner

Pham Viet Hoang (Harry)

A Unified Generative Framework for Various NER Subtasks.

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

Unity Propagation in Bayesian Networks Handling Inconsistency via Unity Smoothing

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight

NeurIPS'21 Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

codes for IKM (arXiv2021, Submitted to IEEE Trans)

Code for the paper titled "Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks" (NeurIPS 2021 Spotlight).

Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

This repository contains demos I made with the Transformers library by HuggingFace.

Create UIs for prototyping your machine learning model in 3 minutes

Doing the asl sign language classification on static images using graph neural networks.

Speckle-free Holography with Partially Coherent Light Sources and Camera-in-the-loop Calibration

Weakly- and Semi-Supervised Panoptic Segmentation (ECCV18)

Reproducing-BowNet: Learning Representations by Predicting Bags of Visual Words

Codes for the compilation and visualization examples to the HIF vegetation dataset