Kaggle G2Net Gravitational Wave Detection : 2nd place solution

Last update: Dec 26, 2022

Solution writeup: https://www.kaggle.com/c/g2net-gravitational-wave-detection/discussion/275341

Instructions

1. Download data

Download the competition dataset from the competition website and place the files in the input/ directory.

┣ input/
┃   ┣ training_labels.csv
┃   ┣ sample_submission.csv
┃   ┣ train/
┃   ┣ test/
┃
┣ configs.py
┣ ...
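
A quick way to confirm the layout above before going further is a small check script. This is only a sketch: the helper name missing_inputs is hypothetical and not part of this repository, and it checks only the four entries listed in the tree.

```python
from pathlib import Path

# Entries the directory tree above expects under input/.
EXPECTED = ["training_labels.csv", "sample_submission.csv", "train", "test"]


def missing_inputs(input_dir):
    """Return the expected entries that are absent from input_dir (hypothetical helper)."""
    root = Path(input_dir)
    return [name for name in EXPECTED if not (root / name).exists()]


if __name__ == "__main__":
    missing = missing_inputs("input")
    if missing:
        print("Missing from input/:", ", ".join(missing))
    else:
        print("input/ layout looks complete.")
```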

(Optional:) Add your hardware configurations

# configs.py
HW_CFG = {
    'RTX3090': (16, 128, 1, 24), # CPU count, RAM amount(GB), GPU count, GPU RAM(GB)
    'A100': (9, 60, 1, 40), 
    'Your config': (128, 512, 8, 40) # add your hardware config!
}
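
Each HW_CFG entry is a plain 4-tuple, so the field order in the comment matters. The sketch below shows one way to read an entry by name. The Hardware namedtuple and the describe helper are illustrative assumptions; how the training scripts actually consume HW_CFG is not shown here.

```python
from collections import namedtuple

# Field order follows the comment in configs.py:
# CPU count, RAM amount (GB), GPU count, GPU RAM (GB).
Hardware = namedtuple("Hardware", ["cpu_count", "ram_gb", "gpu_count", "gpu_ram_gb"])

# Copy of the two predefined entries from configs.py.
HW_CFG = {
    "RTX3090": (16, 128, 1, 24),
    "A100": (9, 60, 1, 40),
}


def describe(name, hw_cfg=HW_CFG):
    """Unpack one HW_CFG entry into named fields (hypothetical helper)."""
    return Hardware(*hw_cfg[name])
```

For example, describe("A100").gpu_ram_gb gives 40, and an unknown key raises KeyError, which is a cheap way to catch a misspelled config name.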

2. Set up Python environment

conda

conda env create -n kumaconda -f=environment.yaml
conda activate kumaconda

docker

WIP

3. Prepare data

Two new files, input/train.csv and input/test.csv, will be created.

python prep_data.py

(Optional:) Prepare waveform cache

You can speed up training by building a waveform cache.
This is not recommended if your machine has less than 32GB of RAM.
input/train_cache.pickle and input/test_cache.pickle will be created.

python prep_data.py --cache

Then add the cache paths to the Baseline class in configs.py.

# configs.py
class Baseline:
    name = 'baseline'
    seed = 2021
    train_path = INPUT_DIR/'train.csv'
    test_path = INPUT_DIR/'test.csv'
    train_cache = INPUT_DIR/'train_cache.pickle' # here
    test_cache = INPUT_DIR/'test_cache.pickle' # here
    cv = 5
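
To judge whether the cache fits in memory, a rough size estimate can help. The sketch below is arithmetic only. The 3 x 4096 sample shape matches the competition data, but the number of bytes per value stored in the cache is an assumption (2 bytes here, i.e. float16), and cache_size_gb is a hypothetical helper, not a function from prep_data.py.

```python
def cache_size_gb(n_samples, n_channels=3, n_points=4096, bytes_per_value=2):
    """Estimate the in-memory size of a waveform cache in GiB.

    n_channels and n_points default to the competition's sample shape.
    bytes_per_value is an assumption about the cache dtype (2 = float16).
    """
    total_bytes = n_samples * n_channels * n_points * bytes_per_value
    return total_bytes / 1024 ** 3


if __name__ == "__main__":
    # Replace with the real row count of input/train.csv.
    n_train = 100_000
    print(f"{cache_size_gb(n_train):.1f} GiB for {n_train} samples")
```

Pickle loading can briefly need more memory than the final objects occupy, so treat the result as a lower bound.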

4. Train neural network

Each experiment class has a name (e.g. the name for Nspec16 is nspec_16).
Each experiment produces the following outputs:

outoffolds.npy : (train size, 1) np.float32
predictions.npy : (cv fold, test size, 1) np.float32
{name}_{timestamp}.log : training log
foldx.pt : pytorch checkpoint

All outputs will be created in results/{name}/.
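
The two .npy outputs can be inspected directly with NumPy. The sketch below assumes the shapes listed above and that scores are judged by ROC AUC, as in the competition. fold_average and roc_auc are hypothetical helpers written for this example, not functions from this repository.

```python
import numpy as np


def fold_average(predictions):
    """Average test predictions over the fold axis.

    predictions: array of shape (cv fold, test size, 1).
    Returns an array of shape (test size,).
    """
    predictions = np.asarray(predictions, dtype=np.float32)
    return predictions.mean(axis=0).reshape(-1)


def roc_auc(labels, scores):
    """Rank-based ROC AUC (Mann-Whitney U); tied scores share an average rank."""
    labels = np.asarray(labels).reshape(-1).astype(bool)
    scores = np.asarray(scores, dtype=np.float64).reshape(-1)
    n_pos = int(labels.sum())
    n_neg = int(labels.size - n_pos)
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")
    # Each group of equal scores occupies consecutive 1-based ranks;
    # assign every member the mean rank of its group.
    _, inverse, counts = np.unique(scores, return_inverse=True, return_counts=True)
    last_rank = np.cumsum(counts)
    mean_rank = last_rank - (counts - 1) / 2.0
    ranks = mean_rank[inverse]
    u_stat = ranks[labels].sum() - n_pos * (n_pos + 1) / 2.0
    return float(u_stat / (n_pos * n_neg))
```

Typical use would be roc_auc(targets, np.load('results/nspec_16/outoffolds.npy')), where targets comes from the training labels. This assumes the out-of-fold rows are in the same order as the training CSV, which should be verified before trusting the number.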

python train.py --config {experiment class}
# [Options]
# --progress_bar    : Everyone loves progress bar
# --inference       : Run inference only
# --tta             : Run test time augmentations (FlipWave)
# --limit_fold x    : Train a single fold x. You must run inference again by yourself.
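
For readers who want to see how these flags fit together, here is a hypothetical argparse mirror of the options listed above. It is a sketch, not the parser in train.py: the flag names come from the list, while the types and defaults are assumptions.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the documented train.py options (illustrative only)."""
    parser = argparse.ArgumentParser(description="Sketch of the train.py command line")
    parser.add_argument("--config", type=str, required=True,
                        help="experiment class name, e.g. Nspec16")
    parser.add_argument("--progress_bar", action="store_true",
                        help="show a progress bar")
    parser.add_argument("--inference", action="store_true",
                        help="run inference only")
    parser.add_argument("--tta", action="store_true",
                        help="run test time augmentations")
    # Assumption: no value means all folds are trained.
    parser.add_argument("--limit_fold", type=int, default=None,
                        help="train a single fold with this index")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["--config", "Nspec16", "--limit_fold", "0"])
    print(args)
```

Since training a single fold with --limit_fold leaves inference to you, a second run with --inference is the natural follow-up once all folds have checkpoints.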

5. Train neural network again (pseudo-label)

For experiments whose names start with Pseudo, use train_pseudo.py instead of train.py.
Outputs and options are the same as for train.py.
Make sure the dependent experiment (see the table below) has run successfully first.

python train_pseudo.py --config {experiment class}
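
Before launching a pseudo-label run, it can save time to check that the dependency left its outputs in results/{name}/. The sketch below only looks for the files listed in step 4. Which of them train_pseudo.py actually reads is an assumption, and dependency_status is a hypothetical helper.

```python
from pathlib import Path


def dependency_status(name, results_dir="results"):
    """Report which outputs of experiment `name` exist under results_dir.

    `name` is the experiment's name attribute (e.g. nspec_16), not the class name.
    """
    out_dir = Path(results_dir) / name
    return {
        "outoffolds.npy": (out_dir / "outoffolds.npy").exists(),
        "predictions.npy": (out_dir / "predictions.npy").exists(),
        # Number of fold checkpoints found (foldx.pt in the output list).
        "checkpoints": len(list(out_dir.glob("fold*.pt"))),
    }


if __name__ == "__main__":
    print(dependency_status("nspec_16"))
```

With cv = 5 in the Baseline class, a complete run would be expected to show both .npy files and five checkpoints.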

Experiments

#	Experiment	Dependency	Frontend	Backend	Input size	CV	Public LB	Private LB
1	Pseudo06	Nspec12	CWT	efficientnet-b2	256 x 512	0.8779	0.8797	0.8782
2	Pseudo07	Nspec16	CWT	efficientnet-b2	128 x 1024	0.87841	0.8801	0.8787
3	Pseudo12	Nspec12arch0	CWT	densenet201	256 x 512	0.87762	0.8796	0.8782
4	Pseudo13	MultiInstance04	CWT	xcit-tiny-p16	384 x 768	0.87794	0.8800	0.8782
5	Pseudo14	Nspec16arch17	CWT	efficientnet-b7	128 x 1024	0.87957	0.8811	0.8800
6	Pseudo18	Nspec21	CWT	efficientnet-b4	256 x 1024	0.87942	0.8812	0.8797
7	Pseudo10	Nspec16spec13	CWT	efficientnet-b2	128 x 1024	0.87875	0.8802	0.8789
8	Pseudo15	Nspec22aug1	WaveNet	efficientnet-b2	128 x 1024	0.87846	0.8809	0.8794
9	Pseudo16	Nspec22arch2	WaveNet	efficientnet-b6	128 x 1024	0.87982	0.8823	0.8807
10	Pseudo19	Nspec22arch6	WaveNet	densenet201	128 x 1024	0.87831	0.8818	0.8804
11	Pseudo17	Nspec23arch3	CNN	efficientnet-b6	128 x 1024	0.87982	0.8823	0.8808
12	Pseudo21	Nspec22arch7	WaveNet	effnetv2-m	128 x 1024	0.87861	0.8831	0.8815
13	Pseudo22	Nspec23arch5	CNN	effnetv2-m	128 x 1024	0.87847	0.8817	0.8799
14	Pseudo23	Nspec22arch12	WaveNet	effnetv2-l	128 x 1024	0.87901	0.8829	0.8811
15	Pseudo24	Nspec30arch2	WaveNet	efficientnet-b6	128 x 1024	0.8797	0.8817	0.8805
16	Pseudo25	Nspec25arch1	WaveNet	efficientnet-b3	256 x 1024	0.87948	0.8820	0.8803
17	Pseudo26	Nspec22arch10	WaveNet	resnet200d	128 x 1024	0.87791	0.881	0.8797
18	PseudoSeq04	Seq03aug3	ResNet1d-18		-	0.87663	0.8804	0.8785
19	PseudoSeq07	Seq12arch4	WaveNet		-	0.87698	0.8796	0.8784
20	PseudoSeq03	Seq09	DenseNet1d-121		-	0.86826	0.8723	0.8703

Owner

Hiroshechka Y
