Zalo AI challenge 2021 task hum to song

Last update: Dec 16, 2022

Overview

Zalo AI challenge 2021 task Hum to Song

Pipeline:

Preparing data for training:

Edit the paths in config/preprocess.yaml:
- raw_path: path to the raw data
- preprocessed_path: output path for the extracted mel spectrograms
- temp_dir: path where the normalized mp3 data is stored

Run the following commands in order:

        python preprocessing.py

        python utils/split_train_val_by_id.py
   
        python utils/augment_mp3.py
   
        python utils/preprocess_augment.py
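
The four preprocessing steps above can also be driven from a single script. The following is a minimal sketch, not part of the repository: the helper names `build_commands` and `run_pipeline` are hypothetical, and it assumes the scripts are run from the repository root and take no command-line arguments, as in the commands above.

```python
import subprocess
import sys
from pathlib import Path

# Script order taken from the steps above.
PREPROCESS_STEPS = [
    "preprocessing.py",
    "utils/split_train_val_by_id.py",
    "utils/augment_mp3.py",
    "utils/preprocess_augment.py",
]


def build_commands(steps=PREPROCESS_STEPS, python=sys.executable):
    """Return one argv list per step, in execution order."""
    return [[python, step] for step in steps]


def run_pipeline(repo_root=".", config="config/preprocess.yaml"):
    """Run every step from repo_root, stopping at the first failure."""
    root = Path(repo_root)
    # Fail early if the config file that must be edited first is not there.
    if not (root / config).is_file():
        raise FileNotFoundError(f"missing {config}; edit it before preprocessing")
    for cmd in build_commands():
        print("running:", " ".join(cmd))
        # check=True raises CalledProcessError if a step exits with a non-zero code.
        subprocess.run(cmd, cwd=root, check=True)
```

Calling `run_pipeline()` from the repository root runs the steps in the listed order and aborts as soon as one of them fails, so later steps never run on incomplete output.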

Train model:

Edit the paths in config/config.py:
- meta_train: path to the train_meta.csv file in preprocessed_path
- train_root: path to the preprocessed mel data
- train_list = 'full_data_train.txt'
- val_list = 'full_data_val.txt'

Run the following commands in order:

        python convert_data.py

        python train.py
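
As an illustration of the config/config.py settings listed above, the fragment below shows how the four fields might be filled in. It is a sketch under assumptions: the `Config` class layout and the `/path/to/preprocessed` placeholder are hypothetical, and only the field names and the two list file names come from this README.

```python
from pathlib import Path

# Placeholder: set this to the preprocessed_path used in config/preprocess.yaml.
PREPROCESSED_PATH = Path("/path/to/preprocessed")


class Config:
    # train_meta.csv lives inside preprocessed_path (file name from the README).
    meta_train = str(PREPROCESSED_PATH / "train_meta.csv")
    # Assumption: the preprocessed mel data sits directly under preprocessed_path.
    train_root = str(PREPROCESSED_PATH)
    # List file names given in the README.
    train_list = "full_data_train.txt"
    val_list = "full_data_val.txt"
```

Deriving `meta_train` and `train_root` from one base path keeps the training config in step with the preprocessing output location, so only one value needs to change when the data moves.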

Infer public test:

Place the raw mp3 data in /data/public_test (which must contain two folders, full_song and hum).

Run the following command:

        ./predict.sh

Infer private test:

Place the raw mp3 data in /data/private_test (which must contain two folders, full_song and hum).

Run the following command:

        ./predict_private_test.sh
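
Both inference scripts expect the same folder layout, so it can be worth checking it before starting a run. The sketch below is not part of the repository: the helper names `missing_subdirs` and `count_mp3` are hypothetical, and it assumes the mp3 files sit directly inside full_song and hum rather than in nested folders.

```python
from pathlib import Path

# Folder names from the README.
REQUIRED_SUBDIRS = ("full_song", "hum")


def missing_subdirs(data_root):
    """Return the required sub-folders that are absent under data_root."""
    root = Path(data_root)
    return [name for name in REQUIRED_SUBDIRS if not (root / name).is_dir()]


def count_mp3(data_root):
    """Count .mp3 files per required sub-folder (0 if the folder is missing)."""
    root = Path(data_root)
    counts = {}
    for name in REQUIRED_SUBDIRS:
        folder = root / name
        # Assumption: files are stored flat, so a non-recursive glob is enough.
        counts[name] = len(list(folder.glob("*.mp3"))) if folder.is_dir() else 0
    return counts
```

For example, `missing_subdirs("/data/public_test")` returns an empty list when both folders are present, and `count_mp3("/data/private_test")` gives a quick check that neither folder is empty.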

Team:

Võ Văn Phúc

Nguyễn Văn Thiều

Lâm Bá Thịnh

Owner

Vo Van Phuc
