PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Last update: Dec 26, 2022

Related tags

Deep Learning Dancing2Music

Overview

Dancing to Music

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Paper

Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
Dancing to Music Neural Information Processing Systems (NeurIPS) 2019
[Paper] [YouTube] [Project] [Blog] [Supp]

Example Videos

Beat-Matching
1st row: generated dance sequences, 2nd row: music beats, 3rd row: kinematics beats

Multimodality
Generate various dance sequences with the same music and the same initial pose.

Long-Term Generation
Seamlessly generate a dance sequence with arbitrary length.

Photo-Realisitc Videos
Map generated dance sequences to photo-realistic videos.

Train Decomposition

python train_decomp.py --name Decomp

Train Composition

python train_comp.py --name Decomp --decomp_snapshot DECOMP_SNAPSHOT

Demo

python demo.py --decomp_snapshot DECOMP_SNAPSHOT --comp_snapshot COMP_SNAPSHOT --aud_path AUD_PATH --out_file OUT_FILE --out_dir OUT_DIR --thr THR

Flags
- aud_path: input .wav file
- out_file: location of output .mp4 file
- out_dir: directory of output frames
- thr: threshold based on motion magnitude
- modulate: whether to do beat warping
Example

python demo.py -decomp_snapshot snapshot/Stage1.ckpt --comp_snapshot snapshot/Stage2.ckpt --aud_path demo/demo.wav --out_file demo/out.mp4 --out_dir demo/out_frame

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{lee2019dancing2music,
  title={Dancing to Music},
  author={Lee, Hsin-Ying and Yang, Xiaodong and Liu, Ming-Yu and Wang, Ting-Chun and Lu, Yu-Ding and Yang, Ming-Hsuan and Kautz, Jan},
  booktitle={NeurIPS},
  year={2019}
}

License

Copyright (C) 2020 NVIDIA Corporation. All rights reserved. This work is made available under NVIDIA Source Code License (1-Way Commercial). To view a copy of this license, visit https://nvlabs.github.io/Dancing2Music/LICENSE.txt.

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Related tags

Overview

Dancing to Music

Paper

Example Videos

Train Decomposition

Train Composition

Demo

Citation

License

Owner

NVIDIA Research Projects

In this project we predict the forest cover type using the cartographic variables in the training/test datasets.

Trainable Bilateral Filter Layer (PyTorch)

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

PyTorch implementation for View-Guided Point Cloud Completion

Removing Inter-Experimental Variability from Functional Data in Systems Neuroscience

A pyparsing-based library for parsing SOQL statements

Software associated to AAAI paper "Planning with Biological Neurons and Synapses"

Bolt Online Learning Toolbox

A Runtime method overload decorator which should behave like a compiled language

Low-dose Digital Mammography with Deep Learning

Save-restricted-v-3 - Save restricted content Bot For telegram

A highly efficient and modular implementation of Gaussian Processes in PyTorch

General neural ODE and DAE modules for power system dynamic modeling.

《Lerning n Intrinsic Grment Spce for Interctive Authoring of Grment Animtion》

Libtorch yolov3 deepsort

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing

Ultra-lightweight human body posture key point CNN model. ModelSize:2.3MB HUAWEI P40 NCNN benchmark: 6ms/img,

UT-Sarulab MOS prediction system using SSL models

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)