[13th ToBig's Conference] OK Mugle! - From Genre to Melody, Content-based Music Recommendation

Last update: Oct 09, 2022

Overview

Ok Mugle! 🎵

From Genre to Melody, Content-based Music Recommendation

'Ok Mugle!' is a music recommendation project presented at the 13th ToBig's Conference (2022.01.15).

Description 📖

This project uses the Melon Playlist Continuation data provided by Kakao Arena to recommend songs similar to the song a user searches for.

[Model] Define 'similarity' in terms of melody, mood, situation, genre, and so on
- Build models that reflect these factors: Music2Vec, Time Convolutional AutoEncoder, and CosineEmbeddingLoss Multimodal
[Retrieval] Build retrieval by computing the cosine similarity between embeddings
[Ranking] Apply several ranking methods, then ensemble the recommendation results
[Serving] Finally, build the web app and serve the model using the recommendations from the 'Score Total Top 10' ranking method
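
The Retrieval and Ranking steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names, the toy embeddings, and the specific rank-weighting scheme (a song at 0-based rank r in a list of n candidates scores n - r) are all assumptions made for the example. It is written in plain Python so it runs without extra packages; real embeddings would normally be handled with a vectorized library.

```python
import math
from collections import defaultdict


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def retrieve_top_k(query_id, embeddings, k=50):
    """Return the k song ids most similar to query_id, best first."""
    query = embeddings[query_id]
    scored = [
        (cosine_similarity(query, vec), song_id)
        for song_id, vec in embeddings.items()
        if song_id != query_id
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [song_id for _, song_id in scored[:k]]


def rank_weighted_ensemble(candidate_lists, top_n=10):
    """Merge ranked lists; higher ranks contribute larger scores (assumed scheme)."""
    scores = defaultdict(float)
    for candidates in candidate_lists:
        n = len(candidates)
        for rank, song_id in enumerate(candidates):
            scores[song_id] += n - rank
    # Sort by descending score, breaking ties by song id for determinism.
    ranked = sorted(scores, key=lambda s: (-scores[s], s))
    return ranked[:top_n]


# Toy embeddings (hypothetical values), one dict per embedding type.
genre_emb = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0], "d": [0.7, 0.7]}
mel_emb = {"a": [0.2, 1.0], "b": [0.3, 0.9], "c": [0.1, 1.0], "d": [0.5, 0.5]}

genre_top = retrieve_top_k("a", genre_emb, k=3)  # ['b', 'd', 'c']
mel_top = retrieve_top_k("a", mel_emb, k=3)      # ['c', 'b', 'd']
recommended = rank_weighted_ensemble([genre_top, mel_top], top_n=2)
print(recommended)  # ['b', 'c']
```

Song "b" wins here because it ranks well under both embeddings, which is the point of ensembling across similarity criteria.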

Usage ✔️

Enter the following commands in a Windows shell, from the directory that contains server.py (5. web), to run the server.

set FLASK_APP=server
set FLASK_ENV=development
flask run

Result (Web) 💻

Go to the web app (ToBigs 13th Conference Music Recommendation)
Web main screen

Result screen after searching for '비투비 - 비밀 (Insane) (Acoustic Ver.)' in the search box

Presentation 🙋

The conference presentation video and report. See the links below for the detailed analysis.

Contributor 🧑‍🤝‍🧑

Members of ToBig's, a leading intercollegiate club for big data analysis and artificial intelligence, took part in this project.

Cohort	Name
15th	이성범
16th	김권호
16th	박한나
16th	이승주
16th	이예림
16th	주지훈
7th	이광록 (mentor)

File Directory 📂

Ok Mugle!
├── 1. preprocessig
│   ├── make_song_meta_and_playlist.ipynb       # preprocess song and playlist data
│   ├── make_mel_data.ipynb                     # preprocess mel data
│   └── make_mel_batch_data.ipynb               # preprocess mel data in batches
│
├── 2. model
│   ├── genre_embedding_model.ipynb             # Music2Vec
│   ├── mel_embedding_model.ipynb               # Time Convolutional Autoencoder
│   └── genre_and_mel_embedding_model.ipynb     # CosineEmbeddingLoss Multimodal
│
├── 3. embedding-visualization
│   └── embedding_visualization_tsne.ipynb      # t-SNE visualization of each embedding
│
├── 4. ranking
│   ├── make_ranking_data_preprocessig.ipynb    # build a Top-50 cosine-similarity dataset for each embedding
│   ├── make_ranking_data_multiprocessig.py     # multiprocessing functions used by make_ranking_data_preprocessig
│   ├── make_ranking_data.ipynb                 # rank-weighted ranking, Top-3 ranking per embedding
│   └── cos_sim_music_serving.ipynb             # results for each embedding and ranking method
│
└── 5. web
    ├── crawling                                # data collection for the results page
    │   └── melon_crawling.py 
    │ 
    ├── data                                    # data used to build the web app
    │    ├── ranking_song_id2playlist.json
    │    ├── song_id2artist_name_basket.json
    │    ├── song_id2song_name.json
    │    └── song_name_artist_name2song_id.json
    │ 
    ├── static                                  # css, fonts, images, and js used to build the web app
    │    ├── css
    │    ├── fonts
    │    ├── images
    │    └── js
    │ 
    ├── templates                               # front end
    │    ├── about.html
    │    ├── index.html
    │    ├── people.html
    │    └── result.html
    │ 
    ├── server.py                               # back end
    │
    └── requirements.txt                        # list of required packages
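
As a rough idea of how the JSON files under 5. web/data might be chained at request time, the sketch below resolves a search string to a song id and then to display rows for its recommendations. The file names suggest these mappings, but the key formats, value shapes, the `recommend` function, and the toy entries are all hypothetical; the actual schema in the repository may differ.

```python
def recommend(query, name2id, id2ranking, id2name, id2artists):
    """Resolve a search string to display rows for its recommended songs."""
    song_id = name2id.get(query)
    if song_id is None:
        return []  # unknown song: nothing to recommend
    rows = []
    for rec_id in id2ranking.get(song_id, []):
        rows.append({
            "song_id": rec_id,
            "song_name": id2name.get(rec_id, ""),
            "artists": id2artists.get(rec_id, []),
        })
    return rows


# Hypothetical stand-ins for the four JSON files, already loaded as dicts:
# song_name_artist_name2song_id, ranking_song_id2playlist,
# song_id2song_name, song_id2artist_name_basket.
name2id = {"Artist A - Song One": "1"}
id2ranking = {"1": ["2", "3"]}
id2name = {"1": "Song One", "2": "Song Two", "3": "Song Three"}
id2artists = {"1": ["Artist A"], "2": ["Artist B"], "3": ["Artist C", "Artist D"]}

rows = recommend("Artist A - Song One", name2id, id2ranking, id2name, id2artists)
for row in rows:
    print(row["song_name"], "by", ", ".join(row["artists"]))
```

Precomputing the ranking per song id keeps the request path to a few dictionary lookups, so no model has to run while the page is being served.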

Owner

SeongBeomLEE
