Finetuning Pipeline

Last update: Dec 13, 2022

Related tags

Overview

KLUE Baseline

KLUE-baseline contains the baseline code for the Korean Language Understanding Evaluation (KLUE) benchmark. See our paper for more details about KLUE and the baselines.

Dependencies

Make sure you have installed the packages listed in requirements.txt.

pip install -r requirements.txt

All expereiments are tested under Python 3.7 environment.

KLUE Benchmark Datasets

All train/dev sets of KLUE tasks are publicly available in this repo. You can access them by using git submodules. To clone the repo with datasets:

git clone --recursive https://github.com/KLUE-benchmark/KLUE-Baseline.git

or just download datasets after cloned this repo:

git submodule update --init --recursive

All test sets are not publicly available. To measure performance of your model on test set, you should first train your model on train set and submit the model to our submission system. Alternatively, you can compare dev set performances with our baseline models. They are also reported in our paper.

Train

To reproduce our baselines, run run_all.sh.

NOTE: klue/roberta models accept input length at most 510 tokens. Details are explained here.

Reference

If you use this code or KLUE, please cite:

@misc{park2021klue,
      title={KLUE: Korean Language Understanding Evaluation}, 
      author={Sungjoon Park and Jihyung Moon and Sungdong Kim and Won Ik Cho and Jiyoon Han and Jangwon Park and Chisung Song and Junseong Kim and Yongsook Song and Taehwan Oh and Joohong Lee and Juhyun Oh and Sungwon Lyu and Younghoon Jeong and Inkwon Lee and Sangwoo Seo and Dongjun Lee and Hyunwoo Kim and Myeonghwa Lee and Seongbo Jang and Seungwon Do and Sunkyoung Kim and Kyungtae Lim and Jongwon Lee and Kyumin Park and Jamin Shin and Seonghyun Kim and Lucy Park and Alice Oh and Jung-Woo Ha and Kyunghyun Cho},
      year={2021},
      eprint={2105.09680},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Contribution

Feel free to leave issues if there are any questions or comments. To contribute, please run make style before creating pull requests.

Finetuning Pipeline

Related tags

Overview

KLUE Baseline

Dependencies

KLUE Benchmark Datasets

Train

Reference

Contribution

Owner

This repository is the official implementation of Open Rule Induction. This paper has been accepted to NeurIPS 2021.

Dados coletados e programas desenvolvidos no processo de iniciação científica

Anomaly Detection Based on Hierarchical Clustering of Mobile Robot Data

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Detecting Potentially Harmful and Protective Suicide-related Content on Twitter

Shallow Convolutional Neural Networks for Human Activity Recognition using Wearable Sensors

Official implement of Paper：A deeply supervised image fusion network for change detection in high resolution bi-temporal remote sening images

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Pytorch Implementation for (STANet+ and STANet)

The Python ensemble sampling toolkit for affine-invariant MCMC

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Python scripts form performing stereo depth estimation using the high res stereo model in PyTorch .

Readings for "A Unified View of Relational Deep Learning for Polypharmacy Side Effect, Combination Therapy, and Drug-Drug Interaction Prediction."

Subgraph Based Learning of Contextual Embedding

Try out deep learning models online on Google Colab

cisip-FIRe - Fast Image Retrieval

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)