Code for the ACL 2020 paper "Rigid Formats Controlled Text Generation"

Last update: Dec 17, 2022

Overview

SongNet

SongNet: SongCi + Song (Lyrics) + Sonnet + etc.
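
As a rough illustration of what a "rigid format" constraint means for genres such as SongCi or sonnets, the sketch below reduces a text to a character-level template (one placeholder per character, punctuation kept in place) and checks whether a candidate text fits that template. This is not code from this repository: the names `PUNCT`, `to_template`, and `conforms` are hypothetical, and the punctuation set is an assumption chosen only for the example.

```python
# Hypothetical illustration only; not part of the SongNet codebase.
# Assumed punctuation set (ASCII and full-width marks) used to split clauses.
PUNCT = set(",.!?;:，。！？；：、")


def to_template(text: str) -> str:
    """Map every non-punctuation character to '_' and keep punctuation as-is.

    Whitespace is dropped, so the template only records clause lengths
    and the positions of punctuation marks.
    """
    out = []
    for ch in text:
        if ch.isspace():
            continue
        out.append(ch if ch in PUNCT else "_")
    return "".join(out)


def conforms(candidate: str, template: str) -> bool:
    """Return True if the candidate text has exactly the given template."""
    return to_template(candidate) == template


reference = "明月几时有，把酒问青天。"
template = to_template(reference)
print(template)                                      # _____，_____。
print(conforms("春风吹又生，花落知多少。", template))  # True
print(conforms("春风吹，花落知多少。", template))      # False
```

The check works per character, which suits Chinese text; for English lyrics or sonnets a word-level variant would be the more natural assumption.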

@inproceedings{li-etal-2020-rigid,
    title = "Rigid Formats Controlled Text Generation",
    author = "Li, Piji and Zhang, Haisong and Liu, Xiaojiang and Shi, Shuming",
    booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.acl-main.68",
    doi = "10.18653/v1/2020.acl-main.68",
    pages = "742--751"
}

Run

python prepare_data.py
./train.sh

Evaluation

Edit test.py and set m_path to the model checkpoint that performed best on the dev set, then run:
./test.sh
python metrics.py

Polish

./polish.sh

Download

The pretrained Chinese language model: https://drive.google.com/file/d/1g2tGyUwPe86vPn2nub1vkQva5lwtZ6Rd/view
The fine-tuned SongCi model: https://drive.google.com/file/d/16A2AzuU7slf7xj2QdLcBAorUCCaCk650/view

Reference

Guyu: https://github.com/lipiji/Guyu
Pretraining: https://github.com/lipiji/big_tpl_zh_10_base

Owner

Piji Li
