Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Last update: Dec 29, 2022

Related tags

Overview

Multilingual Unsupervised Sentence Simplification

Code and pretrained models to reproduce experiments in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Prerequisites

Linux with python 3.6 or above.

Installing

git clone [email protected]:facebookresearch/muss.git
cd muss/
pip install -e .

How to use

Some scripts might still contain a few bugs, if you notice anything wrong, feel free to open an issue or submit a Pull Request.

Simplify sentences from a file using pretrained models

# English
python scripts/simplify.py scripts/examples.en --model-name muss_en_wikilarge_mined
# French
python scripts/simplify.py scripts/examples.fr --model-name muss_fr_mined
# French
python scripts/simplify.py scripts/examples.es --model-name muss_es_mined

Pretrained models should be downloaded automatically, but you can also find them here:
muss_en_wikilarge_mined
muss_en_mined
muss_fr_mined
muss_es_mined

Mine the data

python scripts/mine_sequences.py

Train the models

python scripts/train_model.py

Evaluate simplifications

Please head over to EASSE for Sentence Simplification evaluation.

License

The MUSS license is CC-BY-NC. See the LICENSE file for more details.

Authors

Louis Martin ([email protected])

Citation

If you use MUSS in your research, please cite MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases

@article{martin2021muss,
  title={MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases},
  author={Martin, Louis and Fan, Angela and de la Clergerie, {\'E}ric and Bordes, Antoine and Sagot, Beno{\^\i}t},
  journal={arXiv preprint arXiv:2005.00352},
  year={2021}
}

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Related tags

Overview

Multilingual Unsupervised Sentence Simplification

Prerequisites

Installing

How to use

Simplify sentences from a file using pretrained models

Mine the data

Train the models

Evaluate simplifications

License

Authors

Citation

Owner

Facebook Research

Deep Video Matting via Spatio-Temporal Alignment and Aggregation [CVPR2021]

abess: Fast Best-Subset Selection in Python and R

HDR Video Reconstruction: A Coarse-to-fine Network and A Real-world Benchmark Dataset (ICCV 2021)

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

(CVPR 2022) Energy-based Latent Aligner for Incremental Learning

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

Yolov5 + Deep Sort with PyTorch

a Lightweight library for sequential learning agents, including reinforcement learning

My published benchmark for a Kaggle Simulations Competition

Level Based Customer Segmentation

Official implementation of paper Gradient Matching for Domain Generalization

Incremental Cross-Domain Adaptation for Robust Retinopathy Screening via Bayesian Deep Learning

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Modular Probabilistic Programming on MXNet

A python library for self-supervised learning on images.

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

Camera calibration & 3D pose estimation tools for AcinoSet