NLP codes implemented with Pytorch (w/o library such as huggingface)

Last update: Dec 28, 2021

Related tags

Overview

NLP_scratch

NLP codes implemented with Pytorch (w/o library such as huggingface)

scripts
├── models: Neural Network models
├── data: codes for dataloader and dataset or pre-processing
├── metrics
├── <[inference_for_each_task].py> ex) chit_chat.py, translate.py and classify_doc.py
└── <train_[task_name].py>... ex) train_summarization.py
└── utils: codes for utility and processing operation such as TF-IDF and crawler

Owner

GitHub Repository

Speech Recognition Database Management with python

Speech Recognition Database Management The main aim of this project is to recogn

2 Feb 02, 2022

Transformer related optimization, including BERT, GPT

This repository provides a script and recipe to run the highly optimized transformer-based encoder and decoder component, and it is tested and maintained by NVIDIA.

1.7k Jan 04, 2023

Scikit-learn style model finetuning for NLP

Scikit-learn style model finetuning for NLP Finetune is a library that allows users to leverage state-of-the-art pretrained NLP models for a wide vari

665 Dec 17, 2022

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

Crosslingual Coreference Coreference is amazing but the data required for training a model is very scarce. In our case, the available training for non

71 Jan 04, 2023

CoNLL-English NER Task (NER in English)

CoNLL-English NER Task en | ch Motivation Course Project review the pytorch framework and sequence-labeling task practice using the transformers of Hu

2 Jan 14, 2022

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

A combination of autoregressors and autoencoders using XLNet for sentiment analysis Abstract In this paper sentiment analysis has been performed in or

2 Nov 20, 2021

Datasets of Automatic Keyphrase Extraction

This repository contains 20 annotated datasets of Automatic Keyphrase Extraction made available by the research community. Following are the datasets and the original papers that proposed them. If yo

163 Dec 23, 2022

Associated Repository for "Translation between Molecules and Natural Language"

MolT5: Translation between Molecules and Natural Language Associated repository for "Translation between Molecules and Natural Language". Table of Con

67 Dec 15, 2022

Common Voice Dataset explorer

Common Voice Dataset Explorer Common Voice Dataset is by Mozilla Made during huggingface finetuning week Usage pip install -r requirements.txt streaml

22 Nov 16, 2022

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

NeuralQA: A Usable Library for (Extractive) Question Answering on Large Datasets with BERT Still in alpha, lots of changes anticipated. View demo on n

220 Dec 11, 2022

Korean stereoypte detector with TUNiB-Electra and K-StereoSet

Korean Stereotype Detector Korean stereotype sentence classifier using K-StereoSet with TUNiB-Electra Web demo you can test this model easily in demo

11 Feb 18, 2022

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

9 Jan 08, 2023

A repo for materials relating to the tutorial of CS-332 NLP

CS-332-NLP A repo for materials relating to the tutorial of CS-332 NLP Contents Tutorial 1: Introduction Corpus Regular expression Tokenization Tutori

9 Feb 15, 2022

Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer

MT5_paddle Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer English | 简体中文 mT5: A Massively

2 Oct 17, 2021

Using context-free grammar formalism to parse English sentences to determine their structure to help computer to better understand the meaning of the sentence.

Sentance Parser Executing the Program Make sure Python 3.6+ is installed. Install requirements $ pip install requirements.txt Run the program:

12 Sep 28, 2022

NLP codes implemented with Pytorch (w/o library such as huggingface)

Related tags

Overview

NLP_scratch

Owner

Speech Recognition Database Management with python

Transformer related optimization, including BERT, GPT

Scikit-learn style model finetuning for NLP

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

CoNLL-English NER Task (NER in English)

A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Datasets of Automatic Keyphrase Extraction

Associated Repository for "Translation between Molecules and Natural Language"

Common Voice Dataset explorer

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

Korean stereoypte detector with TUNiB-Electra and K-StereoSet

Code repository of the paper Neural circuit policies enabling auditable autonomy published in Nature Machine Intelligence

A repo for materials relating to the tutorial of CS-332 NLP

Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer

Using context-free grammar formalism to parse English sentences to determine their structure to help computer to better understand the meaning of the sentence.

Code for paper "Role-oriented Network Embedding Based on Adversarial Learning between Higher-order and Local Features"

Repositório do trabalho de introdução a NLP

🐍 A hyper-fast Python module for reading/writing JSON data using Rust's serde-json.

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation