Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Last update: Feb 07, 2022

Related tags

Overview

NLP-Summarizer

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

This project aimed to provide insight and explanations to current limitations on Natural Language Processing models by exploring the Transformer model, the latest state-of-the-art NLP solution, as well as discussing possible use cases for such tools in a domestic and workplace environment. An in-depth explanation of the architecture and the limitations it aims to solve was provided, as well as how it can be used to infer various tasks. Numerous use cases of NLP were also explored and how tools such as this can be extremely useful and have a massive impact on today’s society, both domestically and in the workplace. Three specific Transformer models were implemented using a GUI to evaluate their effectiveness. The final artefact provides a user with an interaction between the models for document summarisation tasks of variable output lengths.

Working Example

Following example created using another student's project introduction, original word count was ~1000.

Initial GUI

After Summarization

Getting Started

All code is ran using Python version 3.8.8
The artefact to be operated in it's entirety requires ~20GB of available space for downloads of the pre-trained models.

!pip install transformers
!pip install spacy==2.0.12
!pip install torch
!pip install tk

Runtime will be displayed as an output in console

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Related tags

Overview

NLP-Summarizer

Working Example

Initial GUI

After Summarization

Owner

Samuel Sharkey

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Tool to add main subject to items on Wikidata using a WMFs CirrusSearch for named entity recognition or a manually supplied list of QIDs

We have built a Voice based Personal Assistant for people to access files hands free in their device using natural language processing.

Open solution to the Toxic Comment Classification Challenge

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data

[KBS] Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

precise iris segmentation

Espial is an engine for automated organization and discovery of personal knowledge

Simple Annotated implementation of GPT-NeoX in PyTorch

Higher quality textures for the Metal Gear Solid series.

The code from the whylogs workshop in DataTalks.Club on 29 March 2022

[EMNLP 2021] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Official code for "Parser-Free Virtual Try-on via Distilling Appearance Flows", CVPR 2021

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

Twitter-Sentiment-Analysis - Analysis of twitter posts' positive and negative score.

This is a MD5 password/passphrase brute force tool

SimCTG - A Contrastive Framework for Neural Text Generation