NLP tool to extract emotional phrase from tweets 🤩

Last update: Oct 17, 2022

Overview

Emotional phrase extractor

Extract phrase in the given text that is used to express the sentiment. Capturing sentiment in language is important in these times where decisions and reactions are created and updated in seconds. But, which words actually lead to the sentiment description? This project aims to solve this problem.

Powered using Pytorch + hugggingface 🤗

Try it out.

git clone https://github.com/shahules786/twitter-emotions.git

cd twitter-emotions

sudo docker build --tag twitter-emotions:api .

sudo docker run -p 9999:9999  -it twitter-emotions:api python twitteremotions/app.py

Server will start running on port 9999 of localhost

Example

Installation for development

git clone https://github.com/shahules786/twitter-emotions.git

cd twitter-emotions

pip install -r requirements.txt

Train Model on your data

from twitteremotions.emotions import TwitterEmotions
emotions = TwitterEmotions()
emotions.train(train_path="data/train.csv", epochs=10, batch_size=32, max_len=168, test_size=0.25)

Contributing

All contrbutions are welcome 👋

You might also like...

HuggingTweets - Train a model to generate tweets

HuggingTweets - Train a model to generate tweets Create in 5 minutes a tweet generator based on your favorite Tweeter Make my own model with the demo

318 Jan 4, 2023

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

Colibri Core by Maarten van Gompel, [email protected], Radboud University Nijmegen Licensed under GPLv3 (See http://www.gnu.org/licenses/gpl-3.0.html

122 Nov 17, 2022

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

Frog for Python This is a Python binding to the Natural Language Processing suite Frog. Frog is intended for Dutch and performs part-of-speech tagging

46 Dec 14, 2022

The tool to make NLP datasets ready to use

chazutsu photo from Kaikado, traditional Japanese chazutsu maker chazutsu is the dataset downloader for NLP. import chazutsu r = chazutsu.data

243 Dec 29, 2022

Snips Python library to extract meaning from text

Snips NLU Snips NLU (Natural Language Understanding) is a Python library that allows to extract structured information from sentences written in natur

3.7k Dec 30, 2022

Search for documents in a domain through Google. The objective is to extract metadata

MetaFinder - Metadata search through Google _____ __ ___________ .__ .___ / \

85 Dec 16, 2022

Extract Keywords from sentence or Replace keywords in sentences.

FlashText This module can be used to replace keywords in sentences or extract keywords from sentences. It is based on the FlashText algorithm. Install

5.3k Jan 1, 2023

Snips Python library to extract meaning from text

Snips NLU Snips NLU (Natural Language Understanding) is a Python library that allows to extract structured information from sentences written in natur

3.5k Feb 12, 2021

Textpipe: clean and extract metadata from text

textpipe: clean and extract metadata from text textpipe is a Python package for converting raw text in to clean, readable text and extracting metadata

298 Nov 21, 2022

Comments

avoid confusion : end_tokens instead of start_tokens
Avoid Confusion

Replace start_tokens with end_tokens for the fourth argument to calculate the loss function to avoid confusion :)

While reviewing your amazing project, I noticed that the EmotionData class of the dataloader.py file is returning:

{ ... # start_tokens "start_tokens": torch.tensor(start_tokens, dtype=torch.long), # end_tokens "end_tokens": torch.tensor(end_tokens, dtype=torch.long), }

But in the engine.py file you are passing start_tokens for both the third and fourth arguments of the loss_fn():

loss = loss_fn( start, end, torch.argmax(data["start_tokens"], axis=1), torch.argmax(data["start_tokens"], axis=1) )

But the fourth has to be end_tokens. This minor change will not affect the loss_fn() output function since they are equal in all cases [=1].But, to respect conventions and avoid confusion, it would be better if it looks like the one shown below on the right:
opened by zekaouinoureddine 0

Releases(v1.0.0)

v1.0.0(May 17, 2021)

Trained Roberta base weights for twitter-emotions.
Source code(tar.gz)
Source code(zip)
emotion_torch.pth(475.54 MB)
pytorch_model.bin(477.98 MB)

Owner

Shahul ES

Data Scientist | Kaggle GrandMaster ( Rank 20) | Opensource @mljar

GitHub Repository

Implementation of Natural Language Code Search in the project CodeBERT: A Pre-Trained Model for Programming and Natural Languages.

CodeBERT-Implementation In this repo we have replicated the paper CodeBERT: A Pre-Trained Model for Programming and Natural Languages. We are interest

4 Jul 01, 2022

Phrase-Based & Neural Unsupervised Machine Translation

Unsupervised Machine Translation This repository contains the original implementation of the unsupervised PBSMT and NMT models presented in Phrase-Bas

1.5k Dec 28, 2022

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Wav2Vec2 STT Python Beta Software Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 mode

22 Dec 29, 2022

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

NLP-Summarizer Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5 This project aimed to provide in

1 Feb 07, 2022

Trained T5 and T5-large model for creating keywords from text

text to keywords Trained T5-base and T5-large model for creating keywords from text. Supported languages: ru Pretraining Large version | Pretraining B

61 Nov 24, 2022

SEJE is a prototype for the paper Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering.

SEJE is a prototype for the paper Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering. Contents Inst

0 Oct 21, 2021

Visual Automata is a Python 3 library built as a wrapper for Caleb Evans' Automata library to add more visualization features.

55 Nov 17, 2022

NLP tool to extract emotional phrase from tweets 🤩

Related tags

Overview

Emotional phrase extractor

Try it out.

Example

Installation for development

Contributing

You might also like...

HuggingTweets - Train a model to generate tweets

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

The tool to make NLP datasets ready to use

Snips Python library to extract meaning from text

Search for documents in a domain through Google. The objective is to extract metadata

Extract Keywords from sentence or Replace keywords in sentences.

Snips Python library to extract meaning from text

Textpipe: clean and extract metadata from text

Comments

avoid confusion : end_tokens instead of start_tokens

Avoid Confusion

Releases(v1.0.0)

v1.0.0(May 17, 2021)

Owner

Shahul ES

Implementation of Natural Language Code Search in the project CodeBERT: A Pre-Trained Model for Programming and Natural Languages.

Phrase-Based & Neural Unsupervised Machine Translation

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Trained T5 and T5-large model for creating keywords from text

SEJE is a prototype for the paper Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering.

GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

BERTAC (BERT-style transformer-based language model with Adversarially pretrained Convolutional neural network)

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

PyTorch original implementation of Cross-lingual Language Model Pretraining.

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

New Modeling The Background CodeBase

TalkNet: Audio-visual active speaker detection Model

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

Mesh TensorFlow: Model Parallelism Made Easier

open-information-extraction-system, build open-knowledge-graph(SPO, subject-predicate-object) by pyltp(version==3.4.0)

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

Code for hyperboloid embeddings for knowledge graph entities

Visual Automata is a Python 3 library built as a wrapper for Caleb Evans' Automata library to add more visualization features.