Natural Language Processing Specialization

Last update: Oct 06, 2022

Overview

Natural Language Processing Specialization

In this folder, Natural Language Processing Specialization projects and notes can be found.

WHAT I LEARNED

Use logistic regression, naïve Bayes, and word vectors to implement sentiment analysis, complete analogies & translate words.
Use dynamic programming, hidden Markov models, and word embeddings to implement autocorrect, autocomplete & identify part-of-speech tags for words.
Use recurrent neural networks, LSTMs, GRUs & Siamese networks in Trax for sentiment analysis, text generation & named entity recognition.
Use encoder-decoder, causal, & self-attention to machine translate complete sentences, summarize text, build chatbots & question-answering.

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

In the first course of the Natural Language Processing Specialization
I performed sentiment analysis of tweets using logistic regression and then naïve Bayes,
I used vector space models to discover relationships between words and used PCA to reduce the dimensionality of the vector space and visualize those relationships, and
I wrote a simple English to French translation algorithm using pre-computed word embeddings and locality-sensitive hashing to relate words via approximate k-nearest neighbor search.

Projects

Course 2 - Natural Language Processing with Probabilistic Models

In the second course of the Natural Language Processing Specialization
I wrote a simple auto-correct algorithm using minimum edit distance and dynamic programming,
I applied the Viterbi Algorithm for part-of-speech (POS) tagging, which is vital for computational linguistics,
I wrote a better auto-complete algorithm using an N-gram language model, and
I wrote my own Word2Vec model that uses a neural network to compute word embeddings using a continuous bag-of-words model.

Projects

Course 3 - Natural Language Processing with Sequence Models

In the third course of the Natural Language Processing Specialization
I trained a neural network with GLoVe word embeddings to perform sentiment analysis of tweets,
I generated synthetic Shakespeare text using a Gated Recurrent Unit (GRU) language model,
I trained a recurrent neural network to perform named entity recognition (NER) using LSTMs with linear layers, and
I used so-called ‘Siamese’ LSTM models to compare questions in a corpus and identify those that are worded differently but have the same meaning.

Projects

Course 4 - Natural Language Processing with Attention Models

In the fourth course of the Natural Language Processing Specialization
I translated complete English sentences into German using an encoder-decoder attention model,
I built a Transformer model to summarize text,
I used T5 and BERT models to perform question-answering, and
I built a chatbot using a Reformer model.

Projects

Disclaimer

DeepLearning.AI makes course notes available for educational purposes.
Project solutions are just for educational purposes. I highly recommend trying and solving project/program assignments on your own.

All the best 🤘

Natural Language Processing Specialization

Related tags

Overview

Natural Language Processing Specialization

WHAT I LEARNED

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

Projects

Course 2 - Natural Language Processing with Probabilistic Models

Projects

Course 3 - Natural Language Processing with Sequence Models

Projects

Course 4 - Natural Language Processing with Attention Models

Projects

Disclaimer

Owner

Kaan BOKE

Convolutional 2D Knowledge Graph Embeddings resources

Implementation of ProteinBERT in Pytorch

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

BERT score for text generation

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

An IVR Chatbot which can exponentially reduce the burden of companies as well as can improve the consumer/end user experience.

Unsupervised intent recognition

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

The aim of this task is to predict someone's English proficiency based on a text input.

customer care chatbot made with Rasa Open Source.

中文空间语义理解评测

Addon for adding subtitle files to blender VSE as Text sequences. Using pysub2 python module.

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

BERN2: an advanced neural biomedical namedentity recognition and normalization tool

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Pytorch NLP library based on FastAI

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

"Investigating the Limitations of Transformers with Simple Arithmetic Tasks", 2021

Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch