Text classification on IMDB dataset using Keras and Bi-LSTM network

Last update: Sep 27, 2022

Overview

Text classification on IMDB dataset using Keras and Bi-LSTM network.

Usage

python3 main.py

Hyperparameters

Epochs: 12
Batch size: 128
Dropout: 0.5
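
The settings above could be wired into a Keras model roughly as follows. This is a minimal sketch, not the contents of main.py: the vocabulary size, sequence length, embedding size, number of LSTM units, optimizer, and the position of the dropout layer are all assumptions chosen for illustration.

```python
import numpy as np
from tensorflow import keras

# Values taken from the hyperparameter list above.
EPOCHS = 12
BATCH_SIZE = 128
DROPOUT = 0.5

# Assumed values; this README does not state them.
MAX_FEATURES = 20000  # vocabulary size
MAX_LEN = 100         # tokens per review after padding
EMBED_DIM = 128       # embedding vector size
LSTM_UNITS = 64       # units per LSTM direction


def build_model():
    """Build a small Bi-LSTM binary classifier (illustrative layout)."""
    model = keras.Sequential([
        keras.Input(shape=(MAX_LEN,), dtype="int32"),
        keras.layers.Embedding(MAX_FEATURES, EMBED_DIM),
        # Bidirectional runs one LSTM forward and one backward over the sequence.
        keras.layers.Bidirectional(keras.layers.LSTM(LSTM_UNITS)),
        keras.layers.Dropout(DROPOUT),
        # One sigmoid unit: probability that the review is positive.
        keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam",
                  loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


model = build_model()

# Shape check on dummy token ids; no dataset download is needed for this.
dummy_batch = np.zeros((2, MAX_LEN), dtype="int32")
probs = model.predict(dummy_batch, verbose=0)
```

Training would then call model.fit with batch_size=BATCH_SIZE and epochs=EPOCHS on padded IMDB sequences, for example those returned by keras.datasets.imdb.load_data(num_words=MAX_FEATURES).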

Model Accuracy

Training Loss: 0.0574
Training Accuracy: 0.9809
Validation Loss: 0.6073
Validation Accuracy: 0.8534

Terminology

Recurrent Neural Network

A recurrent neural network (RNN) is a type of neural network that carries information from earlier steps of a sequence forward as it processes later steps. It remembers the order of the data and uses patterns in the sequence to make predictions.

What sets an RNN apart from other neural networks is its feedback loops. These loops let the network process data as a sequence: information from one step is passed on to the next, and predictions are made from everything gathered so far. This carried-over information is often described as the network's memory.

The loop structure is what allows the network to accept a sequence as input and to share information across its steps. The output of one step becomes an input to the next, so each step depends on the steps that came before it.
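
The feedback loop can be sketched in a few lines of plain Python. This is a toy scalar RNN with hand-picked weights (w_x and w_h are illustrative assumptions, not values from this project); it only shows that the hidden state is fed back at every step, so the order of the inputs changes the result.

```python
import math


def rnn_forward(inputs, w_x=0.5, w_h=0.8, b=0.0):
    """Run a scalar RNN over a sequence and return every hidden state."""
    h = 0.0  # initial memory is empty
    states = []
    for x in inputs:
        # The previous hidden state h is fed back in: this is the loop.
        h = math.tanh(w_x * x + w_h * h + b)
        states.append(h)
    return states


# Same values, different order.
early_signal = rnn_forward([1.0, 0.0, 0.0])
late_signal = rnn_forward([0.0, 0.0, 1.0])

# The final states differ because the network saw the 1.0 at different times.
print(early_signal[-1], late_signal[-1])
```

A feed-forward network given the same three numbers as an unordered set could not tell these two cases apart; the recurrent state is what encodes position.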

Long Short Term Memory

Long short-term memory networks (LSTM) are a special kind of RNN, introduced to address the long-term dependency problem. A regular RNN often struggles to connect information from much earlier in a sequence to the point where it is needed, and RNNs would be far more useful if they could do this reliably. This difficulty is called the long-term dependency problem.

The repeating module in a standard RNN contains a single layer. An LSTM has the same chain-like structure, but its repeating module is built differently: it keeps a cell state and uses gates to decide what to keep, what to add, and what to output. Remembering information for long periods is therefore the default behaviour of an LSTM.

[Figure: block diagram of the LSTM repeating module]
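
To make the repeating module concrete, here is a minimal scalar LSTM step in plain Python. It is a sketch under stated assumptions: the gate weights are hand-picked rather than trained, and the params layout (gate name mapped to input weight, recurrent weight, and bias) is a hypothetical convention for this example. With the forget gate held open and the input gate nearly closed, the cell state survives 50 steps, while a plain RNN state with a recurrent weight of 0.8 fades.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, p):
    """One step of a scalar LSTM cell; p maps gate name -> (w_x, w_h, bias)."""
    def pre(name):
        w_x, w_h, b = p[name]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("forget"))        # how much of the old cell state to keep
    i = sigmoid(pre("input"))         # how much new information to write
    o = sigmoid(pre("output"))        # how much of the cell state to expose
    g = math.tanh(pre("candidate"))   # the new information itself
    c = f * c_prev + i * g            # updated cell state (long-term memory)
    h = o * math.tanh(c)              # updated hidden state (output)
    return h, c


# Hand-picked, untrained weights: forget gate open, input gate nearly closed.
params = {
    "forget": (1.0, 1.0, 10.0),
    "input": (1.0, 1.0, -10.0),
    "output": (1.0, 1.0, 0.0),
    "candidate": (1.0, 1.0, 0.0),
}

# Start with a stored value of 1.0 and feed 50 empty inputs.
h, c = 0.0, 1.0
for _ in range(50):
    h, c = lstm_step(0.0, h, c, params)

# For comparison, a plain RNN state under the same empty inputs.
rnn_h = 1.0
for _ in range(50):
    rnn_h = math.tanh(0.8 * rnn_h)

print(c, rnn_h)  # c stays close to 1.0; rnn_h has decayed toward 0
```

In a trained network the gates are learned, so the cell decides for itself which values to hold and which to overwrite.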

Bi-Directional Long Short Term Memory

A bidirectional long short-term memory network (Bi-LSTM) gives a neural network access to sequence information in both directions: forward (past to future) and backward (future to past).

In a Bi-LSTM the input flows in two directions, which is what distinguishes it from a regular LSTM. A regular LSTM processes the input in one direction only, either forward or backward. A bidirectional model processes it in both directions, so it preserves information from both the past and the future. An example makes this clearer.

Given only the fragment "boys go to…", we cannot fill in the blank. If we also see a later sentence such as "boys come out of school", the missing word becomes easy to predict. We want the model to do the same thing, and a bidirectional LSTM allows it: later context is used to inform an earlier position.
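
The two-direction idea can be sketched without any library. The example below uses a toy scalar recurrent step as a stand-in for an LSTM, with separate hand-picked weights per direction (all illustrative assumptions). It runs the sequence forward and backward and pairs the two states at each position, which mirrors how a bidirectional layer concatenates its forward and backward outputs.

```python
import math


def run_rnn(inputs, w_x, w_h):
    """Toy recurrent pass (stand-in for an LSTM); returns one state per step."""
    h, states = 0.0, []
    for x in inputs:
        h = math.tanh(w_x * x + w_h * h)
        states.append(h)
    return states


def bidirectional(inputs):
    """Return (forward_state, backward_state) for every position."""
    fwd = run_rnn(inputs, w_x=0.5, w_h=0.8)
    # Process the reversed sequence, then flip back so positions line up.
    bwd = run_rnn(inputs[::-1], w_x=0.3, w_h=0.6)[::-1]
    return list(zip(fwd, bwd))


# Two sequences that differ only in their last element.
a = bidirectional([1.0, 2.0, 3.0])
b = bidirectional([1.0, 2.0, -3.0])

# At position 0 the forward state is identical (it has only seen the first
# input), but the backward state differs because it has seen the ending.
print(a[0], b[0])
```

This is the "boys go to…" situation in miniature: the forward pass alone cannot distinguish the two sequences at the first position, while the backward pass carries the later context back to it.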

Owner

Hamza Rashid
