Image Captioning using CNN, LSTM and Attention

Last update: Dec 16, 2021

Related tags

Deep Learning imagecaptioningproject

Overview

Image Captioning using CNN, LSTM and Attention

This is a deep learning model that tries to summarize an image as a short text caption.

Installation

Install the project's dependencies with pip3. Use Python 3.7.

  pip3 install -r requirements.txt
  python3 app.py

Use these commands if you want to try the website on localhost.

You can also install Docker, build an image from the Dockerfile, and run it.

  docker build -f Dockerfile -t imagecaptioning:api .
  docker run -p 8080:8080 -ti imagecaptioning:api

Deployment

To deploy this project on Google Cloud App Engine, first create a project in App Engine. Install the Google Cloud SDK on your local machine so that you can push projects, then run the following commands.

  gcloud init
  gcloud app deploy

Choose the right project, then push the application to the cloud. This is a monolithic application, so a single Docker image is built on App Engine.

Demo

Link to demo: https://lucky-dahlia-333406.el.r.appspot.com/index

FAQ

Why is this project implemented in TensorFlow?

TensorFlow is actively maintained by Google and is convenient to deploy on a server. It automatically uses a GPU for training if it finds one.
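To see whether TensorFlow would pick up a GPU on a given machine, a quick check like the one below may help. It is only a sketch: the helper name visible_gpus is hypothetical and not part of this project, and it returns an empty list when TensorFlow is not installed.

```python
def visible_gpus():
    """Return the names of GPUs TensorFlow can see, or [] if TensorFlow is missing."""
    try:
        import tensorflow as tf
    except ImportError:
        # Assumption for this sketch: treat a missing install as "no GPU".
        return []
    # list_physical_devices("GPU") returns one entry per GPU TensorFlow detects.
    return [device.name for device in tf.config.list_physical_devices("GPU")]


print(visible_gpus() or "No GPU visible; TensorFlow will run on the CPU.")
```

An empty result means training falls back to the CPU.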

What is the BLEU score?

BLEU (Bilingual Evaluation Understudy) is a score for comparing a candidate translation of text to one or more reference translations. Although developed for translation, it can be used to evaluate text generated for a range of natural language processing tasks.

This project uses the BLEU score, computed with the NLTK library in Python, to evaluate and score candidate text.
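To make the score more concrete, here is a minimal pure-Python sketch of unsmoothed sentence-level BLEU: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. It is for illustration only and is not the project's code, which relies on NLTK; the function names and the example sentences are assumptions made for this sketch.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count every n-gram (as a tuple) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(references, candidate, n):
    """Clipped precision: a candidate n-gram is credited at most as many
    times as it occurs in the single reference that contains it most often."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def bleu(references, candidate, max_n=4):
    """Unsmoothed BLEU with uniform weights over 1..max_n grams."""
    precisions = [modified_precision(references, candidate, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0  # without smoothing, any zero precision zeroes the score
    c = len(candidate)
    # Brevity penalty uses the reference length closest to the candidate's.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)


reference = "a dog runs on the grass".split()
caption = "a dog runs on grass".split()
print(round(bleu([reference], caption), 4))
```

A caption identical to a reference scores 1.0, and a caption that shares no words with any reference scores 0.0. Short captions often have no matching 4-grams, which is why library implementations typically offer smoothing options.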

Authors

License

MIT

Owner

ASUTOSH GHANTO
