Quick insights from Zoom meeting transcripts using Graph + NLP

Last update: Sep 17, 2022

Overview

Transcript Analysis - Graph + NLP

This program extracts insights from Zoom Meeting Transcripts (.vtt) using TigerGraph and NLTK.

In order to run this program, modify the auth.ini file with your proper graph solution credentials and file paths. Then, simply run main.py. A sample transcript has been provided, but feel free to add your own into the \a_raw_transcripts directory!

As of now, this program performs the following tasks:

Convert .vtt into compact version (stored in \b_cmt_transcripts)
NLP analysis of compact transcript (using NLTK)
- Sentiment analysis
- Trigrams (collocations)
- Frequency of words (plotted)
- Meaningful words (shown as wordcloud)
- Number of speakers, names of speakers
- Who spoke the longest, least, average
Graph analysis of compact transcript (using TigerGraph)
- Analyze relationships between speakers
- Asked the most/least questions
- Pair w/ the most back-and-forth
- (TODO): Linking topics in semantic graph
- (TODO): Named-Entity Recognition
Visual output of all determined insights

Usage

A TigerGraph Cloud Portal solution (https://tgcloud.io/) will be required to run this program.

Kindly find the GraphStudio link here: https://transcript-analysis.i.tgcloud.io/

The schema utilized in this graph is fleshed out below:

Vertex: speaker

(PRIMARY ID) name - STRING

Edge: asked_question

text - STRING

Edge: answered_question

Here is an example of the graph populated with the sample transcript provided:

Analysis

Here is a screenshot of the command-line output produced:

Here is a frequency chart of meaningful words generated:

Here is a word cloud that visualizes common, key terms:

More features coming soon! In the meantime, feel free to continue creating and adding new insights 😁 😁

Quick insights from Zoom meeting transcripts using Graph + NLP

Related tags

Overview

Transcript Analysis - Graph + NLP

Usage

Analysis

References

Owner

Advit Deepak

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

The tool to make NLP datasets ready to use

A linter to manage all your python exceptions and try/except blocks (limited only for those who like dinosaurs).

Extract rooms type, door, neibour rooms, rooms corners nad bounding boxes, and generate graph from rplan dataset

A sentence aligner for comparable corpora

This is a project built for FALLABOUT2021 event under SRMMIC, This project deals with NLP poetry generation.

Bpe algorithm can finetune tokenizer - Bpe algorithm can finetune tokenizer

Natural Language Processing

spaCy plugin for Transformers , Udify, ELmo, etc.

Anomaly Detection 이상치 탐지 전처리 모듈

ReCoin - Restoring our environment and businesses in parallel

A BERT-based reverse-dictionary of Korean proverbs

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Simple tool/toolkit for evaluating NLG (Natural Language Generation) offering various automated metrics.

Global Rhythm Style Transfer Without Text Transcriptions

超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新

Natural Language Processing Specialization

String Gen + Word Checker

Text vectorization tool to outperform TFIDF for classification tasks

A NLP program: tokenize method, PoS Tagging with deep learning