This is a text data analysis project built around a YouTube case study.

Overview

Text_Data_Analysis


Problem Statement => Sentiment Analysis.

Package1:

There are many sentiment analysis packages, such as VADER and spaCy. In this project I am using TextBlob, which is an NLP library.

TextBlob sentiment analysis returns two values: polarity and subjectivity.

-- Polarity: ranges from -1 (most negative sentiment) to +1 (most positive sentiment).

-- Subjectivity: ranges from 0 (objective, no personal opinion in the sentence) to 1 (highly subjective).

Error_Handling

Use try/except to handle errors in your code, so that one comment that cannot be scored does not stop the whole analysis.
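A minimal sketch of that pattern, assuming the comments arrive as a list that may contain non-string values such as missing entries. The names `score_comments`, `scorer` and `toy_scorer` are illustrative and not taken from the project; `scorer` stands in for whatever function returns the sentiment score.

```python
def score_comments(comments, scorer, default=0.0):
    """Score each comment, substituting `default` when scoring fails."""
    scores = []
    for text in comments:
        try:
            scores.append(scorer(text))
        except Exception:
            # One bad row (for example a missing value) should not stop the loop.
            scores.append(default)
    return scores


def toy_scorer(text):
    """Hypothetical stand-in scorer that fails on non-string input."""
    if not isinstance(text, str):
        raise TypeError("expected a string")
    return 1.0 if "good" in text.lower() else -1.0


scores = score_comments(["Good video", None, "boring"], toy_scorer)
```

With TextBlob installed, the scorer could be something like `lambda text: TextBlob(text).sentiment.polarity`.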

Package2:

The second package I'll be using, to visualize the results of the sentiment analysis, is wordcloud.

-- wordcloud: the more often a keyword occurs, the bigger its font, so the keywords drawn largest are the most important ones.

-- wordcloud: the data must be a single string before it is passed in.

Installing wordcloud.

Installing wordcloud can be tricky. Whether you use the command prompt or the conda prompt, open it as administrator and run the commands below.

-- For conda: conda install -c https://conda.anaconda.org/conda-forge wordcloud

-- For the command prompt:

git clone https://github.com/amueller/word_cloud.git
cd word_cloud
pip install .

Stopwords.

These are words that add no meaning to the analysis, such as He, Him, Is, The.

-- wordcloud has a parameter that removes these stopwords, i.e. stopwords=set(STOPWORDS).
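The two preparation steps (one single string, no stopwords) can be sketched in plain Python. This is only an illustration: `SAMPLE_STOPWORDS` and `build_cloud_text` are hypothetical names, the stopword list is a tiny stand-in for the much larger set that wordcloud ships, and wordcloud does its own filtering when given the `stopwords` parameter.

```python
# Hypothetical mini stopword list, for illustration only.
SAMPLE_STOPWORDS = {"he", "him", "is", "the"}


def build_cloud_text(comments, stopwords=SAMPLE_STOPWORDS):
    """Join all comments into one string, dropping stopwords."""
    words = []
    for comment in comments:
        for word in comment.split():
            if word.lower() not in stopwords:
                words.append(word)
    return " ".join(words)


text = build_cloud_text(["He is the best", "The editing is great"])

# With wordcloud installed, the joined string could then be passed on, e.g.
# WordCloud(stopwords=set(STOPWORDS)).generate(text)
```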

Negative comment visualization.

Negative_comment

Positive comment visualization.

Positive_comment

Problem Statement => Emoji Analysis.

-- In a Jupyter notebook: !pip install emoji

-- In the conda prompt or command prompt (opened as administrator): pip install emoji

Note:

Once you have the emoji_list, you need to compute frequencies from it, which means producing data in the form of a dict (emoji -> count). This can be implemented completely from scratch or with the built-in count method, but in this analysis we use the collections module.
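A short sketch of the frequency step with `collections.Counter`, assuming the emojis have already been extracted into a list with one entry per occurrence. The sample values are made up.

```python
from collections import Counter

# Assumed input: emojis already pulled out of the comments.
emoji_list = ["😂", "❤", "😂", "🔥", "😂", "❤"]

emoji_counts = Counter(emoji_list)        # behaves like a dict: emoji -> count
top_emojis = emoji_counts.most_common(2)  # most frequent first
```

`most_common` returns (emoji, count) pairs, which is a convenient shape for a bar chart.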

Emoji Visualization.

emoji

Problem Statement => Collecting the Entire data of Youtube.

-- The first way is to use the os module: interact with the operating system and use a path to access all the files.

-- The second way is glob, which is considered to be the best way.
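Both approaches can be sketched as follows, assuming the data files are CSV files sitting in one folder. The function names and the `*.csv` pattern are assumptions for illustration, not taken from the project.

```python
import glob
import os


def find_data_files(folder, pattern="*.csv"):
    """Collect matching files with glob, sorted for a stable order."""
    return sorted(glob.glob(os.path.join(folder, pattern)))


def find_data_files_os(folder, suffix=".csv"):
    """The same result using os.listdir and a manual suffix check."""
    return sorted(
        os.path.join(folder, name)
        for name in os.listdir(folder)
        if name.endswith(suffix)
    )
```

The glob version keeps the filtering in the pattern itself, which is one reason it tends to be the shorter option.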

Most common encodings for reading data.

-- latin, UTF-8, iso-8859-1 (essential for reading complex data, for example the Japanese data), cp1252
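When the encoding of a file is not known in advance, one option is to try several in turn. The sketch below is an assumption about how that could be done, not the project's code; `decode_with_fallback` is a hypothetical helper. iso-8859-1 is placed last because it maps every possible byte to a character and therefore never raises a decoding error.

```python
def decode_with_fallback(raw, encodings=("utf-8", "cp1252", "iso-8859-1")):
    """Return (text, encoding_name) for the first encoding that decodes `raw`."""
    for name in encodings:
        try:
            return raw.decode(name), name
        except UnicodeDecodeError:
            continue  # try the next candidate
    raise ValueError("none of the encodings could decode the data")


# Bytes written as cp1252 are not valid UTF-8 here, so the second candidate is used.
text, used = decode_with_fallback("café".encode("cp1252"))
```

A successful decode only means the bytes were valid in that encoding, not that the text is correct, so it is worth inspecting a few rows after loading.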

Problem Statement => Which Category has the Maximum Likes.

The category file was cleaned and converted to a dict. The category names in the dict were then mapped onto category_id in full_df and stored in a new column of full_df.

-- Finding the maximum likes per category can also be achieved with groupby, but we used a boxplot to visualize it.
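The mapping and the group-wise maximum can be illustrated in plain Python. This is a sketch of the idea rather than the pandas code used in the project; the ids, names and like counts are made-up sample data.

```python
from collections import defaultdict

# Hypothetical sample data.
category_names = {10: "Music", 24: "Entertainment"}
videos = [
    {"category_id": 10, "likes": 500},
    {"category_id": 24, "likes": 120},
    {"category_id": 10, "likes": 80},
]

max_likes = defaultdict(int)
for video in videos:
    # Map the numeric id to its name and store it as a new field.
    name = category_names.get(video["category_id"], "Unknown")
    video["category_name"] = name
    # Keep the largest like count seen so far for this category.
    max_likes[name] = max(max_likes[name], video["likes"])

top_category = max(max_likes, key=max_likes.get)
```

In pandas the same two steps would typically be a `map` on the category_id column followed by a `groupby` on the new column.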

Visualization of the most liked category.

most_like_category

Problem Statement => Find out whether the audience is engaged or not.

The three most important metrics to consider when solving this kind of problem are like_rate, dislike_rate and comment_rate.
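The README does not spell out how the rates are computed. The sketch below assumes each rate is the count expressed as a percentage of views; that formula and the name `engagement_rates` are assumptions.

```python
def engagement_rates(views, likes, dislikes, comments):
    """Return like, dislike and comment rates as percentages of views."""
    if views <= 0:
        # Avoid dividing by zero for videos without recorded views.
        return {"like_rate": 0.0, "dislike_rate": 0.0, "comment_rate": 0.0}
    return {
        "like_rate": likes * 100 / views,
        "dislike_rate": dislikes * 100 / views,
        "comment_rate": comments * 100 / views,
    }


rates = engagement_rates(views=2000, likes=100, dislikes=10, comments=40)
```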

Like rate:

likes_rate

Analysing whether views affect likes.

In this case you can use a scatter plot or regplot to check, and you can also compute the correlation and visualize it with a heatmap.
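For reference, the correlation behind such a heatmap is usually the Pearson coefficient. A from-scratch sketch, with made-up view and like counts and a hypothetical helper name `pearson_r`:

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant data")
    return cov / sqrt(var_x * var_y)


# Hypothetical sample: likes grow roughly in step with views.
views = [100, 200, 300, 400]
likes = [12, 19, 33, 41]
r = pearson_r(views, likes)
```

A value close to +1 means likes rise with views, a value close to 0 means no linear relationship. On a data frame, pandas `corr()` computes the same coefficient for every pair of numeric columns.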

Note.

When using seaborn regplot in a Jupyter notebook, set ci=None. This skips the bootstrapped confidence interval, which can be very slow to compute on a large dataset.

Problem Statement => Analyse trending videos

In this project the data frame had only channel_title and video_id, so we used groupby to group the video_ids by channel_title and renamed the video_id column to total_video. The channel with the highest count of video_ids had the most trending videos.
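The counting step can be sketched without pandas, assuming the data is a list of (channel_title, video_id) pairs. The channel names and ids below are made up.

```python
from collections import Counter

# Hypothetical rows: (channel_title, video_id).
rows = [
    ("Channel A", "v1"),
    ("Channel B", "v2"),
    ("Channel A", "v3"),
    ("Channel A", "v4"),
    ("Channel B", "v5"),
]

# Count how many video ids appear under each channel title.
total_videos = Counter(channel for channel, _video_id in rows)
top_channel, top_count = total_videos.most_common(1)[0]
```

In pandas this corresponds to grouping by channel_title and counting video_id.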

Visualization of trending videos.

trending videos

Problem Statement => Does punctuation in the title and tags have any relation with views, likes, dislikes and comments?

We have to extract all the punctuation marks from title, channel_title or tags. This can be done completely from scratch or with Python's built-in modules, such as re (regular expressions) or string. In this project we use the string module.

-- To check whether punctuation affects views, likes, dislikes or comments, compute the correlation between the punctuation count in title, channel_title or tags and the views, likes, dislikes or comments.
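A minimal sketch of the counting step with the string module. The helper name `count_punctuation` and the sample titles are made up for illustration.

```python
import string


def count_punctuation(text):
    """Count characters of `text` that appear in string.punctuation."""
    return sum(1 for ch in text if ch in string.punctuation)


titles = ["Hello, world!", "No punctuation here", "WOW!!! (official)"]
title_counts = [count_punctuation(title) for title in titles]
```

The resulting counts can be stored in a new column and correlated with views, likes, dislikes or comments.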

Title punctuation count plot.

title_punc_count
