Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

Last update: Jan 04, 2022

Related tags

Overview

fake-news-detector-1.0

Lists, lists and more lists...

Spam filter list, quality keyword list, stoplist list, top-domains urls list, news agencies websites list, university websites list, business websites lists and government organizations lists.

This gives us an initial score for the authority presenting the information.

If we can verify the source we are on the right track for building a fake news detector.

SPAM FILTER

The spam filter also gives us clues on the quality of the source.

TESTING, TESTING and TESTING

Next step is running more tests to see if the concept works.

Then we need to find more lists and maybe other tools like an API that can give us clues im discovering fake news.

I don't have all the answers but I am willing to code. It is a complicated problem and we may be limited on what can be done.

API may be a solution

I found two API that can make the project work.

URL Reputation API https://www.apivoid.com/api/url-reputation/

With this URL Reputation API you can detect potentially phishing and malicious URLs. We deeply analyze the URL (including the URL content, URL pattern, domain name, HTTP headers, domain TLD, etc) It not free so I will abandonne the API for the moment.

I found another API that could help the project in a more complicated way.

Search API worldwide news https://newsapi.org/?ref=apilist.fun

We could cross reference news events with this API. We could us it to validate if the story is fake or is trending. But this could get complicated.

Memo Sim @ Fake news detector filters project

AFTER TESTING : THE LIST CONCEPT WORKS VERY WELL

The lists work very well together and the system is able to detect bad and good sites. I am very happy with this module. We are also able to get nice quality indicators and statistics for web page quality source evaluation.

Fake news detector filters - Smart filter project allow to classify the quality of information and web pages

Related tags

Overview

Owner

Memo Sim

Subtitle Workshop (subshop): tools to download and synchronize subtitles

This repository structures data in title, summary, tags, sentiment given a fragment of a conversation

超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新

GPT-3 command line interaction

[ICLR 2021 Spotlight] Pytorch implementation for "Long-tailed Recognition by Routing Diverse Distribution-Aware Experts."

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Switch spaces for knowledge graph embeddings

Task-based datasets, preprocessing, and evaluation for sequence models.

A 10000+ hours dataset for Chinese speech recognition

Opal-lang - A WIP programming language based on Python

The model is designed to train a single and large neural network in order to predict correct translation by reading the given sentence.

REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.

Resources for "Natural Language Processing" Coursera course.

TalkNet: Audio-visual active speaker detection Model

Create a semantic search engine with a neural network (i.e. BERT) whose knowledge base can be updated

A high-level yet extensible library for fast language model tuning via automatic prompt search

Anuvada: Interpretable Models for NLP using PyTorch

An open source library for deep learning end-to-end dialog systems and chatbots.

KoBART model on huggingface transformers

Search msDS-AllowedToActOnBehalfOfOtherIdentity