This is my implementation of the K-nearest neighbors algorithm from scratch using Python

Last update: Jan 08, 2022

Overview

K Nearest Neighbors (KNN) algorithm

In machine learning, many algorithms are designed for classification problems, such as Logistic Regression, Decision Tree, Random Forest, and Gradient Boosting over Decision Trees, and KNN is no exception. When I approach a classification problem, KNN is usually the first algorithm that comes to mind, because it is arguably one of the simplest to implement. In this project, I build a KNN model to classify flower types in the Iris dataset.

About data

The dataset I use is a flower dataset called Iris. Here is what you should know about it:

The data set contains 3 classes (Iris Setosa, Iris Versicolor, Iris Virginica) of 50 instances each, where each class refers to a type of iris plant.
There are 4 attributes in total, which are sepal length in cm, sepal width in cm, petal length in cm, petal width in cm.

Environment

MacOS Monterey 12.1, Anaconda Virtual Environment, Python 3.9.7 64-bit

Requirements

KNN procedures

1. Load the data
2. Initialise the value of K
3. To get the predicted class for a test sample, iterate over all training data points:

3.1 Calculate the distance between the test sample and each row of training data. Here we use the L2 norm (Euclidean distance) as the distance metric, since it is a common default choice for KNN

3.2 Sort the calculated distances in ascending order based on distance values

3.3 Get top K nearest neighbors from the sorted array

3.4 Pick the class with the most votes among these K neighbors

3.5 Return the predicted class
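
The steps above can be sketched in a few lines of plain Python. This is only an illustrative sketch and not necessarily how knn.py is written: the function names euclidean_distance and knn_predict, the default k=3, the label strings, and the six hand-typed rows in the Iris format are all assumptions made for the example.

```python
import math
from collections import Counter


def euclidean_distance(a, b):
    """L2 norm between two equal-length feature vectors (step 3.1)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_predict(train_rows, train_labels, test_row, k=3):
    """Predict the class of test_row by majority vote of its k nearest neighbors."""
    # 3.1 Distance from the test sample to every training row
    distances = [
        (euclidean_distance(test_row, row), label)
        for row, label in zip(train_rows, train_labels)
    ]
    # 3.2 Sort in ascending order of distance
    distances.sort(key=lambda pair: pair[0])
    # 3.3 Keep the labels of the k closest rows
    nearest_labels = [label for _, label in distances[:k]]
    # 3.4 and 3.5 Return the label with the most votes
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical toy data: sepal length, sepal width, petal length, petal width (cm)
train_rows = [
    [5.1, 3.5, 1.4, 0.2],
    [4.9, 3.0, 1.4, 0.2],
    [7.0, 3.2, 4.7, 1.4],
    [6.4, 3.2, 4.5, 1.5],
    [6.3, 3.3, 6.0, 2.5],
    [5.8, 2.7, 5.1, 1.9],
]
train_labels = [
    "Iris-setosa",
    "Iris-setosa",
    "Iris-versicolor",
    "Iris-versicolor",
    "Iris-virginica",
    "Iris-virginica",
]

prediction = knn_predict(train_rows, train_labels, [5.0, 3.4, 1.5, 0.2], k=3)
print(prediction)
```

With this toy data, two of the three nearest neighbors are setosa rows, so the vote goes to that class. If two classes tie, this sketch returns whichever tied class appears first among the nearest neighbors; a real implementation might prefer an odd K or a distance-weighted vote.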

How to run this program

In your project folder, open a terminal and run python knn.py

Result

Reference:

For more information about the KNN algorithm, please follow this link:

Introduction to K-Nearest Neighbors: A powerful Machine Learning Algorithm

Owner

sonny1902
