Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Last update: Jul 22, 2022

Related tags

Overview

Datashredder

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

You can chose the chance of corruption e.g i have a chance of 100 therfore there is a 1 in 100 chance of the next peice of data to be corrupted this allows you to controll how much corruption you want.

You can also chose to have a random peice of corruption data or random e.g Corruption data is FF

Not Corrupted: 30 32 35 53 f0 72

Corrupted: 30 FF 35 53 FF 72

A random corruption would chose a random corruption data each iteration

Examples

Cats

Each image has a corruption data of 00

There is 206824 iterations on this image

Not corrupted image

Corrupted images

Image #	Chance	Corruptions
1	2000	39
2	1500	133
3	1000	200
4	500	432
5	200	1020
6	100	2069

simple way to build the declarative and destributed data pipelines with python

unipipeline simple way to build the declarative and distributed data pipelines. Why you should use it Declarative strict config Scaffolding Fully type

0 Jan 26, 2022

A python package which can be pip installed to perform statistics and visualize binomial and gaussian distributions of the dataset

GBiStat package A python package to assist programmers with data analysis. This package could be used to plot : Binomial Distribution of the dataset p

4 Oct 17, 2022

Python data processing, analysis, visualization, and data operations

Python This is a Python data processing, analysis, visualization and data operations of the source code warehouse, book ISBN: 9787115527592 Descriptio

1 Jan 16, 2022

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift This project is composed of two parts: Part1 and Part2

1 Jan 19, 2022

A computer algebra system written in pure Python

SymPy See the AUTHORS file for the list of authors. And many more people helped on the SymPy mailing list, reported bugs, helped organize SymPy's part

9.9k Dec 31, 2022

Very basic but functional Kakuro solver written in Python.

kakuro.py Very basic but functional Kakuro solver written in Python. It uses a reduction to exact set cover and Ali Assaf's elegant implementation of

4 Jan 15, 2022

Catalogue data - A Python Scripts to prepare catalogue data

catalogue_data Scripts to prepare catalogue data. Setup Clone this repo. Install

3 Mar 3, 2022

Convert tables stored as images to an usable .csv file

Convert an image of numbers to a .csv file This Python program aims to convert images of array numbers to corresponding .csv files. It uses OpenCV for

711 Dec 26, 2022

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

359 Dec 22, 2022

Releases(0.2.17)

0.2.17(Nov 18, 2021)
Changes:

Bug patches 9433cbf501bf18b2871df117121e8dbaed9a46dd

Removed tqdm 9ad0d65c49226755f5d7dffad99a5698ada68d22

Install Command: pip install pip install Datashredder==0.2.17

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.2.15...0.2.17
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.17-py3-none-any.whl(16.34 KB)
Datashredder-0.2.17.tar.gz(16.13 KB)
0.2.15(Nov 14, 2021)
Changes:

Added C installer

Added C help file

Added Makefile

Added pyproject.toml

Added setup.py

Improved Demo

Install Command: pip install pip install Datashredder==0.2.15

Full Changelog: https://github.com/awesomelewis2007/Datashredder/compare/0.1.10...0.2.15
Source code(tar.gz)
Source code(zip)
Datashredder-0.2.15-py3-none-any.whl(14.97 KB)
Datashredder-0.2.15.tar.gz(15.82 KB)
0.1.10(Oct 31, 2021)

This is the first release of datashredder

This release is not on pypi Full Changelog: https://github.com/awesomelewis2007/Datashredder/commits/0.1.10
Source code(tar.gz)
Source code(zip)

Owner

GitHub Repository

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python 📊

2 May 26, 2022

Senator Trades Monitor

Senator Trades Monitor This monitor will grab the most recent trades by senators and send them as a webhook to discord. Installation To use the monito

5 Jun 11, 2022

ETL flow framework based on Yaml configs in Python

ETL framework based on Yaml configs in Python A light framework for creating data streams. Setting up streams through configuration in the Yaml file.

18 Jul 06, 2022

This python script allows you to manipulate the audience data from Sl.ido surveys

Slido-Automated-VoteBot This python script allows you to manipulate the audience data from Sl.ido surveys Since Slido blocks interference from automat

1 Jan 24, 2022

Building house price data pipelines with Apache Beam and Spark on GCP

This project contains the process from building a web crawler to extract the raw data of house price to create ETL pipelines using Google Could Platform services.

1 Nov 22, 2021

CS50 pset9: Using flask API to create a web application to exchange stocks' shares.

C$50 Finance In this guide we want to implement a website via which users can “register”, “login” “buy” and “sell” stocks, like below: Background If y

1 Jan 24, 2022

Option Pricing Calculator using the Binomial Pricing Method (No Libraries Required)

Binomial Option Pricing Calculator Option Pricing Calculator using the Binomial Pricing Method (No Libraries Required) Background A derivative is a fi

1 Nov 29, 2021

A forecasting system dedicated to smart city data

smart-city-predictions System prognostyczny dedykowany dla danych inteligentnych miast Praca inżynierska realizowana przez Michała Stawikowskiego and

1 Nov 08, 2021

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift This project is composed of two parts: Part1 and Part2

1 Jan 19, 2022

PipeChain is a utility library for creating functional pipelines.

PipeChain Motivation PipeChain is a utility library for creating functional pipelines. Let's start with a motivating example. We have a list of Austra

2 Aug 07, 2022

Data cleaning tools for Business analysis

Datacleaning datacleaning tools for Business analysis This program is made for Vicky's work. You can use it, too. 数据清洗该数据清洗工具是为了商业分析这个程序是为了Vicky的工作而

3 Nov 16, 2021

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

Data lineage made simple, reliable, and automated. Effortlessly track the flow of data, understand dependencies and analyze impact. Features Visualiza

898 Jan 09, 2023

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Related tags

Overview

Datashredder

Examples

Cats

Not corrupted image

Corrupted images

You might also like...

simple way to build the declarative and destributed data pipelines with python

A python package which can be pip installed to perform statistics and visualize binomial and gaussian distributions of the dataset

Python data processing, analysis, visualization, and data operations

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

A computer algebra system written in pure Python

Very basic but functional Kakuro solver written in Python.

Catalogue data - A Python Scripts to prepare catalogue data

Convert tables stored as images to an usable .csv file

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Releases(0.2.17)

0.2.17(Nov 18, 2021)

0.2.15(Nov 14, 2021)

0.1.10(Oct 31, 2021)

Owner

Driver Analysis with Factors and Forests: An Automated Data Science Tool using Python

Senator Trades Monitor

ETL flow framework based on Yaml configs in Python

This python script allows you to manipulate the audience data from Sl.ido surveys

Building house price data pipelines with Apache Beam and Spark on GCP

CS50 pset9: Using flask API to create a web application to exchange stocks' shares.

Option Pricing Calculator using the Binomial Pricing Method (No Libraries Required)

A forecasting system dedicated to smart city data

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

PipeChain is a utility library for creating functional pipelines.

Data cleaning tools for Business analysis

Elementary is an open-source data reliability framework for modern data teams. The first module of the framework is data lineage.

Statsmodels: statistical modeling and econometrics in Python

A multi-platform GUI for bit-based analysis, processing, and visualization

Numerical Analysis toolkit centred around PDEs, for demonstration and understanding purposes not production

This is a repo documenting the best practices in PySpark.

Implementation in Python of the reliability measures such as Omega.

PLStream: A Framework for Fast Polarity Labelling of Massive Data Streams

Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Theano

Integrate bus data from a variety of sources (batch processing and real time processing).