This is a tool for inferring ancestral alleles, calculating the site frequency spectrum (SFS), and drawing its bar plot.

Last update: Dec 16, 2022

superSFS

superSFS infers ancestral alleles, calculates the site frequency spectrum (SFS), and draws it as a bar plot. It is easy to use and runs fast. You need to prepare three inputs: a phased VCF file containing both the populations you are interested in and the outgroup, an outgroup names file, and an annotation file. Enjoy!

It has four modes:

0: Run the full pipeline, from the original VCF data to the SFS bar plot
1: Only infer the ancestral allele and output a new VCF file that uses the inferred allele as the reference
2: Only count the frequency of the derived allele at each SNP in each population
3: Only draw the SFS bar plot from the data produced by the SFS calculation

Example:

Mode 0: python superSFS 0 ogdir threshold vcfdir annodir modir coutdir plotdir group
Mode 1: python superSFS 1 ogdir threshold vcfdir outdir
Mode 2: python superSFS 2 annodir modir coutdir
Mode 3: python superSFS 3 coutdir plotdir group
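
To illustrate what options 2 and 3 compute, here is a minimal sketch of counting derived alleles per SNP and tallying them into an SFS. It is not the code superSFS runs: the function names, the toy genotypes, and the assumption that derived alleles are coded as "1" in phased diploid genotypes are all hypothetical.

```python
from collections import Counter


def derived_allele_count(genotypes):
    """Count derived alleles (assumed to be coded '1') in phased genotypes like '0|1'."""
    return sum(gt.split("|").count("1") for gt in genotypes)


def site_frequency_spectrum(derived_counts, n_chromosomes):
    """Return a list whose entry i-1 is the number of SNPs carrying i derived copies.

    Only segregating classes (1 .. n_chromosomes - 1) are reported.
    """
    tally = Counter(derived_counts)
    return [tally.get(i, 0) for i in range(1, n_chromosomes)]


# Hypothetical toy data: three SNPs genotyped in three diploid samples of one group.
snps = [
    ["0|1", "0|0", "0|0"],
    ["1|1", "0|1", "0|0"],
    ["0|0", "1|0", "0|0"],
]

counts = [derived_allele_count(gts) for gts in snps]
sfs = site_frequency_spectrum(counts, n_chromosomes=6)
print(counts)
print(sfs)
```

The resulting list can be passed to any bar-plot routine (for example matplotlib's `plt.bar`), with the derived allele count on the x axis and the number of SNPs on the y axis.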

Explanation of each parameter:

ogdir: path to the outgroup names file
threshold: a number; if the count of the variant allele in the outgroup is greater than this number, the variant allele is treated as the ancestral allele
vcfdir: path to the VCF data
annodir: path to the annotation file, with sample names in the first column and group names in the second column. This file should have a header in the first row
modir (written as outdir in the example for 1): output path for the generated VCF file that uses the inferred allele as the reference
coutdir: output path for the derived allele counts of each SNP in each group
plotdir: output path for the SFS bar plot
group: the group that you want to analyze
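
The threshold rule described above can be sketched as follows. This is an illustration under stated assumptions, not the implementation in superSFS: `polarize_site`, its arguments, and the toy genotypes are hypothetical, and the site is assumed to be biallelic with phased genotypes.

```python
def polarize_site(ref, alt, genotypes, outgroup_idx, threshold):
    """Return (ancestral, derived, recoded_genotypes) for one biallelic SNP.

    If the number of variant (ALT, coded '1') alleles carried by the outgroup
    samples is greater than `threshold`, ALT is treated as ancestral and the
    genotype codes are flipped so that '0' always means the ancestral allele.
    """
    alt_in_outgroup = sum(genotypes[i].split("|").count("1") for i in outgroup_idx)
    if alt_in_outgroup > threshold:
        swap = {"0": "1", "1": "0"}
        recoded = ["|".join(swap.get(a, a) for a in gt.split("|")) for gt in genotypes]
        return alt, ref, recoded
    return ref, alt, list(genotypes)


# Hypothetical site: the first two samples are the outgroup.
genotypes = ["1|1", "1|0", "0|1", "0|0"]

# The outgroup carries 3 variant alleles, which is greater than 2, so ALT becomes ancestral.
flipped = polarize_site("A", "T", genotypes, outgroup_idx=[0, 1], threshold=2)

# With threshold 3 the count is not greater than the threshold, so nothing changes.
unchanged = polarize_site("A", "T", genotypes, outgroup_idx=[0, 1], threshold=3)
print(flipped)
print(unchanged)
```

Missing genotype codes such as "." are left untouched by the swap in this sketch; how superSFS itself handles missing data is not stated in this README.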
