Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Last update: Aug 11, 2022

Related tags

Deep Learning WASP2

Overview

WASP2 (Currently in pre-development): Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Requirements

Python >= 3.7
numpy
pandas
scipy
pysam
pybedtools

Installation

Recommended installation through conda, and given environment

conda env create -f environment.yml

Allelic Imbalance Analysis

Analysis pipeline currently consists of two tools (Count and Analysis)

Count Tool

Counts alleles in ATAC peaks that overlap heterozygous SNP's

Usage

python run_analysis.py count -a [BAM] -g [VCF] -s [VCF Sample] -r [Peaks] {OPTIONS}

Required Arguments

-a/--alignment: BAM file containing alignments.
-g/--genotypes: VCF file with genotypes.
-s/--sample: Sample name in VCF file.
-r/--regions: Regions of interest in narrowPeak, GTF, or BED format. (ONLY narrowPeak support implemented)

Single-Cell Additional Requirements

-sc/--singlecell: Flag that denotes data is single-cell.
-b/--barcodes: 2 Column TSV that contains barcodes and their group/cell mapping.

Optional Arguments

-o/--output: Directory to output counts. (Default. CWD)
--nofilt: Skip step that pre-filters reads that overlap regions of interest
--keeptemps: Keep intermediary files during preprocessing step, outputs to directory if given with flag, otherwise outputs to CWD.

Analysis Tool

Analyzes Allelic Imbalance per ATAC peak given allelic count data

Usage

python run_analysis.py analysis [COUNTS] {OPTIONS}

Required Arguments

COUNTS: first positional argument, output data from count tool

Single-Cell Additional Requirements

-sc/--singlecell: Flag that denotes data is single-cell

Optional Arguments

--min: Minimum allele count needed for analysis. (Default. 10)
-o/--output: Directory to output counts. Defaults to CWD if not given. (Default. CWD)
-m/--model: Model used for measuring imbalance. Choice of "single", "linear", or "binomial". (Default. "single")

TODO

Unbiased Read Mapping Curently in development

Allelic Imbalance Pipeline

Counts
- Need to implement RNA-Seq and Gene support
- More robust for different inputs for bulk and single-cell data
Analysis
- More specific implementations for single-cell data

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Related tags

Overview

WASP2 (Currently in pre-development): Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Requirements

Installation

Allelic Imbalance Analysis

Count Tool

Analysis Tool

TODO

Owner

McVicker Lab

Repository for MDPGT

Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)

DuBE: Duple-balanced Ensemble Learning from Skewed Data

Makes patches from huge resolution .svs slide files using openslide

Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers

PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.

A robust camera and Lidar fusion based velocity estimator to undistort the pointcloud.

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Easily Process a Batch of Cox Models

CLNTM - Contrastive Learning for Neural Topic Model

Implementation of PersonaGPT Dialog Model

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

LaneDetectionAndLaneKeeping - Lane Detection And Lane Keeping

Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

SurfEmb (CVPR 2022) - SurfEmb: Dense and Continuous Correspondence Distributions

Disagreement-Regularized Imitation Learning

Code, environments, and scripts for the paper: "How Private Is Your RL Policy? An Inverse RL Based Analysis Framework"

Neural Tangent Generalization Attacks (NTGA)

A series of Jupyter notebooks with Chinese comment that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.