DANA Supplements

This repository stores the data, results, and R scripts used to generate the results and figures for the paper "Depth Normalization of Small RNA Sequencing: Using Data and Biology to Select a Suitable Method". The DANA package is available on GitHub: https://github.com/LXQin/DANA

DANA is an approach for assessing the performance of normalization methods for microRNA-Seq data, based on biology-motivated and data-driven metrics. It takes advantage of two well-known biological features of microRNAs, their expression patterns and their polycistronic clustering, to assess (1) how effectively normalization removes handling effects and (2) how much normalization biases true biological signals. DANA is implemented in R and can be used to assess any normalization method (under minimal assumptions) on any microRNA-Seq data set. The only additional information it requires is the polycistronic clustering of the microRNAs, which is typically readily available.

Installation

This repository is not the DANA package. It stores the R scripts and data used to generate the results and figures in the paper. For simplicity, this repository contains a "snapshot" of the DANA implementation as includable R code. This way, you do not need to install the DANA package to run the analysis, and future updates of the DANA package do not affect the results generated here. If you want the package itself, you can install the released version of DANA directly from GitHub using devtools.
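For readers who do want the package, the devtools route mentioned above might look like the following sketch. It assumes that devtools can be installed from CRAN and that the repository path is LXQin/DANA, as taken from the GitHub URL given earlier. It is not required for running the analysis scripts in this repository.

```r
# Install devtools first if it is not already available (assumed to come from CRAN).
if (!requireNamespace("devtools", quietly = TRUE)) {
  install.packages("devtools")
}

# Repository path taken from https://github.com/LXQin/DANA
devtools::install_github("LXQin/DANA")
```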

Dependencies

To run the R code, you need to install the following packages: ggplot2, gridExtra, ggnewscale, corrplot, stargazer, plotly, ggrepel, glmnet, huge, Rcpp, FastGGM, edgeR, DESeq, PoissonSeq, sva, RUVSeq, vsn, DescTools, ffpe. Please make sure to install all dependencies prior to running the code. The code presented here was implemented and tested in R version 4.0.2.
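One way to check which of these packages are still missing is sketched below. The helper name find_missing is hypothetical and used only for illustration. The sketch only reports missing packages; it does not install them, because the source does not say where each package should be obtained.

```r
# Package names copied from the dependency list above.
required_pkgs <- c(
  "ggplot2", "gridExtra", "ggnewscale", "corrplot", "stargazer", "plotly",
  "ggrepel", "glmnet", "huge", "Rcpp", "FastGGM", "edgeR", "DESeq",
  "PoissonSeq", "sva", "RUVSeq", "vsn", "DescTools", "ffpe"
)

# Hypothetical helper: return the entries of `pkgs` that are not installed
# in any of the current library paths.
find_missing <- function(pkgs) {
  installed <- rownames(installed.packages())
  setdiff(pkgs, installed)
}

missing_pkgs <- find_missing(required_pkgs)
if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
} else {
  message("All dependencies are installed.")
}
```

Running this before knitting any of the analyses gives a quick list of what still needs to be installed.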

Usage

Download this repository.
Set your R working directory to the root directory of the project.
Run or knit any of the following R Markdown files:
- MSK_Data_Analysis.Rmd to generate the DANA results for the paired MSK sarcoma data sets.
- TCGA_UCEC_Data_Analysis.Rmd to generate the DANA results for the single-batch and mixed-batch data sets from the TCGA-UCEC project.
- TCGA_BRCA_UCS_Data_Analysis.Rmd to generate the DANA results for the combined TCGA-BRCA and TCGA-UCS data set.
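As a sketch, knitting one of these files from the R console could look like this. It assumes that the rmarkdown package is installed (it is not in the dependency list above) and that the working directory is the root of the repository.

```r
# File names as listed above.
analyses <- c(
  "MSK_Data_Analysis.Rmd",
  "TCGA_UCEC_Data_Analysis.Rmd",
  "TCGA_BRCA_UCS_Data_Analysis.Rmd"
)

# Knit the first analysis; change the index to run another one.
target <- analyses[1]
if (file.exists(target)) {
  rmarkdown::render(target)
} else {
  message("Cannot find ", target, "; is the working directory the repository root?")
}
```

By default, rmarkdown::render writes the output file next to the input file.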

All of these R Markdown files have already been knitted, so you do not have to run them yourself. The resulting HTML files can be found in the directory docs/.
