Maintained by the Love Lab


highly used

DESeq2

  • Tests for differential expression based on a negative binomial GLM.
    Collaboration with Simon Anders and Wolfgang Huber (EMBL Heidelberg).

tximport

  • Imports transcript-level abundance, estimated counts and transcript lengths.
    Collaboration with Charlotte Soneson (FMI) and Mark Robinson (UZH Zürich).
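
The three quantities named above (abundance, estimated counts, transcript lengths) are often collapsed to gene level after import. The following is a minimal Python sketch of that aggregation, not tximport's own code: counts and abundances are summed per gene, and gene length is taken as the abundance-weighted mean of transcript lengths. The function name `summarize_to_gene`, the `tx2gene` dictionary, and all values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def summarize_to_gene(tx_counts, tx_abundance, tx_length, tx2gene):
    """Collapse transcript-level values to gene level for one sample.

    Counts and abundances are summed; length is the abundance-weighted
    mean of the transcript lengths (genes with zero abundance are skipped).
    """
    counts = defaultdict(float)
    abundance = defaultdict(float)
    weighted_length = defaultdict(float)
    for tx, gene in tx2gene.items():
        counts[gene] += tx_counts[tx]
        abundance[gene] += tx_abundance[tx]
        weighted_length[gene] += tx_abundance[tx] * tx_length[tx]
    length = {
        gene: weighted_length[gene] / abundance[gene]
        for gene in abundance
        if abundance[gene] > 0
    }
    return dict(counts), dict(abundance), length


# Hypothetical toy data: three transcripts, two genes.
tx2gene = {"tx1": "g1", "tx2": "g1", "tx3": "g2"}
tx_counts = {"tx1": 10.0, "tx2": 30.0, "tx3": 5.0}
tx_abundance = {"tx1": 2.0, "tx2": 6.0, "tx3": 1.0}
tx_length = {"tx1": 1000.0, "tx2": 2000.0, "tx3": 500.0}

gene_counts, gene_abundance, gene_length = summarize_to_gene(
    tx_counts, tx_abundance, tx_length, tx2gene
)
print(gene_counts, gene_abundance, gene_length)
```

In the toy data, g1 has an abundance-weighted length of (2 × 1000 + 6 × 2000) / 8 = 1750, closer to the more abundant transcript.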

tximeta

  • Import transcript abundances with automatic population of metadata.
    Collaboration with Rob Patro (UMD), Charlotte Soneson (FMI), and Peter Hickey (WEHI).

apeglm

  • Bayesian shrinkage estimators for effect sizes for a variety of GLMs.
    Developed by Anqi Zhu (UNC-CH), collaboration with Joseph Ibrahim (UNC-CH).
    apeglm methods can be accessed via lfcShrink in the DESeq2 package.

newly developed

nullranges

  • Modular package for generation of sets of ranges representing the null hypothesis. These can take the form of bootstrap samples of ranges or sets of control ranges that are matched across one or more covariates. Developed by Wancen Mu, Eric Davis, and Douglas Phanstiel (UNC-CH). Contributions from other Bioconductor developers as well. Funding provided by CZI EOSS award.
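
To make "control ranges that are matched across one or more covariates" concrete, here is a minimal Python sketch of one simple strategy: greedy nearest-neighbour matching without replacement on a single covariate. It is an illustration under stated assumptions, not the nullranges implementation; the function name `match_controls`, the range identifiers, and the covariate values (imagined here as GC content) are all hypothetical.

```python
def match_controls(focal, pool):
    """Pick one control range per focal range by closest covariate value.

    focal, pool: dicts mapping a range identifier to a numeric covariate.
    Matching is greedy and without replacement, so each pool range is
    used at most once.
    """
    if len(pool) < len(focal):
        raise ValueError("pool must contain at least as many ranges as focal")
    available = dict(pool)
    matched = {}
    for focal_id, focal_cov in focal.items():
        # Closest remaining pool range on the covariate scale.
        best = min(available, key=lambda pid: abs(available[pid] - focal_cov))
        matched[focal_id] = best
        del available[best]
    return matched


# Hypothetical covariate values for two focal ranges and four candidates.
focal = {"peak1": 0.42, "peak2": 0.61}
pool = {"bg1": 0.30, "bg2": 0.45, "bg3": 0.60, "bg4": 0.80}

controls = match_controls(focal, pool)
print(controls)
```

Greedy matching depends on the order of the focal set. With several covariates, a distance over all of them or a propensity-style score would replace the absolute difference used here.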

airpart

  • Identification of differential cell-type-specific allelic imbalance across cell types or states, utilizing single-cell allelic counts. Provides partitioning of cell types by allelic signal using generalized fused lasso, plus many EDA and QC plotting functions. Developed by Wancen Mu, in collaboration with the Patro group (UMD) and KB Choi (JAX).

MRLocus

  • Bayesian estimation of the gene-to-trait effect from eQTL and GWAS summary data for loci displaying allelic heterogeneity, that is, containing multiple LD-independent eQTLs. Developed in collaboration with Anqi Zhu, Nana Matoba, and Jason Stein (UNC-CH).

DeCompress

  • A semi-reference-free method that uses compressed sensing to deconvolve tissue compartments from bulk mRNA expression measured on targeted panels, such as NanoString nCounter. Developed by Arjun Bhattacharya (UNC-CH).

MOSTWAS

  • Suite of tools to prioritize distal variants in transcriptomic prediction, and conduct TWAS-like association testing using GWAS summary statistics. Developed by Arjun Bhattacharya (UNC-CH).

actor

  • A latent Dirichlet model with Dirichlet Multinomial observations to compare expressed isoform proportions in a dataset to an independent reference panel. Developed by Sean McCabe (UNC-CH), collaboration with Andrew Nobel (STOR, UNC-CH).

fishpond

  • swish is a nonparametric differential transcript and gene analysis method making use of inferential replicate counts. Collaboration with Anqi Zhu and Joseph Ibrahim (UNC-CH), and Avi Srivastava and Rob Patro (UMD). swish lives in the Bioconductor package fishpond.

movie

  • A framework for evaluating variance classification methods using multi-omics data. Using data segmentation, this framework aims to identify the consistency and the extent of overfitting of multi-omics methods. Developed by Sean McCabe (UNC-CH), collaboration with Dan-Yu Lin (UNC-CH).

Published workflows

rnaseqGene

  • RNA-seq workflow: gene-level exploratory analysis and differential expression.
    Developed in collaboration with Simon Anders, Vladislav Kim, Wolfgang Huber (EMBL Heidelberg).
    F1000Research publication

rnaseqDTU

  • Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.
    Developed in collaboration with Charlotte Soneson (FMI) and Rob Patro (UMD).
    F1000Research publication

fluentGenomics

  • An extended workflow using the plyranges and tximeta packages for fluent genomic data analysis.
    Developed by Stuart Lee (WEHI), in collaboration with Michael Lawrence (Genentech).
    F1000Research publication

Contributor

Salmon

  • Software for quantifying the expression of transcripts using RNA-seq data, developed and maintained by Rob Patro (UMD). The Love lab collaborates with Dr. Patro on bias correction methods, on estimation of uncertainty through Gibbs and bootstrap sampling, and on propagation of metadata from abundance estimation to downstream analysis packages.
  • My Snakemake file for running Salmon

GenomicFiles

  • Provides infrastructure for parallel computations distributed ‘by file’ or ‘by range’. User defined MAPPER and REDUCER functions provide added flexibility for data combination and manipulation. Collaboration with Valerie Obenchain and Martin Morgan (Bioconductor core team).
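
The 'by file' and 'by range' patterns with user-defined MAPPER and REDUCER functions can be sketched in a few lines of Python. This is a conceptual illustration only, not the GenomicFiles API: the function names `reduce_by_file` and `reduce_by_range`, the in-memory "files" (lists of read positions), and the example ranges are all hypothetical, and the sketch runs serially rather than in parallel.

```python
def reduce_by_file(files, ranges, mapper, reducer):
    """For each file, map over all ranges, then reduce to one value per file."""
    return {
        name: reducer([mapper(data, rng) for rng in ranges])
        for name, data in files.items()
    }


def reduce_by_range(files, ranges, mapper, reducer):
    """For each range, map over all files, then reduce to one value per range."""
    return {
        rng: reducer([mapper(data, rng) for data in files.values()])
        for rng in ranges
    }


def count_in_range(positions, rng):
    """MAPPER: count positions falling inside the closed interval rng."""
    start, end = rng
    return sum(1 for p in positions if start <= p <= end)


# Hypothetical stand-ins for files: read start positions per sample.
files = {"sampleA": [5, 12, 18, 40], "sampleB": [1, 3, 15, 16, 33, 90]}
ranges = [(1, 10), (11, 20)]

totals_by_file = reduce_by_file(files, ranges, count_in_range, sum)
totals_by_range = reduce_by_range(files, ranges, count_in_range, sum)
print(totals_by_file)
print(totals_by_range)
```

The same MAPPER serves both patterns; only the grouping of the mapped results before the REDUCER changes.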

rafalib — CRAN

  • A series of shortcuts for routine tasks. Collaboration with Rafael Irizarry (DFCI Boston).

Data packages

macrophage

  • This package provides the output of running Salmon on a set of 24 RNA-seq samples from Alasoo et al., “Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response”, published in Nature Genetics, January 2018.

oct4

  • This package provides the output of running Salmon on a set of 12 RNA-seq samples from King & Klose, “The pioneer factor OCT4 requires the chromatin remodeller BRG1 to support gene regulatory element function in mouse embryonic stem cells”, published in eLife, March 2017.

airway

  • This package provides a SummarizedExperiment object of read counts in genes for an RNA-seq experiment on four human airway smooth muscle cell lines treated with dexamethasone. The citation for the experiment is: Himes BE et al. (2014).

fission

  • This package provides a SummarizedExperiment object of read counts in genes for a time course RNA-seq experiment of fission yeast (Schizosaccharomyces pombe) in response to oxidative stress (1 M sorbitol treatment) at 0, 15, 30, 60, 120 and 180 minutes. The citation for the experiment is: Leong HS et al. (2014).

parathyroidSE

  • This package provides SummarizedExperiment objects of read counts in genes and exonic parts for paired-end RNA-seq data from experiments on primary cultures of parathyroid tumors. The citation for the experiment is: Haglund F et al. (2012).

tximportData

  • This package provides output files from common transcript estimation software (Salmon, Kallisto, RSEM, Cufflinks) for demonstrating import with tximport. The files are a subset of 6 samples from the GEUVADIS project. The citation for the GEUVADIS project is: Lappalainen et al. (2013).

alpineData

  • This package provides a subset of alignments for demonstrating alpine. The aligned samples are a subset of 4 samples from the GEUVADIS project. The citation for the GEUVADIS project is: Lappalainen et al. (2013).


Older packages from the lab

alpine

  • Modeling and correcting fragment sequence bias for RNA-seq transcript abundance estimation. Collaboration with Rafael Irizarry (DFCI Boston).

exomeCopy

  • Detection of copy number variants (CNV) from exome sequencing samples, including unpaired samples. The package implements a hidden Markov model which uses positional covariates, such as background read depth and GC-content, to simultaneously normalize and segment the samples into regions of constant copy count. Collaboration with Alena van Bömmel, Stefan Haas and Martin Vingron (MPI Berlin).

SparseData

  • Efficiently calculate statistics such as group mean, standard deviation and t-statistics on large sparse genomic data sets.
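
The efficiency gain with sparse data comes from touching only the non-zero entries while still accounting for the zeros in each group's size. The following is a minimal Python sketch of that idea for group means and standard deviations of one feature. It is not the SparseData implementation; the function name `sparse_group_stats`, the dictionary-based sparse row, and the group labels are hypothetical.

```python
import math
from collections import Counter


def sparse_group_stats(row, groups):
    """Group mean and sample standard deviation for one sparse feature.

    row: {column_index: non-zero value}; columns absent from row are zero.
    groups: list giving the group label of every column.
    Only non-zero entries are visited; zeros enter through the group sizes.
    """
    sizes = Counter(groups)
    sums = {g: 0.0 for g in sizes}
    sums_sq = {g: 0.0 for g in sizes}
    for col, value in row.items():
        g = groups[col]
        sums[g] += value
        sums_sq[g] += value * value
    stats = {}
    for g, n in sizes.items():
        mean = sums[g] / n
        # Sample variance from running sums; undefined for groups of size 1.
        var = (sums_sq[g] - n * mean * mean) / (n - 1) if n > 1 else float("nan")
        stats[g] = (mean, math.sqrt(max(var, 0.0)) if n > 1 else var)
    return stats


# Hypothetical feature with six samples, three of them zero.
groups = ["ctrl", "ctrl", "ctrl", "trt", "trt", "trt"]
row = {1: 6.0, 4: 3.0, 5: 9.0}

stats = sparse_group_stats(row, groups)
print(stats)
```

The running-sum variance formula is convenient for a sketch but can lose precision on large values; a production implementation would likely use a more numerically stable update.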