GitHub pages for Love Lab software

Most software from the Love Lab is part of the Bioconductor Project. Many of these packages are mirrored here:

The Love Lab is part of a large collaborative project within Bioconductor, called “tidyomics”. Tidyomics packages enable dplyr-like syntax for complex omics datasets. The Tidyomics Project homepage is here:

Package download statistics for a subset of lab software:

Highly used software

DESeq2 usage stats

  • Test for differential expression based on Negative Binomial GLM.
    Work with Simon Anders and Wolfgang Huber.

apeglm usage stats

  • Bayesian shrinkage estimators for effect sizes for GLM.
    Developed by Anqi Zhu, work with Joseph Ibrahim.
    apeglm methods can be accessed via DESeq2::lfcShrink.

fishpond usage stats

  • swish is a nonparametric DTE and DGE method.
    Developed by Anqi Zhu, work with Joseph Ibrahim, Avi Srivastava, Rob Patro.
  • SEESAW is a suite of methods for allelic expression analysis.
    Developed by Euphy Wu, work with Gary Churchill, KB Choi, Cheryl Ackert-Bicknell, Rob Patro.

tximport usage stats

  • Imports transcript-level abundance, counts and lengths.
    Work with Charlotte Soneson and Mark Robinson.

tximeta usage stats

  • Imports transcript abundances with automatic population of metadata.
    Work with Rob Patro, Charlotte Soneson, and Peter Hickey.

Newly developed software

splicelogic

  • Bioconductor package for detecting alternative splicing events from differential transcript usage results.
    Developed by Bea Campillo.
    Funding provided by NHGRI and Wellcome Trust – EOSS award.

plyxp

  • Bioconductor package that provides efficient dplyr-like syntax for annotated matrix data (SummarizedExperiment). Part of the tidyomics project.
    Developed by Justin Landis.
    Funding provided by NHGRI and Wellcome Trust – EOSS award.

nullranges

  • Modular package for generation of sets of ranges representing the null hypothesis. These can take the form of bootRanges, block bootstrap samples of ranges, or matchRanges, sets of control ranges that are matched across one or more covariates.
    Developed by Wancen Mu, Eric Davis, and Douglas Phanstiel. Funding provided by CZI – EOSS award.

rigvf

  • Bioconductor package that provides an interface to the IGVF Catalog data.
    Work with Martin Morgan.

classifygxt

  • R package for classifying gene-by-treatment (GxT) interactions using Bayesian model selection (BMS). The method is primarily designed for molecular count phenotypes, such as gene expression and chromatin accessibility.
    Developed by Yuriko Harigaya, work with Will Valdar.

diffsig

  • R package for estimating the association of risk factors with mutational signatures, making use of a Bayesian Dirichlet-multinomial hierarchical model for counts.
    Developed by Ji-Eun Park, work with Melissa Troester and Katie Hoadley.

Research packages

MRLocus

  • Bayesian estimation of the gene-to-trait effect from eQTL and GWAS summary data for loci displaying allelic heterogeneity, that is, containing multiple LD-independent eQTLs. Developed in collaboration with Anqi Zhu, Nana Matoba, and Jason Stein (UNC-CH).

airpart

  • Identification of differential cell-type-specific allelic imbalance across cell types or states, utilizing single-cell allelic counts. Provides partitioning of cell types by allelic signal using generalized fused lasso, plus many EDA and QC plotting functions. Developed by Wancen Mu, in collaboration with the Patro group (UMD) and KB Choi (JAX).

DeCompress

  • A semi-reference free method that uses compressed sensing to deconvolve tissue compartments from bulk mRNA expression from targeted panels, like NanoString nCounter. Developed by Arjun Bhattacharya (UNC-CH).

MOSTWAS

  • Suite of tools to prioritize distal variants in transcriptomic prediction, and conduct TWAS-like association testing using GWAS summary statistics. Developed by Arjun Bhattacharya (UNC-CH).

Maintainer

plyranges

  • A consistent interface for importing and wrangling genomics data from a variety of sources. The package defines a grammar of genomic data transformation based on dplyr and GenomicRanges. Developed by Stuart Lee, Di Cook, and Michael Lawrence. Maintained by Love Lab from 2024.

Published workflows

rnaseqGene

  • RNA-seq workflow: gene-level exploratory analysis and differential expression.
    Developed in collaboration with Simon Anders, Vladislav Kim, Wolfgang Huber (EMBL Heidelberg).
    F1000Research publication

rnaseqDTU

  • Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.
    Developed in collaboration with Charlotte Soneson (FMI) and Rob Patro (UMD).
    F1000Research publication

fluentGenomics

  • An extended workflow using the plyranges and tximeta packages for fluent genomic data analysis.
    Developed by Stuart Lee (WEHI), in collaboration with Michael Lawrence (Genentech).
    F1000Research publication

Contributor

Salmon

  • Software for quantifying the expression of transcripts using RNA-seq data, developed and maintained by Rob Patro (UMD). The Love lab collaborates with Dr. Patro on bias correction methods, on estimation of uncertainty through Gibbs and bootstrap sampling, and on propagation of metadata from abundance estimation to downstream analysis packages.
  • My Snakemake file for running Salmon

GenomicFiles

  • Provides infrastructure for parallel computations distributed ‘by file’ or ‘by range’. User defined MAPPER and REDUCER functions provide added flexibility for data combination and manipulation. Collaboration with Valerie Obenchain and Martin Morgan (Bioconductor core team).

rafalib — CRAN

  • A series of shortcuts for routine tasks. Collaboration with Rafael Irizarry (DFCI Boston).

Data packages

macrophage This package provides the output of running Salmon on a set of 24 RNA-seq samples from Alasoo, et al., “Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response”, published in Nature Genetics, January 2018.

oct4 This package provides the output of running Salmon on a set of 12 RNA-seq samples from King & Klose, “The pioneer factor OCT4 requires the chromatin remodeller BRG1 to support gene regulatory element function in mouse embryonic stem cells”, published in eLIFE, March 2017.

airway This package provides a SummarizedExperiment object of read counts in genes for an RNA-Seq experiment on four human airway smooth muscle cell lines treated with dexamethasone. The citation for the experiment is: Himes BE et al (2014).

fission This package provides a SummarizedExperiment object of read counts in genes for a time course RNA-Seq experiment of fission yeast (Schizosaccharomyces pombe) in response to oxidative stress (1M sorbitol treatment) at 0, 15, 30, 60, 120 and 180 mins. The citation for the experiment is: Leong HS et al. (2014).

parathyroidSE This package provides SummarizedExperiment objects of read counts in genes and exonic parts for paired-end RNA-Seq data from experiments on primary cultures of parathyroid tumors. The citation for the experiment is: Haglund F et al (2012).

tximportData This packages provides output files from common transcript estimation software (Salmon, Kallisto, RSEM, Cufflinks) for demonstration of import using tximport. The files are a subset of 6 samples from the GEUVADIS project. The citation for the GEUVADIS project is: Lappalainen et al (2013)

nullrangesData Provides datasets for the nullranges package vignette, in particular example datasets for DNase hypersensitivity sites (DHS), CTCF binding sites, and CTCF genomic interactions. These are used to demonstrate generation of null hypothesis feature sets, either through block bootstrapping or matching, in the nullranges vignette.

alpineData This packages provides a subset of alignments for demonstration of alpine. The samples aligned are a subset of 4 samples from the GEUVADIS project. The citation for the GEUVADIS project is: Lappalainen et al (2013)