|
a |
|
b/vignettes/Tweedieverse-vignette.Rmd |
|
|
1 |
--- |
|
|
2 |
title: "Tweedieverse: Differential analysis of omics data based on the Tweedie distribution" |
|
|
3 |
author: "Himel Mallick, Ali Rahnavard" |
|
|
4 |
date: "`r Sys.Date()`" |
|
|
5 |
output: |
|
|
6 |
rmarkdown::github_document: |
|
|
7 |
toc: yes |
|
|
8 |
toc_depth: 4 |
|
|
9 |
vignette: > |
|
|
10 |
%\VignetteIndexEntry{Tweedieverse : Differential analysis of omics data based on the Tweedie distribution} |
|
|
11 |
%\VignetteEngine{knitr::rmarkdown} |
|
|
12 |
%\VignetteEncoding{UTF-8} |
|
|
13 |
--- |
|
|
14 |
|
|
|
15 |
<!-- Himel Mallick --> |
|
|
16 |
<!-- 2021-03-03 <img src="docs/logo.jpg" align="right" width="365px"/> --> |
|
|
17 |
|
|
|
18 |
## Introduction |
|
|
19 |
|
|
|
20 |
Tweedieverse is an R package for differential analysis of omics data implementing a range of statistical methodology based on the [Tweedie distribution](https://en.wikipedia.org/wiki/Tweedie_distribution). |
|
|
21 |
|
|
|
22 |
Unlike traditional single-omics tools, Tweedieverse is technology-agnostic and can be applied to both count and continuous measurements arising from diverse high-throughput technologies (e.g., transcript abundances from bulk and single-cell RNA-Seq studies in the form of UMI counts or non-UMI counts, microbiome taxonomic and functional profiles in the form of counts or relative abundances, and compound abundance levels or peak intensities from metabolomics and other mass spectrometry-based experiments, among others). |
|
|
23 |
|
|
|
24 |
The software includes multiple analysis methods (e.g., self-adaptive, zero-inflated, and non-zero-inflated statistical models) as well as multiple customization options such as the inclusion of random effects and multiple covariates along with several data exploration capabilities and visualization modules in a unified estimation umbrella. |
|
|
25 |
|
|
|
26 |
## Installation |
|
|
27 |
|
|
|
28 |
To install the latest release version of `Tweedieverse` from [CRAN](https://cran.r-project.org/) (_*not yet available*_) run the following command: |
|
|
29 |
|
|
|
30 |
```r |
|
|
31 |
install.packages("Tweedieverse") |
|
|
32 |
library(Tweedieverse) |
|
|
33 |
``` |
|
|
34 |
Alternatively, the latest development version of `Tweedieverse` can be loaded using the following command (execute from within a fresh R session): |
|
|
35 |
|
|
|
36 |
```r |
|
|
37 |
install.packages('devtools') |
|
|
38 |
library(devtools) |
|
|
39 |
devtools::install_github("himelmallick/Tweedieverse") |
|
|
40 |
library(Tweedieverse) |
|
|
41 |
``` |
|
|
42 |
|
|
|
43 |
After installing `Tweedieverse`, please make sure the following package versions are also installed (a prerequisite for zero-inflated Tweedie models): |
|
|
44 |
|
|
|
45 |
```R |
|
|
46 |
devtools::install_version("statmod", version = "1.4.33", repos ="http://cran.us.r-project.org") |
|
|
47 |
|
|
|
48 |
``` |
|
|
49 |
```R |
|
|
50 |
devtools::install_version("cplm", version = "0.7-8", repos = "http://cran.us.r-project.org") |
|
|
51 |
``` |
|
|
52 |
|
|
|
53 |
## Basic Usage |
|
|
54 |
|
|
|
55 |
```r |
|
|
56 |
Tweedieverse(features, metadata, output) |
|
|
57 |
``` |
|
|
58 |
|
|
|
59 |
## Input |
|
|
60 |
|
|
|
61 |
Tweedieverse requires two input files: |
|
|
62 |
|
|
|
63 |
- **features**: A data frame of omics features such as taxa, genes, transcripts, metabolites, etc. |
|
|
64 |
- **metadata**: A data frame of metadata to be associated. |
|
|
65 |
|
|
|
66 |
For full options, check out the [user manual](https://github.com/himelmallick/Tweedieverse/tree/master/vignettes) or type ``` ?Tweedieverse``` in your R console. |
|
|
67 |
|
|
|
68 |
## Output |
|
|
69 |
|
|
|
70 |
A data frame containing coefficient estimates, p-values, and q-values (multiplicity-adjusted p-values) are returned, along with other parameter estimates from the fitted per-feature models. |
|
|
71 |
|
|
|
72 |
## Getting Started with Tweedieverse |
|
|
73 |
|
|
|
74 |
Check out the [Tweedie Labs](https://github.com/himelmallick/TweedieLabs/) repository for a collection of walkthrough tutorials (available as source codes, cloud-compatible images, and installable packages) on how to use Tweedieverse with various omics data types. |
|
|
75 |
|
|
|
76 |
## Citation |
|
|
77 |
|
|
|
78 |
To cite **`Tweedieverse`** in publications, please use: |
|
|
79 |
|
|
|
80 |
Mallick H et al. (2021). [Differential Expression of Single-cell RNA-seq Data using Tweedie Models](https://www.biorxiv.org/content/10.1101/2021.03.28.437378v1). bioRxiv, https://doi.org/10.1101/2021.03.28.437378. |
|
|
81 |
|
|
|
82 |
To cite the **`Tweedieverse`** software, please use: |
|
|
83 |
|
|
|
84 |
Mallick H et al. (2021). [Tweedieverse - A Unified Statistical Framework for Differential Analysis of Multi-omics Data](https://github.com/himelmallick/Tweedieverse). R package, https://github.com/himelmallick/Tweedieverse. |
|
|
85 |
|
|
|
86 |
## Issues |
|
|
87 |
|
|
|
88 |
We are happy to troubleshoot any issues with the package. Please contact the maintainer via email or [open an issue](https://github.com/himelmallick/tweedieverse/issues) in the GitHub repository. |