Switch to unified view

a/README.md b/README.md
1
<!-- badges: start -->
1
2
  [![R-CMD-check](https://github.com/yigbt/multiGSEA/actions/workflows/test.yaml/badge.svg)](https://github.com/yigbt/multiGSEA/actions/workflows/test.yaml)
3
  <!-- badges: end -->
4
5
6
# The `multiGSEA` `R` package
2
# The `multiGSEA` `R` package
7
3
8
## Authors
4
## Authors
9
5
10
Sebastian Canzler and Jörg Hackermüller
6
Sebastian Canzler and Jörg Hackermüller
11
7
12
[multiGSEA: a GSEA-based pathway enrichment analysis for multi-omics data](https://doi.org/10.1186/s12859-020-03910-x), _BMC Bioinformatics_ 21, 561 (2020)
8
[multiGSEA: a GSEA-based pathway enrichment analysis for multi-omics data](https://doi.org/10.1186/s12859-020-03910-x), _BMC Bioinformatics_ 21, 561 (2020)
13
9
14
## Introduction
10
## Introduction
15
11
16
The `multiGSEA` package was designed to run a robust GSEA-based
12
The `multiGSEA` package was designed to run a robust GSEA-based
17
pathway enrichment for multiple omics layers. The enrichment is
13
pathway enrichment for multiple omics layers. The enrichment is
18
calculated for each omics layer separately and aggregated p-values are
14
calculated for each omics layer separately and aggregated p-values are
19
calculated afterwards to derive a composite multi-omics pathway
15
calculated afterwards to derive a composite multi-omics pathway
20
enrichment.
16
enrichment.
21
17
22
Pathway definitions can be downloaded from up to eight different
18
Pathway definitions can be downloaded from up to eight different
23
pathway databases by means of the
19
pathway databases by means of the
24
[`graphite`](http://bioconductor.org/packages/release/bioc/html/graphite.html)
20
[`graphite`](http://bioconductor.org/packages/release/bioc/html/graphite.html)
25
Bioconductor package.
21
Bioconductor package.
26
22
27
Features of the transcriptome and proteome level can be mapped to the
23
Features of the transcriptome and proteome level can be mapped to the
28
following ID formats:
24
following ID formats:
29
25
30
    * Entrez Gene ID
26
    * Entrez Gene ID
31
    * Uniprot IDs
27
    * Uniprot IDs
32
    * Gene Symbols
28
    * Gene Symbols
33
    * RefSeq
29
    * RefSeq
34
    * Ensembl
30
    * Ensembl
35
    
31
    
36
Features of the metabolome layer can be mapped to:
32
Features of the metabolome layer can be mapped to:
37
33
38
    * Comptox Dashboard IDs (DTXCID, DTXSID)
34
    * Comptox Dashboard IDs (DTXCID, DTXSID)
39
    * CAS-RN numbers
35
    * CAS-RN numbers
40
    * Pubchem IDs (CID)
36
    * Pubchem IDs (CID)
41
    * HMDB IDs
37
    * HMDB IDs
42
    * KEGG IDs
38
    * KEGG IDs
43
    * ChEBI IDs
39
    * ChEBI IDs
44
    * Drugbank IDs
40
    * Drugbank IDs
45
    * Common names
41
    * Common names
46
42
47
  
43
  
48
Please note, that the mapping of metabolite IDs is accomplished
44
Please note, that the mapping of metabolite IDs is accomplished
49
through the `metaboliteIDmapping` package.  This `AnnotationHub`
45
through the `metaboliteIDmapping` package.  This `AnnotationHub`
50
package provides a comprehensive mapping table with more than one
46
package provides a comprehensive mapping table with more than one
51
million compounds (`metaboliteIDmapping` on our [github
47
million compounds (`metaboliteIDmapping` on our [github
52
page](https://github.com/yigbt/metaboliteIDmapping) or at
48
page](https://github.com/yigbt/metaboliteIDmapping) or at
53
[Bioconductor](http://bioconductor.org/packages/metaboliteIDmapping/)).
49
[Bioconductor](http://bioconductor.org/packages/metaboliteIDmapping/)).
54
  
50
  
55
  
51
  
56
## Installation
52
## Installation
57
53
58
There are two ways to install the `multiGSEA` package. For both you
54
There are two ways to install the `multiGSEA` package. For both you
59
have to install and start R in at least version 4.0:
55
have to install and start R in at least version 4.0:
60
56
61
(i) Use the Bioconductor framework:
57
(i) Use the Bioconductor framework:
62
58
63
```R
59
```R
64
if (!requireNamespace("BiocManager", quietly = TRUE))
60
if (!requireNamespace("BiocManager", quietly = TRUE))
65
  install.packages("BiocManager")
61
  install.packages("BiocManager")
66
62
67
BiocManager::install("multiGSEA")
63
BiocManager::install("multiGSEA")
68
```
64
```
69
65
70
(ii) Alternatively, you can install the most up to date version
66
(ii) Alternatively, you can install the most up to date version
71
(development) easily with
67
(development) easily with
72
[devtools](https://github.com/hadley/devtools):
68
[devtools](https://github.com/hadley/devtools):
73
69
74
```R
70
```R
75
install.packages("devtools")
71
install.packages("devtools")
76
devtools::install_github("https://github.com/yigbt/multiGSEA")
72
devtools::install_github("https://github.com/yigbt/multiGSEA")
77
```
73
```
78
74
79
Once installed, just load the `multiGSEA` package with:
75
Once installed, just load the `multiGSEA` package with:
80
76
81
```R
77
```R
82
library(multiGSEA)
78
library(multiGSEA)
83
```
79
```
84
80
85
81
86
# Workflow
82
# Workflow
87
83
88
A common workflow is exemplified in the package vignette and is
84
A common workflow is exemplified in the package vignette and is
89
typically separated in the following steps:
85
typically separated in the following steps:
90
86
91
1. Load required libraries, including the `multiGSEA` package, and
87
1. Load required libraries, including the `multiGSEA` package, and
92
   omics data sets.
88
   omics data sets.
93
2. Create data structure for enrichment analysis.
89
2. Create data structure for enrichment analysis.
94
3. Download and customize the pathway definitions.
90
3. Download and customize the pathway definitions.
95
4. Run the pathway enrichment for each omics layer.
91
4. Run the pathway enrichment for each omics layer.
96
5. Calculate the aggregated pathway enrichment.
92
5. Calculate the aggregated pathway enrichment.
97
93
98
94
99
For more information please have a look in the vignette at our
95
For more information please have a look in the vignette at our
100
[Bioconductor
96
[Bioconductor
101
page](https://bioconductor.org/packages/release/bioc/vignettes/multiGSEA/inst/doc/multiGSEA.html).
97
page](https://bioconductor.org/packages/release/bioc/vignettes/multiGSEA/inst/doc/multiGSEA.html).
102
98
103
99
104
# LICENSE
100
# LICENSE
105
101
106
Copyright (C) 2011 - 2020 Helmholtz Centre for Environmental Research
102
Copyright (C) 2011 - 2020 Helmholtz Centre for Environmental Research
107
UFZ.
103
UFZ.
108
104
109
This program is free software: you can redistribute it and/or modify
105
This program is free software: you can redistribute it and/or modify
110
it under the terms of the GNU General Public License as published by
106
it under the terms of the GNU General Public License as published by
111
the Free Software Foundation, either version 3 of the License, or (at
107
the Free Software Foundation, either version 3 of the License, or (at
112
your option) any later version.
108
your option) any later version.
113
109
114
This program is distributed in the hope that it will be useful, but
110
This program is distributed in the hope that it will be useful, but
115
WITHOUT ANY WARRANTY; without even the implied warranty of
111
WITHOUT ANY WARRANTY; without even the implied warranty of
116
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the UFZ
112
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the UFZ
117
License document for more details:
113
License document for more details:
118
<https://github.com/yigbt/multiGSEA/blob/master/LICENSE.md>
114
<https://github.com/yigbt/multiGSEA/blob/master/LICENSE.md>