|
a |
|
b/README.md |
|
|
1 |
# Protocols-4pub |
|
|
2 |
|
|
|
3 |
## *Reusable code collection* |
|
|
4 |
|
|
|
5 |
##### 26 April, 2021 |
|
|
6 |
|
|
|
7 |
Multi-omics analysis protocols concluded by SajuukLyu <lvyulin@pku.edu.cn>. |
|
|
8 |
|
|
|
9 |
--- |
|
|
10 |
|
|
|
11 |
*声明:该项目仅供学习用途,请共同遵守[开源协议](https://github.com/sajuukLyu/Protocols-4pub/blob/main/LICENSE),禁止用于商业盈利。未经作者同意请勿转载至其它媒体。* |
|
|
12 |
|
|
|
13 |
--- |
|
|
14 |
|
|
|
15 |
Integrated step-by-step vignettes present on the [wiki](https://github.com/sajuukLyu/Protocols-4pub/wiki). |
|
|
16 |
|
|
|
17 |
These files including three main parts: |
|
|
18 |
|
|
|
19 |
- **Preprocess pipeline** (*etc. bulkRNA, bulkATAC, ChIP, WGBS ...*) |
|
|
20 |
|
|
|
21 |
These R scripts are designed to generate a directory tree containing ordered directories for intermediate data and Linux bash scripts for each step. |
|
|
22 |
In convenient to submit jobs to different nodes of a computer cluster at same time, commands for multiple samples can be divided to several batches and than submit together. |
|
|
23 |
|
|
|
24 |
- **Downstream analysis script** (*etc. DEG, GO, GSEA ...*) |
|
|
25 |
|
|
|
26 |
These scripts are the framework of different analyses designed to cope with the most simple scene of biology analysis. |
|
|
27 |
|
|
|
28 |
- **Visualization script** (*etc. heatmap, PCA plot, Venn plot, track plot ...*) |
|
|
29 |
|
|
|
30 |
These scripts can generate some plots suitable for articles with very few post processing. |
|
|
31 |
|
|
|
32 |
### File description |
|
|
33 |
|
|
|
34 |
- Preprocess pipeline |
|
|
35 |
|
|
|
36 |
| file name | description | |
|
|
37 |
| ------------------------------------------------------------ | ------------------------------------------------------------ | |
|
|
38 |
| [bulkATACpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/bulkATACpre.R) | Preprocess pipelines for bulk ATAC-seq [*(vignettes)*](https://sajuuklyu.github.io/Protocols-4pub/exampleData/ATAC/bulkATACpre.html) | |
|
|
39 |
| [bulkRNApre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/bulkRNApre.R) | Preprocess pipelines for bulk RNA-seq [*(vignettes)*](https://sajuuklyu.github.io/Protocols-4pub/exampleData/RNA/bulkRNApre.html) | |
|
|
40 |
| [ChIPpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/ChIPpre.R) | Preprocess pipelines for ChIP-seq | |
|
|
41 |
| [WGBSpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/WGBSpre.R) | Preprocess pipelines for Whole Genome Bisulfite Sequencing | |
|
|
42 |
|
|
|
43 |
- Downstream analysis script |
|
|
44 |
|
|
|
45 |
| file name | omics | description | |
|
|
46 |
| ------------------------------------------------------------ | ----- | ----------------------------------------------------- | |
|
|
47 |
| [downloadData.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/downloadData.R) | any | Download data from public databases | |
|
|
48 |
| [bulkRNAana_1_loadCount.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_1_loadCount.R) | RNA | Load multiple results and convert to count matrix | |
|
|
49 |
| [bulkRNAana_2a_DESeq.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_2a_DESeq.R) | RNA | Perform DESeq analysis (for sample without replicate) | |
|
|
50 |
| [bulkRNAana_2b_DESeq2.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_2b_DEseq2.R) | RNA | Perform DESeq2 analysis (for sample with replicates) | |
|
|
51 |
| [bulkRNAana_3_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_3_GO.R) | RNA | Perform GO analysis | |
|
|
52 |
| [bulkATACana_1_QC.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_1_QC.R) | ATAC | Quality control | |
|
|
53 |
| [bulkATACana_2_loadCount.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_2_loadCount.R) | ATAC | Load data and convert to peak matrix | |
|
|
54 |
| [bulkATACana_3_annotatePeak.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_3_annotatePeak.R) | ATAC | Annotate peaks to nearest genes | |
|
|
55 |
| [bulkATACana_4_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_4_GO.R) | ATAC | Perform GO analysis | |
|
|
56 |
| [bulkATACana_5_peakTF.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_5_peakTF.R) | ATAC | Perform gene regulate network analysis | |
|
|
57 |
|
|
|
58 |
- Visualization script |
|
|
59 |
|
|
|
60 |
| file name | omics | description | |
|
|
61 |
| ------------------------------------------------------------ | ----- | ---------------------------------------------- | |
|
|
62 |
| [Visulz_bulkRNA_PCA.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_PCA.R) | RNA | PCA plot for samples | |
|
|
63 |
| [Visulz_bulkRNA_MAplot.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_MAplot.R) | RNA | MA plot for DEGs between group of samples | |
|
|
64 |
| [Visulz_bulkRNA_volcano.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_volcano.R) | RNA | Volcano plot for DEGs between group of samples | |
|
|
65 |
| [Visulz_bulkRNA_heatmap.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_heatmap.R) | RNA | Heatmap of given genes for samples | |
|
|
66 |
| [Visulz_bulkRNA_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_GO.R) | RNA | GO plot for DEGs | |
|
|
67 |
| [Visulz_bulkATAC_trackPlot.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_trackPlot.R) | ATAC | Track plot for samples | |
|
|
68 |
| [Visulz_bulkATAC_PCA.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_PCA.R) | ATAC | PCA plot for samples | |
|
|
69 |
| [Visulz_bulkATAC_heatmapPeak.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_heatmapPeak.R) | ATAC | Heatmap of given peaks for samples | |
|
|
70 |
| [Visulz_bulkATAC_heatmapTrack.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_heatmapTrack.R) | ATAC | Heatmap of given peak tracks for samples | |
|
|
71 |
| [Visulz_bulkATAC_peakAnnoDisp.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_peakAnnoDisp.R) | ATAC | Histogram for peak annotation distribution | |
|
|
72 |
| [Visulz_bulkATAC_motifEnrich.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_motifEnrich.R) | ATAC | Scatter plot for enriched TFs for peak sets | |
|
|
73 |
| [Visulz_bulkATAC_network.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_network.R) | ATAC | Network plot for peak sets | |
|
|
74 |
|
|
|
75 |
### Pipeline |
|
|
76 |
|
|
|
77 |
- bulk RNA-seq |
|
|
78 |
|
|
|
79 |
<img src="mermaidPlot\bulkRNApre.svg" align=center> |
|
|
80 |
|
|
|
81 |
- bulk ATAC-seq |
|
|
82 |
|
|
|
83 |
<img src="mermaidPlot\bulkATACpre.svg" align=center> |
|
|
84 |
|
|
|
85 |
- ChIP-seq |
|
|
86 |
|
|
|
87 |
<img src="mermaidPlot\ChIPpre.svg" align=center> |
|
|
88 |
|
|
|
89 |
- WGBS |
|
|
90 |
|
|
|
91 |
<img src="mermaidPlot\WGBSpre.svg" align=center> |
|
|
92 |
|
|
|
93 |
|