Diff of /README.md [000000] .. [d06c2b]

Switch to unified view

a b/README.md
1
# Protocols-4pub
2
3
## *Reusable code collection*
4
5
##### 26 April, 2021
6
7
Multi-omics analysis protocols concluded by SajuukLyu <lvyulin@pku.edu.cn>.
8
9
---
10
11
*声明:该项目仅供学习用途,请共同遵守[开源协议](https://github.com/sajuukLyu/Protocols-4pub/blob/main/LICENSE),禁止用于商业盈利。未经作者同意请勿转载至其它媒体。*
12
13
---
14
15
Integrated step-by-step vignettes present on the [wiki](https://github.com/sajuukLyu/Protocols-4pub/wiki).
16
17
These files including three main parts:
18
19
- **Preprocess pipeline** (*etc. bulkRNA, bulkATAC, ChIP, WGBS ...*)
20
21
    These R scripts are designed to generate a directory tree containing ordered directories for intermediate data and Linux bash scripts for each step.
22
    In convenient to submit jobs to different nodes of a computer cluster at same time, commands for multiple samples can be divided to several batches and than submit together.
23
24
- **Downstream analysis script** (*etc. DEG, GO, GSEA ...*)
25
26
    These scripts are the framework of different analyses designed to cope with the most simple scene of biology analysis.
27
28
- **Visualization script** (*etc. heatmap, PCA plot, Venn plot, track plot ...*)
29
30
    These scripts can generate some plots suitable for articles with very few post processing.
31
32
### File description
33
34
- Preprocess pipeline
35
36
| file name                                                    | description                                                  |
37
| ------------------------------------------------------------ | ------------------------------------------------------------ |
38
| [bulkATACpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/bulkATACpre.R) | Preprocess pipelines for bulk ATAC-seq [*(vignettes)*](https://sajuuklyu.github.io/Protocols-4pub/exampleData/ATAC/bulkATACpre.html) |
39
| [bulkRNApre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/bulkRNApre.R) | Preprocess pipelines for bulk RNA-seq [*(vignettes)*](https://sajuuklyu.github.io/Protocols-4pub/exampleData/RNA/bulkRNApre.html) |
40
| [ChIPpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/ChIPpre.R) | Preprocess pipelines for ChIP-seq                            |
41
| [WGBSpre.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/a_PreprocessPipeline/WGBSpre.R) | Preprocess pipelines for Whole Genome Bisulfite Sequencing   |
42
43
- Downstream analysis script
44
45
| file name                                                    | omics | description                                           |
46
| ------------------------------------------------------------ | ----- | ----------------------------------------------------- |
47
| [downloadData.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/downloadData.R) | any   | Download data from public databases                   |
48
| [bulkRNAana_1_loadCount.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_1_loadCount.R) | RNA   | Load multiple results and convert to count matrix     |
49
| [bulkRNAana_2a_DESeq.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_2a_DESeq.R) | RNA   | Perform DESeq analysis (for sample without replicate) |
50
| [bulkRNAana_2b_DESeq2.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_2b_DEseq2.R) | RNA   | Perform DESeq2 analysis (for sample with replicates)  |
51
| [bulkRNAana_3_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkRNAana_3_GO.R) | RNA   | Perform GO analysis                                   |
52
| [bulkATACana_1_QC.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_1_QC.R) | ATAC  | Quality control                                       |
53
| [bulkATACana_2_loadCount.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_2_loadCount.R) | ATAC  | Load data and convert to peak matrix                  |
54
| [bulkATACana_3_annotatePeak.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_3_annotatePeak.R) | ATAC  | Annotate peaks to nearest genes                       |
55
| [bulkATACana_4_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_4_GO.R) | ATAC  | Perform GO analysis                                   |
56
| [bulkATACana_5_peakTF.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/b_DownstreamAnalysisScript/bulkATACana_5_peakTF.R) | ATAC  | Perform gene regulate network analysis                |
57
58
- Visualization script
59
60
| file name                                                    | omics | description                                    |
61
| ------------------------------------------------------------ | ----- | ---------------------------------------------- |
62
| [Visulz_bulkRNA_PCA.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_PCA.R) | RNA   | PCA plot for samples                           |
63
| [Visulz_bulkRNA_MAplot.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_MAplot.R) | RNA   | MA plot for DEGs between group of samples      |
64
| [Visulz_bulkRNA_volcano.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_volcano.R) | RNA   | Volcano plot for DEGs between group of samples |
65
| [Visulz_bulkRNA_heatmap.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_heatmap.R) | RNA   | Heatmap of given genes for samples             |
66
| [Visulz_bulkRNA_GO.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkRNA_GO.R) | RNA   | GO plot for DEGs                               |
67
| [Visulz_bulkATAC_trackPlot.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_trackPlot.R) | ATAC  | Track plot for samples                         |
68
| [Visulz_bulkATAC_PCA.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_PCA.R) | ATAC  | PCA plot for samples                           |
69
| [Visulz_bulkATAC_heatmapPeak.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_heatmapPeak.R) | ATAC  | Heatmap of given peaks for samples             |
70
| [Visulz_bulkATAC_heatmapTrack.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_heatmapTrack.R) | ATAC  | Heatmap of given peak tracks for samples       |
71
| [Visulz_bulkATAC_peakAnnoDisp.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_peakAnnoDisp.R) | ATAC  | Histogram for peak annotation distribution     |
72
| [Visulz_bulkATAC_motifEnrich.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_motifEnrich.R) | ATAC  | Scatter plot for enriched TFs for peak sets    |
73
| [Visulz_bulkATAC_network.R](https://github.com/sajuukLyu/Protocols-4pub/blob/main/c_VisualizationScript/Visulz_bulkATAC_network.R) | ATAC  | Network plot for peak sets                     |
74
75
### Pipeline
76
77
- bulk RNA-seq
78
79
<img src="mermaidPlot\bulkRNApre.svg" align=center>
80
81
- bulk ATAC-seq
82
83
<img src="mermaidPlot\bulkATACpre.svg" align=center>
84
85
- ChIP-seq
86
87
<img src="mermaidPlot\ChIPpre.svg" align=center>
88
89
- WGBS
90
91
<img src="mermaidPlot\WGBSpre.svg" align=center>
92
93