a b/docs/source/notes/process_data.rst
1
Processing Data
2
======================================================================================================
3
4
We have downloaded most of the small dataset in the repository. 
5
6
Drug-Target Binding Benchmark Dataset
7
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
8
9
10
We list public **Drug-Target Binding Benchmark Dataset** 
11
that is supported by DeepPurpose and 
12
corresponding downloading and processing function. 
13
14
===============   ===============================================================================================================
15
   Dataset                                        downloading and processing Function
16
===============   ===============================================================================================================
17
   BindingDB                         download_BindingDB() to download the data and process_BindingDB() to process the data
18
   DAVIS                             load_process_DAVIS() to download and process the data
19
   KIBA                               load_process_KIBA() to download and process the data
20
===============   ===============================================================================================================
21
22
* **Download Link**
23
24
    * `BindingDB <https://www.bindingdb.org/bind/index.jsp>`_ 
25
    * `DAVIS <http://staff.cs.utu.fi/~aatapa/data/DrugTarget/>`_
26
    * `KIBA <https://jcheminf.biomedcentral.com/articles/10.1186/s13321-017-0209-z>`_
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
Repurposing Dataset
47
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
48
49
We list public **Repurposing Dataset** 
50
that is supported by DeepPurpose and 
51
corresponding downloading and processing function. 
52
53
54
=======================================   =====================================================================
55
           Dataset                                           downloading and processing Function   
56
=======================================   =====================================================================
57
     Curated Antiviral Drugs Library              load_antiviral_drugs() to load and process the data  
58
       Broad Repurposing Hub                   load_broad_repurposing_hub() downloads and process the data  
59
=======================================   =====================================================================
60
61
* **Download Link**
62
63
    * `Curated Antiviral Drugs Library <https://en.wikipedia.org/wiki/List_of_antiviral_drugs>`_ 
64
    * `Broad Repurposing Hub <https://www.broadinstitute.org/drug-repurposing-hub>`_
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
Bioassay Data for COVID-19
88
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
89
90
=========================   =====================================================================
91
           Dataset                       downloading and processing Function   
92
=========================   =====================================================================
93
           AID1706                load_AID1706_SARS_CoV_3CL() to load and process 
94
=========================   =====================================================================
95
96
97
98
* **Download Link**
99
100
    * `AID1706 <https://pubchem.ncbi.nlm.nih.gov/bioassay/1706>`_ 
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
COVID-19 Targets
121
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
122
123
===============================   =====================================================================
124
           Dataset                       downloading and processing Function   
125
===============================   =====================================================================
126
  SARS-CoV 3CL Protease               load_SARS_CoV_Protease_3CL()
127
  SARS-CoV2 3CL Protease              load_SARS_CoV2_Protease_3CL()
128
  SARS_CoV2 RNA Polymerase            load_SARS_CoV2_RNA_polymerase()
129
  SARS-CoV2 Helicase                  load_SARS_CoV2_Helicase()
130
  SARS-CoV2 3to5_exonuclease          load_SARS_CoV2_3to5_exonuclease()
131
  SARS-CoV2 endoRNAse                  load_SARS_CoV2_endoRNAse()
132
===============================   =====================================================================
133
134
135
136
137
138
139
140
141
142
143
.. toctree::
144
   :glob:
145
   :maxdepth: 1
146
   :caption: Technical Details of Function
147
148
   data/read_file_training_dataset_bioassay  
149
   data/read_file_training_dataset_drug_target_pairs
150
   data/read_file_virtual_screening_drug_target_pairs
151
   data/read_file_repurposing_library
152
   data/read_file_target_sequence
153
   data/download_BindingDB
154
   data/process_BindingDB
155
   data/load_process_DAVIS
156
   data/load_process_KIBA
157
   data/load_AID1706_txt_file 
158
   data/load_AID1706_SARS_CoV_3CL
159
   data/load_antiviral_drugs
160
   data/load_broad_repurposing_hub
161
162
163
164
165
166
167
168
169
170
171