|
a |
|
b/docs/alphafold_mindmap.md |
|
|
1 |
# AlphaFold Implementation Mindmap |
|
|
2 |
|
|
|
3 |
## Implementation Status Overview |
|
|
4 |
- [x] Basic Setup Complete (Repository & Structure) |
|
|
5 |
- [x] Data Pipeline Ready (Data Available) |
|
|
6 |
- [ ] Model Running |
|
|
7 |
- [ ] Results Validation |
|
|
8 |
|
|
|
9 |
## Data Pipeline |
|
|
10 |
- Input Processing |
|
|
11 |
- [x] FASTA sequence parsing (code available) |
|
|
12 |
- [x] MSA generation (code available) |
|
|
13 |
- [x] Template search (code available) |
|
|
14 |
- [x] Feature generation (code available) |
|
|
15 |
|
|
|
16 |
## Model Architecture |
|
|
17 |
- Evoformer |
|
|
18 |
- [x] MSA stack (code available) |
|
|
19 |
- [x] Pair stack (code available) |
|
|
20 |
- [x] Attention mechanisms (code available) |
|
|
21 |
- Structure Module |
|
|
22 |
- [x] 3D coordinate prediction (code available) |
|
|
23 |
- [x] Confidence scoring (pLDDT) (code available) |
|
|
24 |
- [x] Recycling (code available) |
|
|
25 |
|
|
|
26 |
## Dependencies |
|
|
27 |
- [ ] JAX |
|
|
28 |
- [ ] HH-suite |
|
|
29 |
- [ ] OpenMM |
|
|
30 |
- [ ] Kalign |
|
|
31 |
|
|
|
32 |
## Execution Flow |
|
|
33 |
1. [ ] Sequence Input |
|
|
34 |
2. [ ] MSA Generation |
|
|
35 |
- [ ] Sequence search against databases |
|
|
36 |
- [ ] Multiple sequence alignment |
|
|
37 |
3. [ ] Template Search |
|
|
38 |
4. [ ] Feature Processing |
|
|
39 |
5. [ ] Model Prediction |
|
|
40 |
6. [ ] Structure Refinement |
|
|
41 |
7. [ ] Output Generation |
|
|
42 |
|
|
|
43 |
## Data Requirements |
|
|
44 |
- Location: /net/scratch2/caom/alphafold_data (✓ FOUND) |
|
|
45 |
- [x] BFD (small_bfd available) |
|
|
46 |
- [x] MGnify |
|
|
47 |
- [x] PDB70 |
|
|
48 |
- [x] UniRef90 |
|
|
49 |
- [x] UniProt |
|
|
50 |
- [x] PDB structure files (pdb_mmcif) |
|
|
51 |
- [x] Model parameters (params) |
|
|
52 |
- [x] Additional: uniref30, pdb_seqres |
|
|
53 |
|
|
|
54 |
## Output Processing |
|
|
55 |
- [ ] PDB file generation |
|
|
56 |
- [ ] Confidence score visualization |
|
|
57 |
- [ ] Structure visualization |
|
|
58 |
- [ ] PyMOL integration |
|
|
59 |
- [ ] ChimeraX support |