--- a
+++ b/report_pfam/description.txt
@@ -0,0 +1,14 @@
+The aligned protein sequences and their families from Pfam:
+ftp://ftp.ebi.ac.uk/pub/databases/Pfam/releases/Pfam32.0/Pfam-A.seed.gz
+
+Stockholm format is a multiple sequence alignment format used by Pfam.
+See: https://en.wikipedia.org/wiki/Stockholm_format
+
+Attribute description:
+- sequence: amino acid sequence (corresponding to a domain)
+- sequencename: UniProtID/start-end position of the domain
+- family_id: short name of the protein family
+- family_accession: PFam family identifier (PFxxxxx.y)
+- aligned_sequence: single sequence from the multiple sequence alignment
+                    (with the rest of the members of the family in seed),
+                    with gaps retained.