--- a
+++ b/data/README_data.txt
@@ -0,0 +1,28 @@
+
+s1_mapped1.txt: Domain 1 of Simulation 1, branching tree (originally from Liu et al 2019, MMD-MA paper)
+s1_mapped2.txt: Domain 2 of Simulation 1, branching tree (originally from Liu et al 2019, MMD-MA paper)
+s1_label1.txt: Group labels for Domain 1 of Simulation 1
+s1_label2.txt: Group labels for Domain 2 of Simulation 1
+
+s2_mapped1.txt: Domain 1 of Simulation 2, Swiss roll (originally from Liu et al 2019, MMD-MA paper)
+s2_mapped2.txt: Domain 2 of Simulation 2, Swiss roll (originally from Liu et al 2019, MMD-MA paper)
+s2_label1.txt: Group labels for Domain 1 of Simulation 2
+s2_label2.txt: Group labels for Domain 2 of Simulation 2
+
+s3_mapped1.txt: Domain 1 of Simulation 3, circular frustum (originally from Liu et al 2019, MMD-MA paper)
+s3_mapped2.txt: Domain 2 of Simulation 3, circular frustum (originally from Liu et al 2019, MMD-MA paper)
+s3_label1.txt: Group labels for Domain 1 of Simulation 3
+s3_label2.txt: Group labels for Domain 2 of Simulation 3
+
+scGEM_expression.csv:	Gene expression (scRNA-seq) domain of the scGEM dataset
+scGEM_typeExpression.txt:	Cell-type labels for the gene expression (scRNA-seq) domain of the scGEM dataset
+scGEM_methylation.csv:	DNA methylation (scMethyl-seq) domain of the scGEM dataset
+scGEM_typeMethylation.txt:	Cell-type labels for the DNA methylation (scMethyl-seq) domain of the scGEM dataset
+Label to cell-type mapping:	1: BJ, 2:d8, 3: d16T+, 4: d24T+, 5:iPSc
+
+
+scatac_feat.npy:  Chromatin accessibility (snATAC-seq) domain of the SNARE-seq dataset (in numpy array format)
+SNAREseq_atac_types.txt:  Cell-type labels for the gene expression domain of the SNARE-seq dataset
+scrna_feat.npy:	  Gene expression (scRNA-seq) domain of the SNARE-seq dataset (in numpy array format)
+SNAREseq_rna_types.txt:	  Cell-type labels for the chromatin accessibility domain of the SNARE-seq dataset (same as the gene expression domain because gene expression domain proved more reliable for cell-type annotation)
+SNAREseq_metadata.txt: 	Some metadata on the SNARE-seq datasets, such as the label number -- cell-type mapping and the barcode names for cells (in order)