|
a |
|
b/R/data.R |
|
|
1 |
#' CRISPR data |
|
|
2 |
#' |
|
|
3 |
#' Example training dataset consisting of a sequence of nucleotides of CRISPR loci |
|
|
4 |
#' Filtered for unambiguous characters and contains only characters in the vocabulary \{A,G,G,T |
|
|
5 |
#' \}. |
|
|
6 |
#' Can be loaded to workspace via `data(crispr_sample)`. |
|
|
7 |
#' @format Large character of 442.41 kB |
|
|
8 |
#' @usage data(crispr_sample) |
|
|
9 |
#' @references \url{https://github.com/philippmuench} |
|
|
10 |
"crispr_sample" |
|
|
11 |
|
|
|
12 |
#' Parenthesis data |
|
|
13 |
#' |
|
|
14 |
#' Training dataset of synthetic parenthesis language. |
|
|
15 |
#' Can be loaded to workspace via `data(parenthesis)`. |
|
|
16 |
#' @format Large character of 1.00 MB |
|
|
17 |
#' @usage data(parenthesis) |
|
|
18 |
#' @references \url{https://github.com/philippmuench} |
|
|
19 |
"parenthesis" |
|
|
20 |
|
|
|
21 |
#' Ecoli subset |
|
|
22 |
#' |
|
|
23 |
#' Subset of the E. coli genome for evaluation. |
|
|
24 |
#' Can be loaded to workspace via `data(ecoli_small)`. |
|
|
25 |
#' @format character 326.73 kB |
|
|
26 |
#' @usage data(ecoli_small) |
|
|
27 |
#' @references \url{https://www.science.org/doi/10.1126/science.277.5331.1453} |
|
|
28 |
"ecoli_small" |
|
|
29 |
|