[f8624c]: / ai_genomics / pipeline / samples / README.md

Download this file

4 lines (2 with data), 270 Bytes

Making samples from the AI Genomics datasets

Run python ai_genomics/pipeline/samples/make_samples.py to generate a csv for each of the four AI Genomics datasets: patents, OpenAlex, GtR and Crunchbase. The csvs will be saved to S3 at inputs/ai_genomics_samples/.