Download this file
Run python ai_genomics/pipeline/samples/make_samples.py to generate a csv for each of the four AI Genomics datasets: patents, OpenAlex, GtR and Crunchbase. The csvs will be saved to S3 at inputs/ai_genomics_samples/.
python ai_genomics/pipeline/samples/make_samples.py
inputs/ai_genomics_samples/