Our analysis of startup activity in the intersection of AI and genomics relies on CrunchBase, a widely used database with information about just under 4.9 million technology organisations and startups and their investments. We have collected and stored all this data locally.
Each CrunchBase company is assigned a set of category labels which we have explored in order to identify relevant categories for us. While there is an "Artificial Intelligence" category in the data, the closest category to genomics that we are able to find is "Genetics". We have found 26,059 organisations in the Artificial Intelligence category, 1,546 organisations in the Genetics category, and 51 companies in their intersection.
We complement this with a text analysis of organisations descriptions also available from CrunchBase, which are available for just under 50% of the companies in the data. What we do here is to concatenate "short" and "long" CrunchBase descriptions and search, within them, for terms related to AI ("artificial intelligence", "machine learning", "deep learning", "data science') and to Genomics ("genomic","genome"). This reveals 21,195 organisations with AI-related terms in their descriptions, and 1,736 organisations with genomics-related terms. We find 113 organisations in the intersection.
We have provisionally tagged with the "AI" and "Genomics" category any companies that are either labelled with relevant categories or have relevant terms in their descriptions. This yields a combined list of 34,345 AI organisations, 2,768 genetics or genomics organisations and 199 organisations in the intersection between both categories. We present the names and excerpts from the descriptions of 5 randomly drawn companies from each group.
These examples illustrates some of the opportunities and challenges in the CrunchBase data: while the AI genomics companies that we have identified look generally relevant (with the exception of HelloGov, where the use of the term "Social genome" trips up our string-matching method), company descriptions are in several cases short and lack the detail which might be required for an in-depth quantitative analysis vs what might be possible with article and patent abstracts. We note that the number of observations in the data are small enough for manual review and classification in themes.
name | description_combined | category |
---|---|---|
Moodstocks | Moodstocks is a provider of an API and ready-to-use cross-platform SDK for developers to integrate scanning._Moodstocks bridges the physical and digital worlds thanks to smartphone image recognition. They provide an API and a ready-to-use cross-platform SDK so that any developers can integrate scanning in minutes. Their award winning platform is robust, real-time and scalable. Give it a try!... | ai_combined |
Kip | Kip is a smart assistant that helps you manage team supplies and coordinate food ordering for your team._Kip is a digital assistant that helps offices and organizations with collecting and coordinating their team shopping purchases. Teams add supplies and pantry essentials into a Kip standing order. Admins can send a last call that pings all members with final purchase requests and checkout an... | ai_combined |
Lexeme | Textual knowledge discovery and reading comprehension system for scientific and technical content_Lexeme, formerly known as SignalMaven, is a research driven technology company based in San Francisco. Founded in 2017, we develop software products to help scientists, engineers and STEM researchers comprehend complicated technical topics. Natural Language Understanding is in our DNA and we are heav... | ai_combined |
Linguistic Agents | Linguistic Agents is a privately held AI and natural language company based in Jerusalem, Israel. Linguistic Agents_Linguistic Agents is a privately held company based in Jerusalem, Israel. It is specializing in the field of artificial intelligence and natural language. ... | ai_combined |
FRINZA | World's 1st AI/ML oriented online gifting marketplace, based on an annual-recurring-subscription revenue model_nan... | ai_combined |
ZaBeCor Pharmaceuticals | ZaBeCor Pharmaceuticals, LLC, a clinical stage biotechnology company, focuses on the development of siRNA, antisense, and other molecular_ZaBeCor Pharmaceuticals, LLC, a clinical stage biotechnology company, focuses on the development of siRNA, antisense, and other molecular technologies for the treatment of asthma, allergy, and other inflammatory and infectious diseases. Its products include Exce... | gen_combined |
GenieUs | GenieUs is a bio-research company that uses the human genome to help neurodegenerative diseases._nan... | gen_combined |
Genex | Genex develops industry-leading animal genetics and customized reproductive solutions to the dairy farming industry._nan... | gen_combined |
KinDex Pharmaceuticals | KinDex Pharmaceuticals is a developer of novel therapeutics used to help patients suffering from metabolic disorders._KinDex Pharmaceuticals is a developer of novel therapeutics used to help patients suffering from metabolic disorders. It was established to focus on the discovery and development of molecules that improve women’s health and impact chronic disease. KinDex Pharmaceuticals is developi... | gen_combined |
Pencil Biosciences | Pencil Biosciences is a novel gene modulation company._Pencil Biosciences aims to develop a novel, programmable genome-modulation tool that overcomes the challenges associated with the technologies currently available.... | gen_combined |
Protera | Protera is advancing the science of protein engineering to address some of the most critical problems and opportunities._Protera is advancing the science of protein engineering to address some of the most critical problems and opportunities in food, agriculture, the environment and human health. We streamline protein design and development with MADI, our proprietary deep learning algorithm, to pre... | ai_gen_combined |
HelloGov | HelloGov is a full-stack AI/ML company, providing computer vision, NLP, and high quality training data for the public sector._HelloGov® is an SBA Certified Woman-Owned Small Business (WOSB) and customer-centric deep-tech company born out of the MIT Social Genome Project. Our mission is to build the nation's leading federal Artificial Intelligence (AI)/Machine Learning (ML) company while democratiz... | ai_gen_combined |
Hive | Hive Biosystems helps pharmaceutical companies streamline genetics-driven drug discovery by rapidly generating transgenic animals._Gene sequencing and CRISPR/Cas9 technologies have enabled affordable genome mapping and editing with unprecedented precision; however, the bottleneck in understanding human disease mechanisms and bringing breakthrough therapies to patients faster now lies in the manual... | ai_gen_combined |
GenomeKey | GenomeKey is an award-winning, early-phase company that is exploiting innovative genomic technology._GenomeKey is an award-winning, early-phase company that is exploiting innovative genomic technology and cutting-edge AI machine learning to address critical healthcare challenges.... | ai_gen_combined |
Janus Genomics | Janus Genomics is a company that builds AI tools for sharing of biomedical data while preserving data privacy._Janus Genomics is developing a platform that allows training of algorithms on a combination of different biomedical datasets. The technology enables companies to monetize their data and keep it private. Janus Genomics has received support and was accelerated by Entrepreneurs First, Max Pl... | ai_gen_combined |
:Examples of companies in different categories {#tbl:cb_examples}