▶
AI-powered Semantic Integration of Public Research Data
9 million biomedical samples already exist. Most researchers still cannot meaningfully use them.
AI is shifting public research data from static archives into searchable discovery engines.
Arjun Krishnan (Associate Professor, Biomedical Informatics at University of Colorado Anschutz) presents: From Data Archives to Discovery Engines: AI-powered Semantic Integration of Massive Public Research Data.
Timestamps:
00:00 The problem with massive public biomedical data
02:10 Why biomedical metadata is broken
05:47 Using LLMs for semantic annotation and dataset discovery
07:34 Cross-species transfer learning and gene networks
11:30 AI discovery through worms, zebrafish, and disease mapping
14:20 From static archives to AI-powered discovery engines
Public biomedical datasets contain decades of untapped biological signals, but fragmented metadata, inconsistent terminology, and incomplete annotations make large-scale discovery extremely difficult.
The core argument is that AI should not only generate new content. It should unlock the scientific value already buried inside existing datasets.
Arjun Krishnan explains how large language models, graph neural networks, and semantic integration pipelines can transform unstructured biomedical metadata into searchable, biologically meaningful knowledge systems. The objective is not better keyword search, but contextual scientific discovery across genomics, transcriptomics, proteomics, and clinical data.
The talk also explores cross-species biological transfer learning, where AI models map relationships between humans and model organisms such as mice, worms, flies, and zebrafish to uncover non-obvious disease mechanisms and therapeutic insights.
A recurring theme throughout the presentation: the bottleneck in biomedical AI is increasingly not data generation, but data accessibility, interoperability, and semantic understanding.
📌 Applied Healthcare AI Summit 2026 — what actually works in real-world healthcare AI, from pilots to production systems.
#BiomedicalAI #HealthcareAI #LLM #Bioinformatics #MachineLearning #Genomics #AIResearch
