🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik” inside PodZeus.
Ron Alfa and Dan Baer of Noetic join the Latent Space podcast to discuss their mission of using AI to solve the 95% failure rate of cancer drug trials by fundamentally rethinking patient selection. They argue that most drug failures stem not from poor pharmacology, but from flawed patient cohort selection—driven by outdated models based on immortalized cell lines and animal studies that poorly reflect human biology. Noetic’s solution is to build a human-centric, multimodal foundation model trained on massive, high-quality data generated in-house, including spatial transcriptomics, H&E pathology, protein stains, and DNA data. Their approach uses self-supervised learning on over 100 million spatially resolved cells to discover biologically meaningful patient subtypes invisible to traditional biomarkers. The company has developed novel transformer architectures like Tario and OctoVirtualCell to simulate drug responses in silico, enabling virtual clinical trials and repurposing of existing drugs. A landmark $50M deal with GSK validates their model licensing strategy, marking a shift in pharma toward broad access to foundational AI models. The episode underscores the necessity of large-scale, intentional data generation—comparable to PDB or ImageNet—as the bedrock for transformative progress in AI for biology.
95% of cancer drugs fail in the clinic due to poor patient selection, not flawed drug design.
Noetic generates its own multimodal data (H&E, spatial transcriptomics, protein stains) at scale to train human-centric AI models.
Their foundation models discover biologically meaningful patient subtypes invisible to traditional biomarkers.
Virtual cell simulations allow in silico testing of drug responses without wet lab experiments.
A $50M deal with GSK marks the first major foundational model licensing deal in biopharma.
…and 1 more takeaway available in PodZeus
The 95% Failure Rate of Cancer Trials
“95% of cancer drugs fail in the clinic. Why do they fail? Not because we're bad at pharmacology... we're bad at selecting which patients those drugs are in.”
Building a Human-Centric Data Foundation
Noetic's core strategy is generating its own high-quality, multimodal data in-house. This includes sourcing human tumor samples, building custom processing pipelines, and generating paired data across H&E, spatial transcriptomics, and protein stains at scale.
The Power of Multimodal Data and Self-Supervised Learning
“We want the model to learn... how many different therapeutically relevant subtypes of lung cancers are just from self-supervised learning from the data.”
Virtual Cell Simulations and In Silico Trials
“You can simulate this sort of counterfactual perturbation idea without even having to collect the data to do that.”
Validating Models with In Vivo Perturbations
Noetic uses a mouse platform called PerturbMap to validate their models. By barcoding hundreds of genetically perturbed tumors in mice, they can test predictions about immune response and drug efficacy in a living system.
“95% of cancer drugs fail in the clinic. Why do they fail? Not because we're bad at pharmacology... we're bad at selecting which patients those drugs are in.”
“It was the first announced foundational licensing deal in the space. The substrate of the deal is not a molecule. It's a model.”
“We want the model to learn... how many different therapeutically relevant subtypes of lung cancers are just from self-supervised learning from the data.”
Hosts
Guests
Noetic
organization
Ron Alfa
person
H&E
other
Dan Baer
person
Spatial Transcriptomics
other
GSK
organization
PDB
other
OctoVirtualCell
other
Recursion
organization
Tario
other
Moonlake: Causal World Models should be Multimodal, Interactive, and Efficient — with Chris Manning and Fan-yun Sun
Latent Space: The AI Engineer Podcast • 1h 6m • 4/2/2026
Marc Andreessen introspects on The Death of the Browser, Pi + OpenClaw, and Why "This Time Is Different"
Latent Space: The AI Engineer Podcast • 1h 16m • 4/3/2026
Extreme Harness Engineering for Token Billionaires: 1M LOC, 1B toks/day, 0% human code, 0% human review — Ryan Lopopolo, OpenAI Frontier & Symphony
Latent Space: The AI Engineer Podcast • 1h 12m • 4/7/2026
Notion’s Token Town: 5 Rebuilds, 100+ Tools, MCP vs CLIs and the Software Factory Future — Simon Last & Sarah Sachs of Notion
Latent Space: The AI Engineer Podcast • 1h 17m • 4/15/2026
Shopify’s AI Phase Transition: 2026 Usage Explosion, Unlimited Opus-4.6 Token Budget, Tangle, Tangent, SimGym — with Mikhail Parakhin, Shopify CTO
Latent Space: The AI Engineer Podcast • 1h 12m • 4/22/2026
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “🔬 Training Transformers to solve 95% failure rate of Cancer Trials — Ron Alfa & Daniel Bear, Noetik” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
