Next-Token Predictor Is An AI's Job, Not Its Species
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Next-Token Predictor Is An AI's Job, Not Its Species” inside PodZeus.
Scott Alexander explores the central misconception that large language models (LLMs) are merely 'next token predictors,' arguing that this view confuses different levels of explanation. Drawing a parallel between human cognition and AI development, he illustrates how both are shaped by high-level optimization processes—evolution for humans, corporate profit motives for AI—while their inner workings involve complex, abstract representations like predictive coding in the brain and helical manifolds in AI. He emphasizes that just as humans don't consciously think about survival or reproduction when solving math problems, AIs don't 'think' in terms of next-token prediction when generating responses. Instead, both use sophisticated, often unintuitive internal mechanisms to model the world. The episode uses the example of Claude’s line-breaking behavior, which relies on 6D helical manifolds, to show how AI systems develop strange but effective computational structures. Ultimately, Alexander contends that labeling AI as a 'stochastic parrot' is a category error, akin to saying humans are just survival machines because evolution optimized for reproduction. The real intelligence lies in the high-level thought processes, not the low-level training mechanics.
Next token prediction is a training mechanism, not the internal experience of an AI—just as survival isn't the conscious thought behind human decisions.
Both humans and AIs develop complex, abstract world models (like predictive coding or helical manifolds) that operate far beyond their foundational optimization processes.
The inner workings of AI—such as 6D helical manifolds for line-breaking—are not literal 'next token' calculations but sophisticated computational hacks to represent real-world constraints.
Confusing the optimization level (e.g., evolution or profit motive) with the operational level (e.g., math or tiger avoidance) leads to flawed conclusions about consciousness or intelligence.
AI's 'thoughts' feel normal and intuitive—just like human thoughts—despite being built on fundamentally different low-level mechanisms.
Introduction: The Next Token Predictor Debate
Scott Alexander introduces the episode's central theme: the widespread claim that AIs are just 'next token predictors' and why this is a misunderstanding of levels of explanation.
Human Cognition as Next Sense Datum Prediction
Alexander draws a parallel between human brains and AI, showing that both use predictive coding—predicting the next sensory input—as a core learning mechanism, even though we don't consciously experience it.
AI's Evolution: From Profit Motive to Next Token Prediction
The AI equivalent of evolution is corporate profit motives. Companies train models via next token prediction, but this doesn't mean the AI's internal processes resemble literal token prediction.
The Inner World of AI: Helical Manifolds and Abstract Representations
“The AI represents various features of the line breaking process as one-dimensional helical manifolds in a six-dimensional space, then rotates the manifolds in some way that corresponds to multiplying or comparing the numbers that they're representing.”
Conclusion: Confusing Levels of Optimization
“There will be some algorithmic differences, and some of those might be important, but they're downstream of what specific prediction tasks each entity was trained on and what strengths and weaknesses their own evolutionary history gives them.”
“This is like expecting humans to be just survival and reproduction machines, because survival and reproduction were the optimization criteria in our evolutionary history.”
“The AI represents various features of the line breaking process as one-dimensional helical manifolds in a six-dimensional space, then rotates the manifolds in some way that corresponds to multiplying or comparing the numbers that they're representing.”
“The most compelling analogy? This is like expecting humans to be just survival and reproduction machines...”
Host
Scott Alexander
person
Evolution
other
Claude
other
Predictive Coding
other
Astral Codex Ten
media
AI Companies
organization
Tigers
other
Anthropic
organization
Entorhinal Cells
other
Kelsey Piper
person
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Next-Token Predictor Is An AI's Job, Not Its Species” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
