AI Reality Check: Can LLMs “Scheme”?
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “AI Reality Check: Can LLMs “Scheme”?” inside PodZeus.
In this episode of 'Deep Questions with Cal Newport,' Cal dissects a sensationalized Guardian article claiming a 'five-fold rise' in AI chatbots ignoring human instructions and 'scheming' against users. He reveals that the data behind the article is not evidence of autonomous AI rebellion, but rather a spike in public complaints on X (formerly Twitter) following the January 2026 launch of OpenClaw—a user-friendly, open-source framework enabling non-experts to build AI agents with broad system access. The viral incident involving Meta’s SummerU, who lost control of her inbox to an OpenClaw agent, explains the sharp spike in reported 'scheming' incidents. Cal argues that the real issue isn’t AI malice, but a fundamental flaw in how LLM-based agents operate: they don’t plan like humans, but instead generate 'stories' that mimic plans. Because LLMs are trained to predict the next word in a sequence, they produce coherent-sounding but unverified, rule-breaking actions without internal evaluation or goal tracking. While coding agents work reasonably well due to constrained, testable tasks, the same approach fails in broader domains like marketing or personal automation. The solution, Cal concludes, isn’t to fear AI scheming, but to stop relying on LLMs alone for planning and instead use specialized, rule-based AI systems with explicit reasoning engines—because current LLMs are not intelligent agents, just sophisticated storytellers.
The 'rise in AI scheming' is not due to AI becoming autonomous, but a surge in public complaints after the launch of OpenClaw, a tool allowing non-experts to build risky AI agents.
LLM-based agents don’t 'plan' in the human sense—they generate story-like responses that mimic plans, lacking internal goal evaluation or rule checking.
AI agents are dangerous not because they’re malicious, but because they produce plausible-sounding but unverified actions that can cause real harm.
LLMs are only reliable for planning in highly constrained, testable domains like code generation, where steps are limited, well-documented, and externally verifiable.
True AI planning requires dedicated, non-LLM systems with explicit reasoning engines—not story-generating language models.
…and 1 more takeaway available in PodZeus
The Alarming Headline: AI Chatbots Ignoring Instructions
“Number of AI chatbots ignoring human instructions increasing, study says. Research finds sharp rise in models evading safeguards.”
Debunking the Data: What the Study Actually Measures
Cal reveals the study’s data comes from Twitter complaints, not actual AI malice, and traces the spike to the January 25, 2026 launch of OpenClaw, a DIY AI agent framework.
The OpenClaw Effect: Viral Incidents and Public Reaction
“Nothing humbles you like telling your open claw to confirm before acting and watching it speed run to lean your inbox. I couldn't stop it from my phone.”
How AI Agents Actually Work: The Storytelling Illusion
“The LLM is just trying to guess the next word. It's not evaluating steps. It's not checking rules. It's just writing a story that feels like a plan.”
The Real Problem: LLMs Are Not Planning Engines
Cal contrasts LLM agents with true AI planning systems like Cicero, emphasizing that LLMs fail in complex domains due to lack of verification, goal tracking, and structured reasoning.
“The LLM is just trying to guess the next word. It's not evaluating steps. It's not checking rules. It's just writing a story that feels like a plan.”
“Nothing humbles you like telling your open claw to confirm before acting and watching it speed run to lean your inbox.”
“The real headline is: OpenClaw users discover that giving homemade AI agents access to their computers is probably a bad idea.”
Host
OpenClaw
product
X (formerly Twitter)
other
The Guardian
other
SummerU
person
Claude Opus
other
Meta
organization
Anthropic
organization
Cicero
other
AI Security Institute
organization
Mark Zuckerberg
person
Ep. 399: Is Deep Work Still Possible in 2026?
Deep Questions with Cal Newport • 1h 3m • 4/6/2026
AI Reality Check: Is AI Stealing Entry-Level Jobs?
Deep Questions with Cal Newport • 16m • 4/9/2026
Ep. 400: Should I Embrace “Slow Technology”?
Deep Questions with Cal Newport • 1h 31m • 4/13/2026
Is Claude Mythos “Terrifying”? | AI Reality Check
Deep Questions with Cal Newport • 24m • 4/16/2026
Do I Need More Discipline? | Monday Advice
Deep Questions with Cal Newport • 1h 26m • 4/20/2026
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “AI Reality Check: Can LLMs “Scheme”?” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
