Colin Whitlatch: Advanced In-Platform AI at Kahua
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Colin Whitlatch: Advanced In-Platform AI at Kahua” inside PodZeus.
In this episode of Constructed Futures, host Hugh Seaton sits down with Colin Whitlatch, CTO of Kahua, to explore how the company has rapidly integrated advanced in-platform AI into its enterprise software, not as a gimmick but as a natural evolution of its core mission. Whitlatch explains that Kahua’s AI strategy is rooted in decades of platform development focused on security, performance, and usability—values that now align perfectly with AI’s strengths in natural language processing and dynamic workflows. Rather than relying on third-party APIs, Kahua built a compliant, government-ready AI infrastructure using only two core capabilities: generative AI and vector embeddings, enabling secure, high-speed operations across both commercial and government networks. A key innovation is their 'ephemeral RAG' system, which creates temporary, on-the-fly vector databases from user-specific data, drastically reducing latency by eliminating external round trips. This, combined with fast JSON parsing and a 'mixture of assistants' architecture inspired by models like Gemini’s mixture of experts, allows for near-instant responses and context-aware task execution. The team also prioritized token efficiency from day one, avoiding wasteful practices like sending entire PDFs to LLMs, and introduced 'pseudo tool calls' to maintain security and performance by intercepting and optimizing AI-driven actions. Finally, they’ve implemented a natural language intent layer between user queries and AI processing, allowing the system to understand business context and deliver precise, concise results without overloading users with irrelevant information.
Build AI into your product only after establishing a strong foundation in security, performance, and usability.
Use ephemeral RAG to create on-the-fly vector databases from user-specific data, reducing latency and eliminating external dependencies.
Adopt a 'mixture of assistants' architecture to break down complex tasks into focused, fast-executing components.
Prioritize token efficiency from the start—avoid sending large documents to LLMs and optimize tool calls to reduce cost and delay.
Implement a natural language intent layer to interpret user goals before engaging AI, improving accuracy and relevance.
…and 2 more takeaways available in PodZeus
The AI Inflection Point at Kahua
Hugh Seaton introduces Colin Whitlatch, CTO of Kahua, and sets the stage by asking what drove Kahua’s rapid AI adoption over the past nine months, framing it as a strategic alignment with natural language and business logic rather than a tech trend.
Foundations Before AI: Security, Performance, and Usability
Whitlatch emphasizes that Kahua’s AI success stems from a mature platform built on security, performance, and usability—prerequisites that allowed AI to be integrated meaningfully rather than as an afterthought.
Building AI for Government Compliance
Kahua’s dual-network architecture (commercial and government) required a unique AI strategy. They built a compliant system using only two core AI components—generative models and vector embeddings—ensuring consistency and security across both environments.
Ephemeral RAG: The Secret to Lightning-Fast AI
“We can actually very quickly generate the vectors that are needed to be able to run these RAG queries. And once you have that available, you have it right there alongside your chat completions.”
Mixture of Assistants and Token Efficiency
“We don't want to get into that whole usage-based concept if we can avoid it. We're going to see where the world goes. But you're 1,000% right. And it all comes down to efficiency because efficiency, if you get it, that's where the speed comes from.”
“We know what's going on. We know what you want. And we can do it. Sometimes AI might be calling a tool to get a result. We actually may know better and say, ah, we already know what the result is so we need to show it.”
“We can actually very quickly generate the vectors that are needed to be able to run these RAG queries. And once you have that available, you have it right there alongside your chat completions.”
“We don't want to get into that whole usage-based concept if we can avoid it. We're going to see where the world goes. But you're 1,000% right. And it all comes down to efficiency because efficiency, if you get it, that's where the speed comes from.”
Host
Guest
Kahua
organization
Colin Whitlatch
person
Azure Gov
other
Gemini
other
Curt Conference
other
Western Winter Workshop
other
React
other
Netflix
organization
AWS
other
Dan Rosenberg-Legal Considerations for AI Use in Construction
Constructed Futures, A Quantum Rise Podcast • 31m • 4/13/2026
Alex Kelleher: Turning AI into Business Outcomes at Quantum Rise
Constructed Futures, A Quantum Rise Podcast • 24m • 4/20/2026
Tom Yeshurun: Layout for Civil Construction at Civ Robotics
Constructed Futures, A Quantum Rise Podcast • 23m • 4/28/2026
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Colin Whitlatch: Advanced In-Platform AI at Kahua” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
