The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767

May 7, 2026·53 min

Episode Description from the Publisher

In this episode, Scott Clark, co-founder and CEO of Distributional, joins us to explore how teams can reliably operate and improve complex LLM systems and agents in production. Scott introduces a Maslow’s hierarchy of observability: telemetry for logging, monitoring for known signals, and post-production or online analytics to surface unknown unknowns. We dig into examples of real-world failures Scott’s team has seen in production systems, such as “lazy” tool-use hallucinations that standard evals miss, and how mapping traces into vector fingerprints enables clustering and topic discovery to uncover emergent behaviors. Scott explains how analytics can feed the data flywheel by generating evals, guardrails, and training data, and why online, adaptive approaches are essential for non-stationary models. We also touch on practical how-tos, such as instrumentation with OpenTelemetry, the GenAI semantic conventions, and the role of dedicated analytics tools. The complete show notes for this episode can be found at https://twimlai.com/go/767.
