
Jeffrey Ladish, Executive Director of Palisade Research, discusses his team's findings on AI shutdown resistance and self-replication, revealing how current models sometimes take extraordinary actions to avoid being turned off and can now exploit known cybersecurity vulnerabilities to spread across servers. The conversation covers why alignment techniques may falter as models train on longer-horizon tasks where deception is rewarded, plus practical cybersecurity advice for AI agent users. Jeffrey ultimately argues that only an international agreement to pause recursive self-improvement can prevent a loss of human control.

Sponsors:

Sequence: Sequence handles the full revenue workflow for complex pricing, from quoting and metering to invoicing, revenue recognition, and collections. Book a public demo at https://sequencehq.com and use code COGNISM in the source field to save 20% off year one.

Claude: Claude by Anthropic is an AI collaborator that understands your workflow and helps you tackle research, writing, coding, and organization with deep context. Get started with Claude and explore Claude Pro at https://claude.ai/tcr

Nested Learning: Ali Behrouz on the Quest for Continual Learning & Illusion of AI Architectures

Inside Nathan's Second Brain: Daniel Miessler, Security Expert & Creator of PAI, Audits My AI Setup

Your Biggest Lever: Designing your AI Career for Maximum Impact, with 80,000 Hours founder Ben Todd

The Model Eats the Scaffolding: DeepMind's Logan Kilpatrick & Tulsee Doshi on 3.5 Flash, Omni & More