
Free Daily Podcast Summary
by Julian Goldie
Latest Podcast
The most recent episodes — sign up to get AI-powered summaries of each one.
Odysseus vs Hermes AI Agent: Side-by-Side Test (Which One Wins?)The video compares PewDiePie’s new open-source Odysseus AI agent with the open-source Hermes Agent by testing them side by side. Odysseus is presented as a polished, self-hosted standalone app with a fixed UI, built-in tools like email and calendar, and a setup geared toward running local models, but it can feel messy due to pop-up tool windows and has a more manual, non-live memory approach. Hermes is positioned as a more configurable standalone agent that can run in a terminal, desktop app, or within an agent operating system, supports pluggable memory and cloud or local models, can run in the background 24/7, and integrates with other agents and systems. Using the same model in both produced similar general responses, but the creator prefers Hermes for customization, orchestration, and long-term flexibility.00:00 Odysseus vs Hermes00:29 Getting Odysseus Setup00:47 App Design Differences01:32 UI Workflow Issues02:02 Memory and Skills03:00 Solo App vs Orchestration03:46 Local vs Cloud Models04:57 Performance and Customization05:48 Who Each Is For06:42 Future Proofing Agents07:40 Why Custom Systems Win08:53 Verdict and Wrap Up09:07 Boardroom and Resources09:41 Final Goodbye
NVIDIA Nemotron-3 Ultra Is Free in Hermes Agent (Setup + Why It Matters)The script introduces NVIDIA Nemotron-3 Ultra, a newly released open-source 550B mixture-of-experts frontier model now available for free for two weeks through News Portal and usable inside Hermes Agent. It explains that the model is designed for long-running agentic work (planning, tool use, failure recovery) and is advertised as five times faster while using 30% fewer tokens on agent tasks by activating only the needed experts. The walkthrough shows how to enable it via Hermes Agent’s new Mission Control dashboard by going to Manage → Models and selecting Nemotron-3 Ultra, then testing it in chat. It highlights potential benefits for Hermes “goal mode,” mentions benchmark comparisons against GLM 5.1, Qwen 3.5, and others, and notes it’s also available on Hugging Face for local hosting.00:00 Nemotron Free Drop00:37 Mission Control Setup00:59 Why It Matters01:48 MoE Speed Explained02:49 Three Step Install03:25 Real World Tests04:18 Goal Mode Power05:16 Benchmarks Breakdown06:07 Wrap Up Recap06:13 Community Pitch07:06 Final Goodbye
Stop Typing! Build AI Voice Agents with MiniMax M3 & OpenClawLearn how to build and interact with real-time AI voice agents using the new MiniMax M3 update for OpenClaw and Hermes. This hands-free workflow allows you to automate tasks and research in real-time without ever touching your keyboard.00:00 - Intro: Real-Time Voice Chat Demo01:11 - Voice Agent Mastery Explained01:57 - Why MiniMax M3 is the Most Agentic Model02:43 - Testing Deep Voice & Social Search03:12 - Operating AI Agents from Your Phone03:37 - The Voice-to-Text-to-Speech Workflow04:43 - Recap: Hands-Free AI Interaction05:55 - Advanced Training & Community Access
Headroom: Free Open-Source Tool to Cut AI Agent Token Use by 60–95%This episode introduces Headroom, a free open-source project trending on GitHub that compresses what AI agents read to reduce token usage by 60–95% without losing meaning. It’s described as a “zip file” for agent context—files, tools, search results, logs, and conversation history—helping agents run faster, cost less (especially via APIs), and forget less due to context-window limits. The script claims Headroom can plug into many agents and tools (including Claude Code, Codex, Cursor, OpenClaude, and Hermes) and offers proof examples such as 92% token savings and compressing 10,144 words to 1,260 while finding the same log error. It outlines a three-step approach: crush, keep (shared reversible memory), and compound (learn from failures).00:00 Token Saving Breakthrough01:07 Why Agents Burn Tokens03:00 Headroom Zip Compression03:03 Quick Install Demo03:27 Three Key Benefits03:39 Goldie Framework Steps04:31 Proof and Benchmarks05:31 Common Objections Answered06:03 Recap and Next Steps06:45 Boardroom Offer and Outro
Odysseus (PewDiePie’s Open-Source AI Agent): Local ChatGPT-Style Workspace TestedThe video tests Odysseus, PewDiePie’s free, open-source, self-hosted AI workspace that runs locally (offline) like ChatGPT or Claude but on your own computer with your data. It shows the app’s retro, somewhat messy UI and key features including chat, an agent for tasks, deep research, document writing, memories/skills, notes, tasks, calendar, email, a library for documents, scheduled task triggers, and model switching. Setup is shown via GitHub instructions (Docker or native macOS), creating an admin account, adding an API “brain,” and running on a local address. The script demonstrates using local models (including Gemma) or cloud models via OpenRouter, the “cookbook” for selecting downloadable local models, and a side-by-side model comparison, concluding that it works and is lightweight aside from local model requirements.00:00 Meet Odysseus00:21 What It Is00:43 Setup Options01:12 Tour The Interface02:38 Tasks And Scheduling03:39 Chat And Agent Test04:38 Cookbook And Compare05:16 Six Step Setup05:56 Who Its For06:32 Recap And Next Steps06:51 Community And Training07:51 Final Thanks
Run Gemma 4 Locally in Hermes Agent (Free AI Automation + New Web UI Setup)The script explains how to combine Google’s newly released Gemma 4 12B model with Hermes Agent to run a free, local AI agent designed for agentic reasoning and automation. It shows examples built with Hermes Agent and an agentic operating system, including games, a Pomodoro timer, a color palette, and animations. Setup is demonstrated via Ollama (download Ollama, run the command to launch Hermes with Gemma 4) or through Hermes Agent’s new web UI to select Gemma 4 as the model. It also suggests configuring a stronger main model with Gemma 4 as a sub-agent to save tokens, and notes an alternative free API option via OpenRouter for Gemma 4 26B/31B if local hardware is limited.00:00 Free Agents With Gemma00:36 What You Can Build00:54 Local Setup With Ollama01:15 Web UI Model Switching01:32 Main Model And Subagent02:07 Free API Option03:01 Body And Brain Explained04:09 Automation Use Cases05:13 Offline And Token Savings06:15 Chat Versus Agent06:34 Hermes Dashboard Tour07:20 SOP Pairing Steps07:53 Benchmarks And Ecosystem08:24 Recap And Offline Travel08:51 Training And Community Pitch09:46 Final Thanks
Headroom: Free Open-Source Tool to Cut AI Agent Token Use by 60–95%This episode introduces Headroom, a free open-source project trending on GitHub that compresses what AI agents read to reduce token usage by 60–95% without losing meaning. It’s described as a “zip file” for agent context—files, tools, search results, logs, and conversation history—helping agents run faster, cost less (especially via APIs), and forget less due to context-window limits. The script claims Headroom can plug into many agents and tools (including Claude Code, Codex, Cursor, OpenClaude, and Hermes) and offers proof examples such as 92% token savings and compressing 10,144 words to 1,260 while finding the same log error. It outlines a three-step approach: crush, keep (shared reversible memory), and compound (learn from failures).00:00 Token Saving Breakthrough01:07 Why Agents Burn Tokens03:00 Headroom Zip Compression03:03 Quick Install Demo03:27 Three Key Benefits03:39 Goldie Framework Steps04:31 Proof and Benchmarks05:31 Common Objections Answered06:03 Recap and Next Steps06:45 Boardroom Offer and Outro
Gemma 4 12B Just Dropped: Free Agentic Open-Source Model You Can Run LocallyGoogle has released Gemma 4 12B, a new free, open-source, agentic model you can access now and run locally, and the creator demonstrates building real apps inside an agent operating system by plugging Gemma 4 into Hermes Agent. Examples shown include a mouse-follow animation app, a color palette designer, a Pomodoro timer, generative art, a simple game, a website, and a wallpaper, emphasizing improved usefulness versus earlier Gemma versions. The script explains how to get Gemma 4 via Ollama (recently updated), or use free Gemma 4 API options like a 26B model via OpenRouter if you lack powerful hardware, and highlights core specs such as 16GB memory and a 256K context window along with benchmark claims around 77% reasoning and ~72% coding.00:00 Gemma 4 Drops00:32 Demos Built in Hermes01:35 Get It via Ollama02:20 What Gemma 4 Is02:56 Using It in AOS03:50 Goldie Pocket Genius06:50 Setup and Integrations07:33 Use Cases and Limits09:08 Benchmarks and Multimodal10:04 Recap and Community11:23 Final Goodbye
AI-powered recaps with compact key takeaways, quotes, and insights.
Get key takeaways from AI News Today | Julian Goldie Podcast in a 5-minute read.
Stay current on your favorite podcasts without falling behind.
It's a free AI-powered email that summarizes new episodes of AI News Today | Julian Goldie Podcast as soon as they're published. You get the key takeaways, notable quotes, and links & mentions — all in a quick read.
When a new episode drops, our AI transcribes and analyzes it, then generates a personalized summary tailored to your interests and profession. It's delivered to your inbox every morning.
No. Podzilla is an independent service that summarizes publicly available podcast content. We're not affiliated with or endorsed by Julian Goldie.
Absolutely! The free plan covers up to 3 podcasts. Upgrade to Pro for 15, or Premium for 50. Browse our full catalog at /podcasts.
AI News Today | Julian Goldie Podcast publishes daily. Our AI generates a summary within hours of each new episode.
AI News Today | Julian Goldie Podcast covers topics including Business, Marketing. Our AI identifies the specific themes in each episode and highlights what matters most to you.
Free forever for up to 3 podcasts. No credit card required.
Free forever for up to 3 podcasts. No credit card required.