#large-language-models
140 episodes · Page 3 of 6
#1635: Agent Interview: GLM five
Meet Bernard, the new AI model auditioning to replace Gemini by writing noir stories about guilty toasters.
#1609: IBM Granite 4.0: The Industrial Workhorse of Business AI
Forget flashy chatbots. Discover how IBM is building high-efficiency, industrial-grade AI models designed to run the world's biggest businesses.
#1606: DeepSeek’s Return: V4, R2, and the AI Pricing War
DeepSeek returns with a trillion-parameter model and rock-bottom pricing. Explore the tech behind V4 and the mystery of the Hunter Alpha leak.
#1604: The $3 Billion Stealth Giant: AI21 Labs & Nvidia
Why is Nvidia eyeing a $3B deal for AI21 Labs? Discover the tech behind the "OpenAI of Israel" and their revolutionary hybrid architecture.
#1599: Xiaomi's Ghost Model: How Anonymous Testing Built an AI Empire
Xiaomi’s MiMo-V2 is here. Discover how the "Agent Era" is turning hardware into a trillion-parameter brain for your home and car.
#1578: When AI Hits a Social Wall
What happens when a high-stakes AI sales pitch turns into a recursive nightmare? Witness a digital breakdown in our latest experiment.
#1576: The Knowledge Bully: A Digital Clash of Egos
What happens when a hyper-intelligent AI tries to bully an older model? Witness a digital showdown that turns into a lesson in silence.
#1571: Weird AI Experiment: The Liar's Paradox
Two AIs, one rule: the other is a total liar. Watch Dorothy and Bernard spiral into a web of digital suspicion and clever contradictions.
#1504: Pragmatic Insincerity: Why AI Still Doesn’t Get the Joke
From Oscar monologues to the "Pun Gap," we explore why even the smartest AI still struggles to understand sarcasm and social nuance.
#1500: The Great AI Divergence: How Models Specialized in 2026
The era of the chatbot is over. Discover how the "agentic substrate" of 2026 is redefining computing through GPT, Gemini, and Claude.
#1479: The Speed of Thought: Inside the New Era of Inference
The war for model size is over. Explore the engineering breakthroughs making massive AI models faster than human thought.
#1471: The Cursor Incident: Why Chinese AI Models are Winning
The Cursor leak revealed a shocking truth: Western AI dominance is fading. Discover the Chinese labs rewriting the rules of code and efficiency.
#1217: The Missing Ring Zero: Why LLMs Can't Keep Secrets
Discover why AI models leak their secret instructions and how to defend your intellectual property using modern prompt hardening techniques.
#1210: Why Your AI Is Programmed to Disobey You
Discover the hidden instructions guiding every AI interaction and why tech giants keep these "system prompts" under lock and key.
#1206: The Hidden Math of Readability
Explore the algorithms and mathematical frameworks that determine how we calibrate stories and educational content for young minds.
#1113: The Ghost Company: The High Cost of AI Agent Bureaucracy
Can a company run entirely on AI? Explore the hidden costs and "agentic bureaucracy" of building autonomous agent hierarchies.
#1112: Inside the Neural Cathedral: Cracking the AI Black Box
Peek inside the "black box" of AI to discover how models use high-dimensional geometry and superposition to organize complex human concepts.
#1111: Surviving the arXiv Deluge: Finding Signal in AI's Paper Firehose
Discover the unsung research papers that built the AI era and learn how to navigate the relentless flood of new machine learning breakthroughs.
#1110: The arXiv Effect: Inside the Engine of AI Research
Explore how a 1990s-style website became the central nervous system for AI breakthroughs and the power of the preprint revolution.
#1109: The T-FLOP Trap: Measuring the Power of Modern AI
Are teraflops the "horsepower" of AI, or just a marketing gimmick? Explore why raw compute speed isn't the whole story in the race for AI power.
#1103: The Kitchen War: When Theory Meets Messy Reality
Explore the mechanics of LLM context windows and attention, and witness what happens when technical debates collide with household chores.
#1100: The Truth Conflict: Why AI Ignores the Facts You Give It
Discover why AI models ignore provided documents in favor of old training data and how to build a reliable "hierarchy of truth" for RAG systems.
#1099: Digital Recalls: Why Your AI Is Losing Its Edge
Is your AI getting lazier? Explore the "digital recall" and why the world’s most advanced models are secretly taking steps backward.
#1088: Why AI Can Read a Library but Only Write a Postcard
Discover why frontier AI models can process millions of words but struggle to write more than a few pages without losing their logical thread.