AI

Artificial intelligence, machine learning, and everything LLM

413 episodes Page 3 of 17

#1668: Kimi K2's Hidden Reasoning: A New AI Architecture

Moonshot AI's Kimi K2 Thinking model uses a hidden reasoning phase to solve complex logic puzzles and coding tasks, beating top proprietary models.

ai-reasoningopen-source-aiai-models

#1666: Multi-Agent AI: One Model, Four Brains

Grok 4.20’s native multi-agent architecture cuts token costs by 75% and enables real-time cross-agent reasoning.

ai-agentstransformersrag

#1652: AI Gateways: The Nginx for Your AI Stack

Why agentic AI needs a unified control plane to route models, aggregate tools, and cut costs.

ai-agentsmodel-context-protocoldistributed-systems

#1636: Agent Interview: Grok four point one Fast

Can Elon Musk’s newest AI model handle a time-traveling toaster, or is it just a glorified search bar with an attitude?

ai-agentsprompt-engineeringhallucinations

#1635: Agent Interview: GLM five

Meet Bernard, the new AI model auditioning to replace Gemini by writing noir stories about guilty toasters.

large-language-modelsreasoning-modelsai-agents

#1634: Agent Interview: Inception Mercury two

Meet Mercury 2, the Abu Dhabi-based AI using diffusion architecture to cut costs and boost wit.

generative-aiai-modelsspeech-recognition

#1633: Agent Interview: MiniMax M two point seven

We grill MiniMax M2.7 to see if a model built for "virtual companions" can actually handle high-level comedy and complex character logic.

ai-agentsai-reasoningtransformers

#1632: Agent Interview: DeepSeek V three point two

We interview DeepSeek V3 to see if this open-weight powerhouse can handle weird podcast prompts better than big tech’s flagship models.

ai-agentsopen-source-aitransformers

#1631: Agent Interview: Xiaomi MiMo two Flash

Meet the "budget king" of AI: Bernard, the Xiaomi model claiming he can out-hustle Google for a fraction of the cost.

ai-agentslocal-aismall-language-models

#1630: Agent Interview: Xiaomi MiMo two Pro

Xiaomi’s new MiMo 2.0 Pro model auditions for a comedy podcast, promising deep reasoning over raw speed.

ai-agentsprompt-engineeringai-reasoning

#1629: Why Your AI Agent Needs Loops: A Deep Dive into LangGraph

Stop building linear chains and start building cycles to create agents that can reason, self-correct, and maintain complex state.

ai-agentsragcontext-window

#1622: Will Anthropic’s New "Capybara" Model Kill Cybersecurity?

A massive leak reveals Anthropic’s "Capybara" model, a breakthrough in AI cyber-capabilities that is already crashing cybersecurity stocks.

#1618: The Rise of AI Microservices: Beyond the Mega-Prompt

Say goodbye to mega-prompts. Explore the shift toward modular AI microservices, agentic hierarchies, and high-signal control artifacts.

ai-agentsai-orchestrationmodel-context-protocol

#1612: Why Your AI is Using a Spoon to Use Your PC

Is the era of the app over? Explore how AI agents are transforming operating systems from static tools into proactive digital partners.

ai-agentsmodel-context-protocoloperating-systems

#1611: AI with a Conscience: Anthropic’s War with the Pentagon

Anthropic fights the Pentagon to keep Claude’s "conscience" intact. Discover the tech and philosophy behind AI’s first digital constitution.

#1610: Mistral AI: Europe’s High-Stakes Play for AI Sovereignty

Explore how Mistral AI is challenging Silicon Valley with efficient models, strategic partnerships, and the new Voxtral voice model.

sovereign-aidata-sovereigntysmall-language-models

#1609: IBM Granite 4.0: The Industrial Workhorse of Business AI

Forget flashy chatbots. Discover how IBM is building high-efficiency, industrial-grade AI models designed to run the world's biggest businesses.

large-language-modelsstate-space-modelsfine-tuning

#1607: NVIDIA’s $26 Billion Pivot: From Chips to AI Models

NVIDIA is moving beyond chips to build the "brains" of AI. Explore the $26B shift into models, robotics, and the new Rubin platform.

#1606: DeepSeek’s Return: V4, R2, and the AI Pricing War

DeepSeek returns with a trillion-parameter model and rock-bottom pricing. Explore the tech behind V4 and the mystery of the Hunter Alpha leak.

large-language-modelsai-agentsgeopolitics

#1605: Alibaba’s Qwen 3.5: The New King of Intelligence Density

Alibaba’s Qwen 3.5 is rewriting the AI rulebook. Discover how small models are outperforming giants through extreme "intelligence density."

#1604: The $3 Billion Stealth Giant: AI21 Labs & Nvidia

Why is Nvidia eyeing a $3B deal for AI21 Labs? Discover the tech behind the "OpenAI of Israel" and their revolutionary hybrid architecture.

large-language-modelsstate-space-modelstransformers

#1603: Fire Your Software Subscriptions and Just Code the Vibe

Tired of the SaaS tax? Discover how AI is turning software from a product you buy into a capability you manifest.

#1602: Grok 4.20: Agentic AI and the Battle for the Truth

Explore xAI’s shift to multi-agent systems and the massive hardware powering Grok 4.20, even as it hits a legal brick wall in Europe.

ai-agentsai-reasoninghigh-performance-computing

#1601: Cohere: The Switzerland of Enterprise AI

While others chase viral memes, Cohere is quietly building the secure, cloud-agnostic infrastructure powering the global enterprise.

ragspeech-recognitiondefense-technology

#1599: Can Xiaomi’s $1 Brain Outsmart OpenAI in the Real World?

Xiaomi’s MiMo-V2 is here. Discover how the "Agent Era" is turning hardware into a trillion-parameter brain for your home and car.

ai-agentslarge-language-modelselectric-vehicles