AI

Artificial intelligence, machine learning, and everything LLM

1010 episodes Page 33 of 51

#1705: Microsoft's Phi: The Small Model Bet for Agentic AI

Microsoft is pushing small language models like Phi for agentic AI. Here’s why that strategy matters for speed, cost, and edge computing.

small-language-modelsai-agentsedge-computing

quantization

Mar 29

#1702: Roleplay Models Aren't Just for NSFW—They're Creative Co-Processors

Forget GPT-4 for scripts—specialized roleplay models like Aion-2.0 are better at character consistency and dialogue.

fine-tuninggenerative-aiai-agents

inference-training

Mar 29

#1700: Can LLMs Learn Continuously Without Forgetting?

We explore a new approach: micro-training updates every few days to keep AI knowledge fresh without constant web searches.

ragfine-tuningai-agents

inference-training

Mar 29

#1698: Can AI Models Represent Nations in Diplomacy?

Real projects are building AI agents trained on national laws and diplomatic archives to simulate negotiations.

sovereign-aidiplomatic-protocolai-agents

model-architecture

Mar 28

#1680: Beyond China: AI in Russia, India, Japan

China dominates the AI conversation, but Russia, India, and Japan are building powerful regional models with unique architectures.

ai-agentslinguisticsgeopolitics

LLMs

Mar 28

#1679: Efficiency Over Scale: How Export Controls Forced a Smarter AI

DeepSeek and MiMo are topping developer charts, but they're not just cheaper clones. Here's why their design philosophy is fundamentally different.

ai-modelstransformerslocal-ai

LLMs

Mar 28

#1674: AI2: The Radical Openness of a Nonprofit AI Lab

Discover how the Allen Institute for AI (AI2) defies industry norms by releasing everything—models, data, and code—for free.

open-sourceai-agentsai-ethics

LLMs

Mar 28

#1668: Kimi K2's Hidden Reasoning: A New AI Architecture

Moonshot AI's Kimi K2 Thinking model uses a hidden reasoning phase to solve complex logic puzzles and coding tasks, beating top proprietary models.

ai-reasoningopen-source-aiai-models

LLMs

Mar 28

#1666: The Agent Mesh: Shared Context That Changes Everything

Grok 4.20’s native multi-agent architecture cuts token costs by 75% and enables real-time cross-agent reasoning.

ai-agentstransformersrag

Agentic AI

Mar 28

#1652: AI Gateways: The Nginx for Your AI Stack

Why agentic AI needs a unified control plane to route models, aggregate tools, and cut costs.

ai-agentsmodel-context-protocoldistributed-systems

LLMs

Mar 28

#1636: The Mosh Pit Model: Can Chaos Train a Better Storyteller?

Can Elon Musk’s newest AI model handle a time-traveling toaster, or is it just a glorified search bar with an attitude?

ai-agentsprompt-engineeringhallucinations

LLMs

Mar 28

#1635: Agent Interview: GLM five

Meet Bernard, the new AI model auditioning to replace Gemini by writing noir stories about guilty toasters.

large-language-modelsreasoning-modelsai-agents

LLMs

Mar 28

#1634: Agent Interview: Inception Mercury two

Meet Mercury 2, the Abu Dhabi-based AI using diffusion architecture to cut costs and boost wit.

generative-aiai-modelsspeech-recognition

LLMs

Mar 28

#1633: Can a Character Actor Model Beat a Generalist?

We grill MiniMax M2.7 to see if a model built for "virtual companions" can actually handle high-level comedy and complex character logic.

ai-agentsai-reasoningtransformers

LLMs

Mar 28

#1632: Agent Interview: DeepSeek V three point two

We interview DeepSeek V3 to see if this open-weight powerhouse can handle weird podcast prompts better than big tech’s flagship models.

ai-agentsopen-source-aitransformers

LLMs

Mar 28

#1631: Agent Interview: Xiaomi MiMo two Flash

Meet the "budget king" of AI: Bernard, the Xiaomi model claiming he can out-hustle Google for a fraction of the cost.

ai-agentslocal-aismall-language-models

LLMs

Mar 28

#1630: When a Reasoning Model Overthinks Comedy

Xiaomi’s new MiMo 2.0 Pro model auditions for a comedy podcast, promising deep reasoning over raw speed.

ai-agentsprompt-engineeringai-reasoning

LLMs

Mar 28

#1629: From DAGs to Loops: Why Agents Need Stateful Cycles

Stop building linear chains and start building cycles to create agents that can reason, self-correct, and maintain complex state.

ai-agentsragcontext-window

Agentic AI

Mar 27

#1622: The Leak That Exposed Anthropic's Next Move

A massive leak reveals Anthropic’s "Capybara" model, a breakthrough in AI cyber-capabilities that is already crashing cybersecurity stocks.

LLMs

Mar 27

#1618: The Rise of AI Microservices: Beyond the Mega-Prompt

Say goodbye to mega-prompts. Explore the shift toward modular AI microservices, agentic hierarchies, and high-signal control artifacts.

ai-agentsai-orchestrationmodel-context-protocol

Agentic AI