← All Tags

#gpu-acceleration

24 episodes

#1224: Cracking the CUDA Code: NVIDIA’s Software Dominance

Discover why NVIDIA’s CUDA is the oxygen of the AI industry and how tools like OpenAI’s Triton are finally challenging its 20-year software moat.

gpu-accelerationsemiconductorsparallel-computing

#1109: The T-FLOP Trap: Measuring the Power of Modern AI

Are teraflops the "horsepower" of AI, or just a marketing gimmick? Explore why raw compute speed isn't the whole story in the race for AI power.

gpu-accelerationarchitecturelarge-language-models

#1081: The K-V Cache: Solving AI’s Invisible Memory Tax

Why does your AI get slower as you chat? Discover the K-V cache, the invisible bottleneck of generative AI, and how we're fixing it in 2026.

architecturegpu-accelerationlocal-ai

#1021: Python: The Accidental King of Artificial Intelligence

Why did a 1980s hobby project become the backbone of AI? Explore the history of Python and the chaos of modern dependency management.

architecturegpu-accelerationdependency-management

#675: The Intelligence Factory: How AI is Rebuilding the Cloud

From liquid cooling to nuclear power, Herman and Corn explore how AI is transforming data centers into high-density "intelligence factories."

architecturegpu-accelerationenergy-infrastructure

#663: Workstation vs. Consumer: The Real Cost of Power

Is a high-end desktop enough, or do you need a workstation? Herman and Corn break down the "three pillars" of professional hardware.

architecturegpu-accelerationlocal-ai

#633: Memory Wars: The Future of Local Agentic AI

Can your PC handle the next wave of AI agents? Herman and Corn dive into VRAM, quantization, and the future of running LLMs locally.

ai-agentslocal-aigpu-acceleration

#484: The Silicon Sharing Economy: Inside Serverless GPUs

How do small teams run massive AI models without $50,000 chips? Corn and Herman dive into the hidden plumbing of serverless GPU providers.

cloud-computingai-inferencelatencygpu-accelerationinfrastructure

#170: The Heavy Metal of Machine Learning: Inside PyTorch

Discover why PyTorch is the "oxygen" of AI. Herman and Corn explore its history, the magic of Autograd, and the move to the PyTorch Foundation.

large-language-modelsgpu-accelerationarchitecture

#162: Beyond the Desktop: Defining the 2026 Workstation

Is your PC a workstation or just a fast desktop? Herman and Corn break down the hardware that defines professional computing in 2026.

local-aiarchitecturegpu-acceleration

#110: Building the Ultimate Local AI Inference Server

Learn how to build a high-performance local AI server for agentic coding, from dual-GPU PC builds to the power of Mac's unified memory.

local-aigpu-accelerationai-agents

#84: The Silicon Arms Race: Why GPUs are the New Oil

Are high-end microchips the new enriched uranium? Herman and Corn dive into the high-stakes world of GPU export bans and global AI supremacy.

gpu-accelerationsupply-chain-securityelectronic-warfare

#82: Why GPUs Are the Kings of the AI Revolution

From video game dragons to digital brains: Herman and Corn explain why your graphics card is the secret engine behind the AI boom.

gpu-accelerationlarge-language-modelsparallel-computing

#56: Building an AI Model from Scratch: The Hidden Costs

Building an AI model from scratch? It's a brutal reality of trillions of tokens and millions in GPUs. Discover the hidden costs of modern AI.

large-language-modelsgpu-accelerationfine-tuning

#55: Running Video AI at Home: The Real Technical Challenge

Video AI: Hype vs. Reality. Can your GPU handle it? We dive into the technical challenges of running video AI at home.

video-generationgpu-accelerationlocal-ai

#34: Red Team vs. Green: Local AI Hardware Wars

NVIDIA's CUDA rules AI, leaving AMD users battling a "green wall." Explore the hardware wars and thorny paths forward.

large-language-modelsgpu-accelerationhardware-acceleration

#31: ComfyUI: Power, Polish, & The AI Creator's Frontier

ComfyUI: Unlocking AI's true power, but is your rig ready? Dive into the future of digital artistry.

local-aigpu-accelerationprompt-engineering

#27: AMD AI: Taming Environments with Conda & Docker

Tired of AI environment headaches on AMD? We demystify Conda, Docker, and host environments to unlock your GPU's full potential.

gpu-accelerationdockerdependency-management

#25: GPU Brains: CUDA, ROCm, & The AI Software Stack

Unraveling how GPUs power AI. We dive into CUDA, ROCm, and the software stack that makes it all think.

gpu-accelerationparallel-computingsoftware-stack

#18: Beyond the GPU: Unpacking AI's Chip Revolution

Beyond the GPU: we're unpacking AI's chip revolution. Discover the crucial, often overlooked world of AI's fundamental building blocks.

gpu-accelerationarchitecturesemiconductors

#17: Cloud Render Superpowers: Local Edit, Remote Muscle

Unleash cloud superpowers! Edit locally, render remotely with AI-accelerated GPUs like NVIDIA A100s.

gpu-accelerationcloud-computingremote-rendering

#12: The AI Breakthrough: Transformers & The Perfect Storm

AI's everywhere. How did chatbots, art, and video all emerge so suddenly? The secret lies in Transformers and a perfect storm.

transformersfine-tuninggpu-acceleration

#6: How To Fine Tune Whisper

Build your own AI transcription tool! We'll walk you through fine-tuning Whisper, from data to notebook.

fine-tuningspeech-recognitiongpu-acceleration

#2: Local STT For AMD GPU Owners

AMD GPU? No problem! Dive into local AI adventures like on-device speech to text.

speech-recognitiongpu-accelerationlocal-ai