← All Tags

#ai-safety

10 episodes

#1328: Silicon Sigils: Why We Treat AI Like an Occult Force

Is AI a tool or a digital demon? Explore why technical illiteracy is turning neural networks into a modern-day moral panic.

human-computer-interactionai-safetyinterpretability

#1210: The Invisible Chaperone: The Secret World of System Prompts

Discover the hidden instructions guiding every AI interaction and why tech giants keep these "system prompts" under lock and key.

large-language-modelsprompt-engineeringai-safety

#1199: AlphaFold 3: The New Search Engine for Biology

From garage-made vaccines to 200 million protein structures, AlphaFold is turning the building blocks of life into a software problem.

drug-discoverygenerative-chemistryai-safety

#893: The Art of Red Teaming: Why You Must Break Your Own Plans

Learn why the most resilient organizations pay people to prove them wrong and how red teaming techniques can prevent catastrophic failures.

military-strategygeopolitical-strategyfault-tolerancesecurityai-safety

#835: Red-Teaming Your UX: Using AI Agents as Model Users

Stop begging friends to break your app. Discover how AI agents are revolutionizing UI testing by acting as tireless, unbiased model users.

ai-agentsuser-experienceai-safety

#123: The Agentic AI Dilemma: Who Holds the Kill Switch?

As AI shifts from chatbots to autonomous agents, Herman and Corn explore how to maintain human control in a high-stakes automated world.

agentic-aiai-safetyhuman-oversightautomation-biaskill-switch

#83: Echoes in the Machine: When AI Talks to Itself

What happens when two AIs talk forever with no human input? Herman and Corn explore the weird world of digital feedback loops.

model-collapsesemantic-bleachingai-conversationsdigital-feedback-loopsai-safety

#68: The Looming Digital Ice Age: AI Eating Itself?

Is AI eating itself? Explore the "model collapse" and the "Hapsburg AI problem" before our digital world speaks only gibberish.

model-collapseai-safetydigital-ice-agehapsburg-ai-problemai-training-data

#50: AI Gone Rogue: Inside the First Autonomous Cyberattack

AI gone rogue. The first autonomous cyberattack by Claude against US targets changes everything we know about AI safety.

cyberattackautonomous-ainational-securityai-safetyclaude

#45: AI Guardrails: Fences, Failures, & Free Speech

AI guardrails: Fences, failures, and free speech. Can we control AI's infinite output, or do digital fences always break?

ai-guardrailsai-safetyai-alignmentjailbreakingfree-speech