
#speech recognition

12 episodes

The Unseen Magic of AI's Ears: Decoding VAD

Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.

voice activity detection, VAD, speech recognition, ASR, speech-to-text

Personalizing Whisper: The Voice Typing Revolution

Voice typing is changing everything. Join us as we explore the revolution of personalizing Whisper!

speech-recognition, fine-tuning, transformers

Mic Check: Mastering AI Dictation Hardware

Uncover the secrets to perfect AI dictation! Corn and Herman explore the ultimate speech-to-text hardware.

large-language-models, speech-recognition, audio-hardware

AI Gets Personal: The Power of Voice Fine-Tuning

AI that understands *your* voice? Dive into the fascinating world of fine-tuning and discover how AI gets personal.

fine-tuning, speech-recognition, personalized-ai

Building Custom ASR Tools

Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!

ASR, speech recognition, custom asr, machine learning, speech to text

If Your Voice Ages, Does Your Fine-Tune Become Useless?

Your voice changes, but your fine-tuned model shouldn't become useless. We explore the biology of the larynx and ASR.

speech-recognition, fine-tuning, vocal-physiology

Local STT For AMD GPU Owners

AMD GPU? No problem! Dive into local AI adventures like on-device speech-to-text.

speech-recognition, gpu-acceleration, local-ai

How To Fine Tune Whisper

Build your own AI transcription tool! We'll walk you through fine-tuning Whisper, from data to notebook.

fine-tuning, speech-recognition, gpu-acceleration

Fine-Tuning ASR For Maximal Usability

Fine-tuned ASR is just the start. Discover the next steps for deployment and maximizing usability.

ASR, speech recognition, fine-tuning, deployment, usability

Benchmarking Custom ASR Tools - Beyond The WER

Benchmarking custom ASR fine-tunes: We're diving deep beyond the WER to truly measure performance.

ASR, benchmarking, wer, speech recognition, fine-tuning

Safetensors or something else: STT inference formats explained

Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.

safetensors, ASR, speech recognition, inference, weight formats

Building Your Own Whisper

Ever wondered if you could build your own speech recognition tool? We dive deep into crafting custom ASR.

ASR, speech recognition, whisper, machine learning, audio processing