Problem
What made this worth building.
I record a lot of voice notes. Ideas while driving, meeting notes, thoughts I don't want to lose. But raw transcripts are just noise. Hours of speech become hundreds of lines that don't help. What was actually said? Who said it? What matters? Yes, Google and Microsoft have transcription. But I wanted to experience building a simple product myself — and I wanted the output to go beyond a wall of text into something actually structured and useful.