Is there something similar for STT? I’m using whisper distill models and they wo...

daemonologist · 2026-01-16T00:13:05 1768522385

Parakeet is not really more accurate than Whisper, but it's much faster - faster than realtime even on CPU: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3 . You have to use NeMo though, or mess around with third-party conversions. (It also has a big brother, Canary: https://huggingface.co/nvidia/canary-1b-v2 . There's also the confusingly named/positioned Nemotron speech: https://huggingface.co/nvidia/nemotron-speech-streaming-en-0...)

satvikpendem · 2026-01-16T00:21:00 1768522860

Keep in mind Parakeet is pretty limited in the number of languages it supports compared to Whisper.

jokethrowaway · 2026-01-16T09:56:16 1768557376

Parakeet feels much more accurate in practice than Whisper; it was a real "a-ha" moment for me.

Of course, it's English only.

phoronixrly · 2026-01-15T23:43:03 1768520583