Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Elevenlabs is a lot closer to compelling audiobook narration (needs a better way to deal with multiple characters in a story without manual use of multiple voices): https://pub-a24da573c61f4b2d905bdebb2d0ecf88.r2.dev/ElevenLa... (an H.G.Wells example I just asked it to read).



thanks!


I was going to mention ElevenLabs, too. Their samples are very impressive in how the intonation and word stress are varied based on the text’s meaning. Their pricing is a bit high for personal use, though.

(The link you posted seems to have been truncated. Can you try posting it again?)


Yeah, sadly it'd cost about $100 to get a book per month... Not quite competitive with Audible yet, but give it a year perhaps, or a few iterations of the open source models... (fixed the link)


100 dollars per book, right, but that book is public and can be shared between millions of people.


Any open source alternatives?


None of the open source models I've seen are as "well-rounded", production ready as Eleven Labs. Though for example bark is really great at prosody: https://suno-ai.notion.site/Bark-Examples-5edae8b02a604b54a4... And piper isn't bad at speech quality: https://rhasspy.github.io/piper-samples/

We might only be a few papers away from a good open source Elevenlabs competitor.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: