Hacker News

Václav from Kyutai here. Thanks for the bug report! A workaround for now is to chunk the text into smaller parts, where the model is more reliable. We already do some chunking in the Python package. There's also a fancier way to chunk that uses teacher-forcing to ensure the stitched-together parts continue well, but we haven't implemented that yet.
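If you want to do this workaround yourself before feeding text to the model, a minimal sketch of sentence-boundary chunking might look like the following. This is my own illustration, not Kyutai's API: the function name and the character limit are assumptions, and the sentence splitter is deliberately naive.

```python
import re

def chunk_text(text: str, max_chars: int = 200) -> list[str]:
    """Split text into chunks of at most max_chars characters,
    breaking only at sentence boundaries so each chunk is a
    well-formed unit for the TTS model. (Illustrative sketch,
    not the Kyutai package's actual chunking logic.)"""
    # Naive sentence split: break after ., !, or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Current chunk is full; start a new one with this sentence.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk is then synthesized separately and the audio is concatenated; without the teacher-forcing trick mentioned above, prosody can still jump slightly at the seams.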




Is this just sort of expected for these models? Should users expect only truncation, or can hallucinated bits happen too?

I also find that Javert in particular seems to put in huge gaps and pauses... a side effect of the voice?



