Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

As others mentioned, Tesseract is SOTA in FOSS OCR. It also still is being developed, improving slow but constantly.

The main issue for a use-case like NormCap are the trained models: they are optimized for images of _printed_ text and layouts, which is different from on-screen-text in many aspects. Unfortunately, I don't have the resources to train my own models.

Cuneiform was a long time competitor, but afaik development there is stalled.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: