As others mentioned, Tesseract is SOTA in FOSS OCR. It also still is being devel...

As others mentioned, Tesseract is SOTA in FOSS OCR. It also still is being developed, improving slow but constantly.

The main issue for a use-case like NormCap are the trained models: they are optimized for images of _printed_ text and layouts, which is different from on-screen-text in many aspects. Unfortunately, I don't have the resources to train my own models.

Cuneiform was a long time competitor, but afaik development there is stalled.