Comment on Selfhosted & AI

<- View Parent
midribbon_action@lemmy.blahaj.zone ⁨23⁩ ⁨hours⁩ ago

Don’t think that’s true. You can run the whole form through, come out with an identical pdf with searchable/copyable text. Even a completely novel form uses the same alphabet. Add some regex to pull out the fields you need to enter, and on failure give it to a human. All of that can be done with python on a raspberry pi. A decade ago.

github.com/ocrmypdf/OCRmyPDF

original
Sort:hotnewtop