Comment on Self-Hosted Document & Budgeting Automation: Paperless-ngx, Firefly III, and n8n – Good Idea?
HotChickenFeet@sopuli.xyz 3 days ago
Paperless-ngx does include OCR, and it supports document types (which can have fields, etc), but there is no built-in way to intelligently extract field values. You can use the API (from Python, for example) to access & update the data and fields, so field extraction via your own code is feasible.
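As a rough sketch of what "your own code" could look like: the snippet below pulls a total out of OCR text with a regular expression. It assumes you have already fetched the document's OCR text through the API; the fetching and the write-back to a field are left out. The function name `extract_total` and the receipt layout it expects (a line starting with "Total") are my own assumptions, and real receipts will need more patterns than this.

```python
import re
from decimal import Decimal

# Hypothetical pattern: a line that starts with "Total" or "Grand total",
# followed by an amount with two decimals (e.g. "Total: $1,234.56").
TOTAL_RE = re.compile(
    r"(?im)^\s*(?:grand\s+)?total\b[^\d\n]*"
    r"(\d{1,3}(?:,\d{3})*\.\d{2}|\d+\.\d{2})\s*$"
)

def extract_total(ocr_text):
    """Return the last 'Total' amount found in the OCR text, or None."""
    matches = TOTAL_RE.findall(ocr_text)
    if not matches:
        return None  # nothing recognised: leave this one for manual entry
    # Use the last match, since the final total is usually printed at the bottom.
    return Decimal(matches[-1].replace(",", ""))
```

Returning `None` instead of guessing keeps unreadable receipts in the manual pile rather than letting a wrong number through.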
Given the large variety in receipt layouts & the potential for handwritten totals after tips, I’d encourage you to make manual inspection/correction of every processed receipt part of your workflow. At the bare minimum, include check-in points where you verify that your end balances match 100% after all transactions have been entered, so you can detect & root-cause errors ASAP.
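A check-in point like that can be very small. Here is a minimal sketch, assuming you can export the entered transaction amounts for an account and have the closing balance from your bank statement; the function name `reconcile` and its parameters are made up for illustration and are not part of Firefly III or n8n.

```python
from decimal import Decimal

def reconcile(opening_balance, amounts, statement_closing):
    """Return statement balance minus computed balance (0 means they match)."""
    # Decimal avoids the float rounding errors you get with money amounts.
    computed = opening_balance + sum(amounts, Decimal("0"))
    return statement_closing - computed

# Example: withdrawals are negative, deposits positive.
difference = reconcile(
    Decimal("100.00"),
    [Decimal("-4.50"), Decimal("-12.30"), Decimal("50.00")],
    Decimal("133.20"),
)
if difference != 0:
    print(f"Balances are off by {difference}, check recent receipts")
```

Running this after every batch keeps the set of receipts you have to re-check small when the numbers don't match.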
MIXEDUNIVERS@discuss.tchncs.de 3 days ago
Well, that is something I have also read online. That’s the reason why I’m thinking of using n8n.
That’s the workflow which should be feasible. I brainstormed it with AI (say what you want), and I think that’s the plan I’ll try to implement first.
tofu@lemmy.nocturnal.garden 2 days ago
You can try ofc, but OCR isn’t perfect. It sometimes mistakes an 8 for a 6 or the like. Just keep that in mind, depending on what your goal is.
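One cheap way to catch some of those misreads is to check whether the line items add up to the total that was read. This is only a sketch under the assumption that you managed to extract both the item prices and the total; `items_match_total` is a hypothetical helper, and receipts with tax or tip on separate lines would need those added to the items first.

```python
from decimal import Decimal

def items_match_total(items, total, tolerance=Decimal("0.00")):
    """True if the item prices sum to the total, within the given tolerance."""
    return abs(sum(items, Decimal("0")) - total) <= tolerance

items = [Decimal("3.80"), Decimal("2.50")]
print(items_match_total(items, Decimal("6.30")))  # total read correctly
print(items_match_total(items, Decimal("6.10")))  # e.g. a misread digit in the total
```

It won't catch everything (a misread item and a misread total can cancel out), so it is a filter for manual review, not a replacement for it.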