Comment on What is an efficient workflow to separate and organize bulk scanned PDF documents? (At work; software is limited.)

Sunsofold@lemmings.world ⁨7⁩ ⁨hours⁩ ago

I know there are scripting ways to work with PDFs. I was listening to someone talking just earlier about using a script and a localhosted LLM to organise and rename PDFs with author and title. If you can identify some kind of patterns (such as a heading that starts each document of a type) that you can detect, a script could find those pages and then feed that into something that will segment page ranges for each doc. It’s definitely possible but the patterns to look for will be determined by the docs you are looking at.

source
Sort:hotnewtop