Comment on What is an efficient workflow to separate and organize bulk scanned PDF documents? (At work; software is limited.)

Sunsofold@lemmings.world ⁨4⁩ ⁨hours⁩ ago

Hmm. Well, first off, if you mean you don’t know how to write a script and don’t think it’s worth learning for this task, that limits your options a fair amount. If you mean you just don’t want to learn the particulars of script-based PDF editing or OCR, that’s understandable.

If you don’t want to script at all, you should be able to segment the PDFs in Acrobat, or even just ‘print to PDF’ with page ranges in most viewers. There are ways of bulk renaming files once you have segmented them, even without scripting, though whether and how that’d be useful to you depends on your use case.

If you want to script just a little, I made a script ages ago where I used each document’s file name to hold the metadata about what needed to be modified. You could certainly do that. For example: open the doc in one window, select the file for renaming in your file explorer, scroll through, and type the page numbers where each new document starts into the rename field [documentName3,7,15,22,29.PDF]. Then run a script to segment the PDF at those page numbers, so you end up with ‘documentName-1.PDF’ containing pages 1 to 2, another with pages 3 to 6, etc.
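To give a rough idea of what that could look like, here is a minimal Python sketch, not my original script. It assumes the page list sits at the very end of the file name, that the third-party pypdf library is installed, and that the function names (`parse_breaks`, `page_ranges`, `split_pdf`) are just made up for illustration. One caveat: a document name that itself ends in a digit would get mixed up with the page list, so you would want a separator in that case.

```python
import re
from pathlib import Path


def parse_breaks(filename):
    """Turn 'documentName3,7,15,22,29.PDF' into ('documentName', [3, 7, 15, 22, 29])."""
    stem = Path(filename).stem
    match = re.fullmatch(r"(.*?)(\d+(?:,\d+)*)", stem)
    if match is None:
        raise ValueError(f"no page list found at the end of {filename!r}")
    base = match.group(1)
    breaks = [int(number) for number in match.group(2).split(",")]
    return base, breaks


def page_ranges(breaks, total_pages):
    """Return 1-based, inclusive (first, last) page pairs, one per output file."""
    if any(page < 1 or page > total_pages for page in breaks):
        raise ValueError("a break page lies outside the document")
    starts = sorted(set([1] + breaks))  # the first segment always starts at page 1
    ends = [start - 1 for start in starts[1:]] + [total_pages]
    return list(zip(starts, ends))


def split_pdf(path):
    """Write documentName-1.pdf, documentName-2.pdf, ... next to the input file."""
    # Assumption: pypdf is available (pip install pypdf).
    from pypdf import PdfReader, PdfWriter

    path = Path(path)
    base, breaks = parse_breaks(path.name)
    reader = PdfReader(str(path))
    ranges = page_ranges(breaks, len(reader.pages))
    for number, (first, last) in enumerate(ranges, start=1):
        writer = PdfWriter()
        for index in range(first - 1, last):  # pypdf pages are 0-based
            writer.add_page(reader.pages[index])
        writer.write(str(path.with_name(f"{base}-{number}.pdf")))
```

You would rename the file by hand as described, then call something like `split_pdf("documentName3,7,15,22,29.PDF")`. Keeping the range arithmetic in its own function makes it easy to check the page math before touching any real scans.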

With a bit more effort you could maybe handle some level of renaming too, though how much use that would be depends on the particulars of your case. I could see extending the previous script a little and making the page annotations include a doc type, e.g. 13cn meaning ‘segment at page 13 and label it originalDocumentName-clientNotification’, or even 13’arbitraryText’, with the arbitrary text used as the new file name.
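A rough Python sketch of how those annotations might be parsed is below. It is only an illustration under some assumptions: each annotation is a page number followed by either a short letter code or text in plain single quotes, the `DOC_TYPES` table is hypothetical (only `cn` comes from the example above), and unknown codes are simply used as-is.

```python
import re

# Hypothetical lookup table; extend it with whatever doc types you actually have.
DOC_TYPES = {"cn": "clientNotification"}

# A page number, then optionally 'quoted text' or a letter code.
ANNOTATION = re.compile(r"(\d+)(?:'([^']+)'|([A-Za-z]+))?")


def parse_annotation(token, original_name):
    """Return (page, new_name); new_name is None when no label was given."""
    match = ANNOTATION.fullmatch(token.strip())
    if match is None:
        raise ValueError(f"cannot read annotation {token!r}")
    page = int(match.group(1))
    free_text, code = match.group(2), match.group(3)
    if free_text is not None:
        # 13'arbitraryText' -> use the quoted text as the whole file name
        return page, free_text
    if code is not None:
        # 13cn -> originalDocumentName-clientNotification
        return page, f"{original_name}-{DOC_TYPES.get(code, code)}"
    return page, None


def parse_annotations(annotation_list, original_name):
    """Parse a comma-separated list such as "3,7cn,15'arbitraryText'"."""
    return [parse_annotation(token, original_name) for token in annotation_list.split(",")]
```

Note that splitting on commas means the quoted text itself cannot contain a comma in this sketch, and your file system may not allow every character you would want in a label.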

The particularity of your case may be precisely why it hasn’t been automated yet.
