I have bunch of textbooks, and a lot of lecture notes and notes from colleagues, all in PDF format. What is a good way to classify, manage, store, and read these PDF files? I am trying calibre-web, but it seems difficult to find applications to connect to it.
I believe this new project should hit your need quite well!
Papra is quite new in the selfhosted sphere but a welcome addition. Yet to test it myself but it sounds and looks very promising > github.com/papra-hq/papra
starkzarn@infosec.pub 1 day ago
Paperless-ngx! github.com/paperless-ngx/paperless-ngx
Feddinat0r@feddit.org 1 day ago
I second this. Using this for about half an year as my full document store, letters, anything.
Search is great, lovin it
non_burglar@lemmy.world 19 hours ago
Paperless-ngx is great, but it is particularly bad at handling PDF documents. Roughly half my documents just won’t import.
github.com/paperless-ngx/paperless-ngx/…/3933
reddit.com/…/paperlessngx_not_all_pdf_files_can_b…
github.com/paperless-ngx/paperless-ngx/…/2187
b3an@lemmy.world 21 hours ago
I third this! I saw title and came to say.
It’s actively being developed still, I get emails like once every 1–3 weeks, sometimes more. Sometimes less.
I use docker desktop for this. I also lowkey learned how to set up a multi-database for this at one point, but kinda stopped after I got it working. More to see if I could.
I also tried bare metal building this, but had shit luck. It’s been a couple years though. Docker just makes it easy as hell.
I still keep all the originals separate just in case, and the tool can help you make multiple copies too (like PDF-A). I’ve never needed to go back and use those though, as Paperless just works so well once you get the hang of it and how you want your data stored.
I picked a structure that kind of lets me find stuff easily even if the tool is not running (like just by folder structures).
I’ve yet to make this online available for obvious reasons. But it would be nice to be able to pull up pretty much any document you need, any time.
Any suggestions on safe web access quickly from a phone might be helpful (WireGuard maybe?) if you have them.
nickiam2@aussie.zone 17 hours ago
Tailscale is how I access my server. I’ve got a domain name that points to the internal tailscale IP address, but that’s not really necessary
walden@sub.wetshaving.social 21 hours ago
For remote access, wireguard is great. You can access stuff via their internal addresses.