i dont think op is looking to mirror archive.org, my take was that they wanted someyhing like archive.org but selfhosted and for personal / small-scale use
Comment on What software does Internet Archive run?
possiblylinux127@lemmy.zip 11 months agoArchive box is a piece of software and the Internet archive is a organization that is focused on predicting the content on the internet.
The Internet Archive has PBs worth of data. I doubt any home user could manage that.
kittykittycatboys@lemmy.blahaj.zone 11 months ago
avidamoeba@lemmy.ca 11 months ago
Exactly. I’m already running a local wiki, but I don’t want stuff I link to in my wiki to result in 404 in a few years. Or worse, to some AI-ridden ad-infested dumpster fire.
laserjet@lemmy.dbzer0.com 11 months ago
You can use something as simple as a browser extension like SingleFile that can automatically download complete, contained copies of anything bookmarked or only certain URLs.
z00s@lemmy.world 11 months ago
mosiacmango@lemm.ee 11 months ago
Protecting
recapitated@lemmy.world 11 months ago
They’re beating the algorithm