explicitly built into systems like CDNs, git, and blockchain
Git effectively only duplicates binary files: text deltas well once objects are packed, whereas binaries usually do not. And it’s bad practice to version-control binaries anyway: the more correct approach is to version the source from which the binary is generated.
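For what it's worth, here is a rough sketch of how Git names file contents, assuming the default SHA-1 object format. The helper name `git_blob_id` is made up for illustration and is not part of Git. The point is that the ID depends only on the bytes, so two identical files map to one stored object no matter how many paths or commits reference them.

```python
import hashlib


def git_blob_id(content: bytes) -> str:
    """Return the object ID Git would assign to this file content.

    Assumes the SHA-1 object format: the hash covers a small header
    ("blob <size>" followed by a NUL byte) and then the raw bytes.
    """
    header = b"blob " + str(len(content)).encode("ascii") + b"\0"
    return hashlib.sha1(header + content).hexdigest()


# Two files with identical bytes get the same ID, so Git keeps one object.
readme_a = b"hello world\n"
readme_b = b"hello world\n"
changed = b"hello world!\n"

print(git_blob_id(readme_a) == git_blob_id(readme_b))  # same content, same ID
print(git_blob_id(readme_a) == git_blob_id(changed))   # any change gives a new ID
```

You can compare the result against `git hash-object <file>` on a real file.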
nyan@lemmy.cafe 5 weeks ago
If the data has value, then yes, duplication is a good thing up to a point. The thesis is that only 10% of the data has value, though, and therefore duplicating the other 90% is a waste of resources.
The real problem is figuring out which 10% of the data has value, which may be more obvious in some cases than others.