You’ll pry my kitten pictures from my cold dead hands!
Data centers contain 90% crap data
Submitted 5 weeks ago by fantawurstwasser@feddit.org to technology@lemmy.world
https://gerrymcgovern.com/data-centers-contain-90-crap-data/
Comments
0x0@lemmy.zip 5 weeks ago
Vortieum@sopuli.xyz 5 weeks ago
Solutions?
vane@lemmy.world 5 weeks ago
rm -rf /data
mr_jaaay@lemmy.ml 5 weeks ago
I’m imagining Data from Star Trek being deleted…
Captain, this is most illogical.
sugar_in_your_tea@sh.itjust.works 5 weeks ago
That depends on the problem.
I disagree w/ the author that storing blurry cat memes is what’s “destroying our environment.” Transportation is our biggest net polluter in terms of CO2, which is higher than all electrical generation combined. If we’re want to solve CO2 emissions, we have to solve transportation, since that’s the 500 pound gorilla in the room.
If we look specifically at datacenters, storage makes up a tiny fraction of the overall energy use. That article mentions that datacenters probably have a similar CO2 footprint as the aviation industry, which makes up about 2.5% of the world’s carbon emissions, or about 10% of the total transportation emissions from the above link.
If the goal is to fix climate change, data centers are pretty far down the list in terms of priorities. Higher priorities are, roughly in this order:
- ground transportation - electrify or switch to something like hydrogen
- electrical power generation - this will directly reduce the impact of data centers, be part of 1, and solve a number of other issues
- residential heating - switch from fossil fuels to heat pumps for heating, which should be a relatively “drop-in” replacement and could save customers money
- industry - largely solved by 2, but there may need to be some shifts in certain types of production processes to reduce emissions
Changing anything about data centers is way down the list of priorities, and it’ll be largely solved by something much higher up. So it’s really the wrong target to attack.
Neon@lemmy.world 5 weeks ago
You forget the production and disposal stage of datacenters which are the biggest polluters.
partial_accumen@lemmy.world 5 weeks ago
Solutions?
Carbon tax.
In this micro example, imagine if you could access all of your data for free when there as abundant sunshine (carbon free), or had to pay for carbon based energy at night. You’d start to sort your data for what you really wanted so that you’d only be paying a small amount for a small amount of data.
HubertManne@piefed.social 5 weeks ago
I don't see one unless our society because less dependent on bullshit and honors privacy. I don't know about anyone else but I constantly bullshit specifics about myself on line to dirty up any data collected on me.
TacticalCheddar@lemm.ee 5 weeks ago
We fully transition to clean energy like nuclear and build more power plants to allow us to store our online stuff.
The author of this article is not a serious person. He’s in the same bucket as Greta Thurnberg. They just like to scream and blame people instead of providing practical solutions. It’s frankly tiring to hear them.
partial_accumen@lemmy.world 5 weeks ago
He’s in the same bucket as Greta Thunberg. They just like to scream and blame people instead of providing practical solutions.
Greta Thunberg is 22 years old right now, and was “screaming” and “blaming people” when she was 11 years old.
She saw the world she was going to inherit and forced conversation to work toward solutions. Expecting an 11 year old to provide answers that none of the established world has is silly.
kkj@lemmy.dbzer0.com 5 weeks ago
Thunberg’s solution has always been “listen to the experts who have been screaming at you for 50 years.” You don’t have to be an expert to care about things or to want to listen to people who are experts.
acosmichippo@lemmy.world 5 weeks ago
charge more to customers for long term data storage.
Fluffy_Ruffs@lemmy.world 5 weeks ago
How do you differentiate old from new? I can just create a fresh copy of whatever I’m storing and it’ll look new.
nyan@lemmy.cafe 5 weeks ago
Massive deduplication across all accounts on all servers of image, audio, and video data would theoretically be possible, but ain’t gonna happen. Or we could just discourage people from posting cat videos and bad memes (even less likely to happen).
lemmyng@lemmy.ca 5 weeks ago
I would argue that duplication of content is a feature, not a bug. It adds resilience, and is explicitly built into systems like CDNs, git, and blockchain (yes I know, blockchains suck at being useful, but nevertheless the point is that duplication of data is intentional and serves a purpose).
Brkdncr@lemmy.world 5 weeks ago
Deduplication is trivial when applied at the block level, as long as the data is not encrypted, or is encrypted at rest by the storage system.
nyan@lemmy.cafe 5 weeks ago
Sturgeon’s Law in action again.
Skullgrid@lemmy.world 5 weeks ago
1980s-2000s : the information age
2000s-present : the data age.
Information implies it’s correct, data implies it can be anything.
HubertManne@piefed.social 5 weeks ago
aughts were not bad but it was falling and once we got in the teens ugh. oh and old man thing the pre www was advertisement free which was awesome.
Skullgrid@lemmy.world 5 weeks ago
sure. the cut off can be somewhere around there, start can be earlier too.