Open Menu
AllLocalCommunitiesAbout
lotide
AllLocalCommunitiesAbout
Login

Data centers contain 90% crap data

⁨153⁩ ⁨likes⁩

Submitted ⁨⁨5⁩ ⁨weeks⁩ ago⁩ by ⁨fantawurstwasser@feddit.org⁩ to ⁨technology@lemmy.world⁩

https://gerrymcgovern.com/data-centers-contain-90-crap-data/

source

Comments

Sort:hotnewtop
  • Skullgrid@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

    1980s-2000s : the information age

    2000s-present : the data age.

    Information implies it’s correct, data implies it can be anything.

    source
    • HubertManne@piefed.social ⁨5⁩ ⁨weeks⁩ ago

      aughts were not bad but it was falling and once we got in the teens ugh. oh and old man thing the pre www was advertisement free which was awesome.

      source
      • Skullgrid@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

        sure. the cut off can be somewhere around there, start can be earlier too.

        source
  • 0x0@lemmy.zip ⁨5⁩ ⁨weeks⁩ ago

    You’ll pry my kitten pictures from my cold dead hands!

    source
  • Vortieum@sopuli.xyz ⁨5⁩ ⁨weeks⁩ ago

    Solutions?

    source
    • vane@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

      rm -rf /data

      source
      • mr_jaaay@lemmy.ml ⁨5⁩ ⁨weeks⁩ ago

        I’m imagining Data from Star Trek being deleted…

        Captain, this is most illogical.

        source
    • sugar_in_your_tea@sh.itjust.works ⁨5⁩ ⁨weeks⁩ ago

      That depends on the problem.

      I disagree w/ the author that storing blurry cat memes is what’s “destroying our environment.” Transportation is our biggest net polluter in terms of CO2, which is higher than all electrical generation combined. If we’re want to solve CO2 emissions, we have to solve transportation, since that’s the 500 pound gorilla in the room.

      If we look specifically at datacenters, storage makes up a tiny fraction of the overall energy use. That article mentions that datacenters probably have a similar CO2 footprint as the aviation industry, which makes up about 2.5% of the world’s carbon emissions, or about 10% of the total transportation emissions from the above link.

      If the goal is to fix climate change, data centers are pretty far down the list in terms of priorities. Higher priorities are, roughly in this order:

      1. ground transportation - electrify or switch to something like hydrogen
      2. electrical power generation - this will directly reduce the impact of data centers, be part of 1, and solve a number of other issues
      3. residential heating - switch from fossil fuels to heat pumps for heating, which should be a relatively “drop-in” replacement and could save customers money
      4. industry - largely solved by 2, but there may need to be some shifts in certain types of production processes to reduce emissions

      Changing anything about data centers is way down the list of priorities, and it’ll be largely solved by something much higher up. So it’s really the wrong target to attack.

      source
      • Neon@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

        You forget the production and disposal stage of datacenters which are the biggest polluters.

        source
        • -> View More Comments
    • partial_accumen@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

      Solutions?

      Carbon tax.

      In this micro example, imagine if you could access all of your data for free when there as abundant sunshine (carbon free), or had to pay for carbon based energy at night. You’d start to sort your data for what you really wanted so that you’d only be paying a small amount for a small amount of data.

      source
    • HubertManne@piefed.social ⁨5⁩ ⁨weeks⁩ ago

      I don't see one unless our society because less dependent on bullshit and honors privacy. I don't know about anyone else but I constantly bullshit specifics about myself on line to dirty up any data collected on me.

      source
    • TacticalCheddar@lemm.ee ⁨5⁩ ⁨weeks⁩ ago

      We fully transition to clean energy like nuclear and build more power plants to allow us to store our online stuff.

      The author of this article is not a serious person. He’s in the same bucket as Greta Thurnberg. They just like to scream and blame people instead of providing practical solutions. It’s frankly tiring to hear them.

      source
      • partial_accumen@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

        He’s in the same bucket as Greta Thunberg. They just like to scream and blame people instead of providing practical solutions.

        Greta Thunberg is 22 years old right now, and was “screaming” and “blaming people” when she was 11 years old.

        She saw the world she was going to inherit and forced conversation to work toward solutions. Expecting an 11 year old to provide answers that none of the established world has is silly.

        source
        • -> View More Comments
      • kkj@lemmy.dbzer0.com ⁨5⁩ ⁨weeks⁩ ago

        Thunberg’s solution has always been “listen to the experts who have been screaming at you for 50 years.” You don’t have to be an expert to care about things or to want to listen to people who are experts.

        source
        • -> View More Comments
    • acosmichippo@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

      charge more to customers for long term data storage.

      source
      • Fluffy_Ruffs@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

        How do you differentiate old from new? I can just create a fresh copy of whatever I’m storing and it’ll look new.

        source
        • -> View More Comments
    • nyan@lemmy.cafe ⁨5⁩ ⁨weeks⁩ ago

      Massive deduplication across all accounts on all servers of image, audio, and video data would theoretically be possible, but ain’t gonna happen. Or we could just discourage people from posting cat videos and bad memes (even less likely to happen).

      source
      • lemmyng@lemmy.ca ⁨5⁩ ⁨weeks⁩ ago

        I would argue that duplication of content is a feature, not a bug. It adds resilience, and is explicitly built into systems like CDNs, git, and blockchain (yes I know, blockchains suck at being useful, but nevertheless the point is that duplication of data is intentional and serves a purpose).

        source
        • -> View More Comments
      • Brkdncr@lemmy.world ⁨5⁩ ⁨weeks⁩ ago

        Deduplication is trivial when applied at the block level, as long as the data is not encrypted, or is encrypted at rest by the storage system.

        source
        • -> View More Comments
  • nyan@lemmy.cafe ⁨5⁩ ⁨weeks⁩ ago

    Sturgeon’s Law in action again.

    source