
sacrificing accessibility for not getting web-scraped

72 likes

Submitted 1 day ago by THTR300@feddit.org to technology@lemmy.world

https://tilschuenemann.de/projects/sacrificing-accessibility-for-not-getting-web-scraped


Comments

  • Lazycog@sopuli.xyz 15 hours ago

    For those who are too lazy to check what people here in the comments are saying, here’s what happens when you open the page in reader mode:

    A web page in Firefox’s reader mode displays the unobfuscated headline “sacrificing accessibility for not getting web-scraped” while the rest of the text is a cryptic combination of numbers and letters instead of the actual content text.

  • kimara@sopuli.xyz 9 hours ago

    I was hoping the author would advise against using this, because afterwards your website isn’t accessible.

    I don’t think this is the answer to web scrapers, since it so adversely affects people with disabilities. The web is one of the few things that are structurally (reasonably) accessible by default, even if making an inaccessible website (like the author’s now) is possible.

    If you have to choose between keeping your site accessible (and letting scrapers access it) and scrapping accessibility, you should choose accessibility.

  • Static_Rocket@lemmy.world 23 hours ago

    Ironically, my first instinct on opening that page and seeing its unusual layout and density on mobile was to switch to reader view, and I got hit with the ciphertext immediately. Cool, I guess.

    • dracs@programming.dev 22 hours ago

      I did the exact same thing. Wouldn’t like to see it adopted due to accessibility issues. But it’s a neat trick.

      • cmnybo@discuss.tchncs.de 21 hours ago

        It would be better to use Anubis to try to block the bots or use Iocaine and let the bots scrape as much garbage as they want.

  • EndlessNightmare@reddthat.com 18 hours ago

    your search rank will drop

    If your goal is to prevent web-scraping, this seems like an intended effect.

  • onehundredsixtynine@sh.itjust.works 8 hours ago

    Honestly, fuck anyone who seriously implements this on their website.

  • JasonDJ@lemmy.zip 20 hours ago

    yZkB-kREXj IjXE cgZejm

    Cool.

  • MonkderVierte@lemmy.zip 9 hours ago

    I’m not reading that shit:

    Image

  • Maroon@lemmy.world 14 hours ago
    [deleted]
    • falseWhite@lemmy.world 13 hours ago

      There’s a code example in the article of how to scramble HTML responses.
