Comment

Comment on I spent a year on Linux and forgot to miss Windows

vikingtons@lemmy.world ⁨4⁩ ⁨months⁩ ago

archive link

archive.is/N4thi

source

Sort:hotnew top

Arcane2077@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
Anyone else facing captcha loops whenever they try to view an archive.is link? Haven’t been able to read subscriber only articles for months now

source
- mjr@infosec.pub ⁨4⁩ ⁨months⁩ ago
  Not every time, but far too often. They don’t seem to care that they’re discriminating against people with AV impairment, plus locking out some secure browsers.
  
  source
  - ilovepiracy@lemmy.dbzer0.com ⁨4⁩ ⁨months⁩ ago
    Just a heads up, archive.is is not related to the internet archive and I believe is run by a solo dev with private funding.
    
    source
    mjr@infosec.pub ⁨4⁩ ⁨months⁩ ago
    
    archive.is is not related to the internet archive and I believe is run by a solo dev with private funding.
    
    I looked into who runs it a bit and oh wow, it’s far far worse than that. If you get a captcha from archive.is / archive.ph / archive.today and allow it scripting permission, it seems to use your browser as part of a DDoS attack. See infosec.exchange/@iampytest1/115902693235671566 and linked pages.
    
    source
  - Arcane2077@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
    Dang, yeah it’s probably my strict browser settings. Thanks for the confirmation of shared experience.
    
    source
    cecilkorik@piefed.ca ⁨4⁩ ⁨months⁩ ago
    Sometimes I’m able to get around it by tweaking some ublock permissions, but once I was surprised to discover that changing my user-agent with user-agent switcher seemed to do the trick. It’s really strange. Cloudflare’s captcha loops are inscrutable.
    
    source
- FauxLiving@lemmy.world ⁨4⁩ ⁨months⁩ ago
  LLM-driven web scraping is intense for some sites, so their bot detection software is tuned in a way that creates a lot of false positives.
  
  Obscuring your browser fingerprint, or blocking javascript, or using an unusual user-agent string can trigger a captcha challenge.
  
  If you’re not doing that and seeing a site suddenly start giving your captchas then they may be being DDoS’d by scrapers and are challenging all clients.
  
  A site that archives content is especially vulnerable because they have a lot of the data that is useful for AI training.
  
  It is incredibly annoying, but until we have a robust way of proving identity that can’t be gamed by bad actors we’re stuck with individual user challenges.
  
  source
- vikingtons@lemmy.world ⁨4⁩ ⁨months⁩ ago
  No but I do get about three or four challenges
  
  source
- Axolotl_cpp@feddit.it ⁨4⁩ ⁨months⁩ ago
  I don’t have this problem; You probably are using TOR or a VPN and it triggered the captcha
  
  source
  - MadMadBunny@lemmy.ca ⁨4⁩ ⁨months⁩ ago
    Nope
    
    source
- Pika@sh.itjust.works ⁨4⁩ ⁨months⁩ ago
  I haven’t faced a captcha but, it just took a solid 2 minutes to resolve and load the article for me. Maybe they have something else happening behind the scenes impacting performance so they are locking down certain routes?
  
  source