Comment

Comment on Tesla said it didn’t have key data in a fatal crash. Then a hacker found it.

MonkderVierte@lemmy.zip ⁨3⁩ ⁨days⁩ ago

How does archive get the unpaywalled version? I don’t think they pay the subscription for every single tabloid out there?

Sort:hotnew top

stoly@lemmy.world ⁨3⁩ ⁨days⁩ ago
The paywall is JavaScript but the content is still in plaintext below. The crawlers don’t read the JavaScript.

source
- MonkderVierte@lemmy.zip ⁨3⁩ ⁨days⁩ ago
  Disabling 3rd-party js has no paywall, but only the first paragraph too. Crawlers get full access?
  
  source
AnarchistArtificer@slrpnk.net ⁨3⁩ ⁨days⁩ ago
I think they use the same thing that web crawlers use. If Google’s crawler couldn’t access the content of the page (or could only access a limited amount of content), it would likely rank far lower in search results

source
- MonkderVierte@lemmy.zip ⁨3⁩ ⁨days⁩ ago
  Btw, why is there no search engine where you can sort and filter how you want instead of how they want?
  
  source