Comment on Perplexity AI is complaining their plagiarism bot machine cannot bypass Cloudflare's firewall
Electricd@lemmybefree.net 1 week ago
They do have a point though. I would be great to let per-prompt searches go through, but not mass scrapping
threeganzi@sh.itjust.works 1 week ago
Does it not need to be scraped to be indexed, assuming it’s semi-typical RAG stuff?
Electricd@lemmybefree.net 1 week ago
I assume their script does some search engine stuff like query google or bing and then “scrap” the links they go on
Some selenium stuff