To that end the company is developing a “Pay Per Crawl” system, which would give content creators the option to request payment from AI companies for utilising their original content.
So Cloudflare is not so much “saving the Internet” as becoming a middleman between LLM training companies and content creators, which I believe has the potential to be a true goldmine in the future.
Concave1142@lemmy.world 14 hours ago
Until the AI companies find a way around it. Love the idea, so hopefully it causes at least 3 days of struggle for the AI crawlers.
Having said that… Can someone else put this in place, so we do not have Cloudflare hosting everything and end up one intern away from a global outage? Please? Pretty please?
orclev@lemmy.world 14 hours ago
The problem is that the biggest service Cloudflare provides is DDoS protection, and doing that requires that you have more bandwidth available than your attacker. Having enough bandwidth to withstand modern botnet-powered DDoS attacks is ridiculously expensive (and it’s also a finite resource: there’s only so much backbone infrastructure). Basically, it’s economically infeasible to have multiple companies providing the service Cloudflare does. You might be able to get away with two companies doing so, but it’s unlikely you could manage more than that without some of them starting to go bankrupt.
acosmichippo@lemmy.world 13 hours ago
When a critical service is not economical for more than one business to provide (a natural monopoly), that’s when the govt should be stepping in.
Kowowow@lemmy.ca 14 hours ago
I wonder if it would be a good investment for a country to have its own, then down the line expand to sell the same service to others.
auraithx@piefed.social 7 hours ago
Yeah, this will have absolutely no impact on gathering training data.
I assumed it was to block AI agents crawling a site during requests, which they’d be unlikely to bypass in the web UI.
But no company spending millions on training will hesitate to have an agent appear as a regular desktop user to scrape data.
boonhet@sopuli.xyz 7 hours ago
Does Cloudflare still look at the user agent? I thought they had more reliable data points.
baduhai@sopuli.xyz 11 hours ago
Proof of work seems to be working pretty well for many websites.
WreckingBANG@lemmy.ml 8 hours ago
github.com/TecharoHQ/anubis
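For anyone wondering what "proof of work" means here, a minimal hashcash-style sketch in Python follows. This is a generic illustration and not Anubis's actual implementation: the function names `solve` and `verify`, the challenge string, and the "leading hex zeros" difficulty rule are all assumptions made for the example. The idea is that the client has to burn CPU time finding a nonce, while the server can check the answer with a single hash.

```python
import hashlib
from itertools import count


def solve(challenge: str, difficulty: int) -> int:
    """Client side (hypothetical helper): brute-force a nonce such that
    SHA-256(challenge + nonce) starts with `difficulty` hex zeros."""
    prefix = "0" * difficulty
    for nonce in count():
        digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
        if digest.startswith(prefix):
            return nonce


def verify(challenge: str, nonce: int, difficulty: int) -> bool:
    """Server side (hypothetical helper): one hash is enough to check the work."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
    return digest.startswith("0" * difficulty)


if __name__ == "__main__":
    # The challenge would normally be a random per-visitor value; this one is made up.
    challenge = "example-challenge"
    nonce = solve(challenge, difficulty=3)
    print(nonce, verify(challenge, nonce, difficulty=3))
```

Each extra hex zero of difficulty multiplies the expected client work by 16, while verification stays at one hash. That asymmetry is cheap for a single human visitor but adds up for a crawler fetching millions of pages.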