Comment on Perplexity AI is complaining their plagiarism bot machine cannot bypass Cloudflare's firewall

<- View Parent
ubergeek@lemmy.today ⁨6⁩ ⁨hours⁩ ago

Information should not be gated or owned in a way that would make it illegal for anyone to access it under proper conditions.

Then you don’t believe content creators should have any control over their own works?

The “proper conditions” are deemed by the content creator, not the consumers.

Doing a GET request doesn’t do that.

Not at all. It consumes at most, a watt.

What kind of problems that would be?

Increasing my hosting bill, to accommodate the senseless traffic being sent my way?

Outages for my site, making my content unavailable for legitimate users?

You have to agree that at one point “be used by LLM” would not be different from “be used by a user”.

Not at all. LLMs are not users.

It’s self-hosted and free.

If you want, or they charge for the hosted version. If they want to use a paid for version, then they can divert some of that revenue to me, the creator, because without creators, they would have no product.

How does that prohibit usage and processing of your info? That sounds like “I won’t be providing any comments on Lemmy website, if you want my opinion you can mail me at a@b.com”

That’s a apples and oranges comparison, and you know it.

That will never block all of them. Your info will be used without your consent and you will not feel troubled from it. So you might not feel troubled if more things do the same.

Perplexity seems to be troubled by it.

What if I use my local hosted LLM? Anyway, the point is, selling text can’t work well, and you’re going to spend much more resources on collecting and summarizing data about how your text was used and how others benefited from it, in order to get compensation, than it worths.

If selling text can’t work well, then why do LLM products insist on using my text, to sell it?

Also, it might be the case that some information is actually worthless when compared to a service provided by things like LLM, even though they use that worthless information in the process.

LLMs are a net negative, as far as costs go. They consume far more in resources than they provide in benefit. If my information was worthless without an LLM, it’s worthless with an LLM, therefore, LLMs don’t need to access it. Periodt.

The bottom line? Content creators get the first say in how their content is used, and consumed. You are not entitled to their labor, for free, and without condition.

source
Sort:hotnewtop