Comment on OpenAI and Anthropic are ignoring an established rule that prevents bots scraping online content

balder1991@lemmy.world 4 months ago

It’s just how machine learning has always been.

We only know a model’s behavior by testing it, so we only know that behavior roughly, in proportion to how much testing was done. The model’s internals have always been a black box of numbers that individually mean nothing, and if you track which neurons fire where, it looks random, because it probably is.

Remember, machine learning models aren’t carefully designed. They’re brute-force trained for a long time, with the numbers adjusted again and again depending on whether the results move closer to or further from the desired output.
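
To make the "adjust the numbers again and again" part concrete, here’s a toy sketch. It assumes plain gradient descent on a two-number model, which is my stand-in for what real training does at a vastly larger scale. The names (train, lr, steps) and the y = 2x + 1 data are made up for illustration.

```python
import random


def train(steps=2000, lr=0.05, seed=0):
    """Fit y = w*x + b by repeatedly nudging w and b to shrink the error."""
    rng = random.Random(seed)
    # Toy data generated from y = 2x + 1. The model is never told this rule;
    # it only ever sees the (x, y) pairs.
    data = [(x / 10, 2 * (x / 10) + 1) for x in range(-10, 11)]
    # Start from arbitrary numbers.
    w, b = rng.uniform(-1, 1), rng.uniform(-1, 1)
    for _ in range(steps):
        grad_w = grad_b = 0.0
        for x, y in data:
            err = (w * x + b) - y   # how far the output is from the target
            grad_w += 2 * err * x   # slope of the squared error w.r.t. w
            grad_b += 2 * err       # slope of the squared error w.r.t. b
        # Move each number a small step in the direction that reduces error.
        w -= lr * grad_w / len(data)
        b -= lr * grad_b / len(data)
    return w, b


if __name__ == "__main__":
    w, b = train()
    print(f"w={w:.4f}, b={b:.4f}")
```

Nobody wrote "multiply by 2 and add 1" anywhere in there; the loop just keeps shifting the two numbers until the outputs match. With two numbers you can still read off what was learned. With billions of them you can’t, which is the black-box problem.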
