‘World Of Warcraft’ Players Trick AI-Scraping Games Website Into Publishing Nonsense
Submitted 1 year ago by someguy3@lemmy.ca to technology@lemmy.world
Comments
drmoose@lemmy.world 1 year ago
I don’t really get the hostility towards AI scraping. Don’t we want to have a healthy shared graph of human knowledge? This data is also used by open source models. It’s poisoning the well for everybody just to spite a few companies that have the resources to filter it out anyway. Hate is making people do stupid things. It’s embarrassing to call yourself a gamer these days ngl.
abrasiveteapot@sh.itjust.works 1 year ago
Because regurgitation without understanding leads to demonstrably untrue information being propagated as fact. There have also been a number of instances where AIs have straight up made stuff up.
drmoose@lemmy.world 1 year ago
It shouldn’t be used as a fact tool tho, and it’s not intended for that.
cyd@lemmy.world 1 year ago
Ironically, this article itself reads like it was written by AI.
MelastSB@sh.itjust.works 1 year ago
Can you remove the link that the author refused to link, or is that an automatic Lemmy feature?
someguy3@lemmy.ca 1 year ago
I can’t edit the top box, but I edited mine to take out the duplicate text. The link was automatic though.
Max_P@lemmy.max-p.me 1 year ago
Automatic feature. Anything that looks like a valid domain gets autolinked.
nocturne213@lemmy.world 1 year ago
Forbes is not much better, their “articles” are mostly garbage.
jocanib@lemmy.world 1 year ago
tbf this is not very different from how many flesh’n’blood journalists have been finding content for years. [The legendary crack squirrels of Brixton](brixtonbuzz.com/…/the-pumped-up-squirrel-of-rush-…) was nearly two decades ago now (yikes!). Fox was a little late to the party with U.K. Squirrels Are Nuts About Crack in 2015.
Obviously, I want flesh’n’blood writers getting paid for their plagiarism-lite, not the cheapskates who automate it. But this kind of embarrassing error is a feature of the genre. And it has been gamed on social media for some time now (e.g. Lib Dem leader Jo Swinson forced to deny shooting stones at squirrels after spoof story goes viral).
I don’t know what it is about squirrels…
fearout@kbin.social 1 year ago
Reposting a comment from another similar thread to show that this is easily fixable, and you should be wary of any non-reputable news source anyway.
I was curious how current LLMs might handle this with proper instructions, so I asked ChatGPT this: “What can you tell me about this Reddit post? Would you write a news article about this? Analyze the trustworthiness of this information:” and pasted the text from the post. Here’s a part of its reply:
So it’s not even an issue with current models, just bad setup. An AutoGPT setup with several fact-checking questions added in can easily filter this stuff.