The creator of an open source project that scraped the internet to determine the ever-changing popularity of different words in human language usage says that they are sunsetting the project because generative AI spam has poisoned the internet to a level where the project no longer has any utility.
Wordfreq is a program that tracked the ever-changing ways people used more than 40 different languages by analyzing millions of sources across Wikipedia, movie and TV subtitles, news articles, books, websites, Twitter, and Reddit. The system could be used to analyze changing language habits as slang and popular culture changed and language evolved, and was a resource for academics who study such things. In a note on the project’s GitHub, creator Robyn Speer wrote that the project “will not be updated anymore.”
grue@lemmy.world 3 weeks ago
The project creator doesn’t mince words:
Solumbran@lemmy.world 3 weeks ago
Seems pretty mild and reasonable, to be honest.
kn33@lemmy.world 3 weeks ago
Yeah, it seems really restrained for someone who has to end a project they’ve put so much effort into.
Randomgal@lemmy.ca 3 weeks ago
NGL sounds like a butthurt dude. Emotional arguments without logic.
Croquette@sh.itjust.works 3 weeks ago
I’d be fucking butthurt as well if my pet project was being destroyed by mega corpos for a shitty generative thief AI.
jaggedrobotpubes@lemmy.world 3 weeks ago
This does not say wonders about reading comprehension.
SirQuackTheDuck@lemmy.world 3 weeks ago
Imagine being an author whose sole income is writing books.
Here comes an AI that stole your work and is asked by a customer of OpenAI to summarise your books. It does so perfectly and the issuer is able to use your results freely, since they think it’s AI generated and doesn’t require attribution.
You receive nothing in return.
Good luck making a living.