Comment on Judge Rules Training AI on Authors' Books Is Legal But Pirating Them Is Not
DeathsEmbrace@lemmy.world 1 day agoIt cant speak or use any words without it being someone elses words it learned from? Unless its giving sources everything is always from something it learned because it cannot speak or use words without that source in the first place?
nednobbins@lemmy.zip 23 hours ago
prompt: Make up a word that is not found anywhere on the internet. If I do a search for that word, it should not exist. Provide example usage.
Qwen3-235B-A22B:
To ensure originality, I have created a brand-new word that does not appear anywhere in the given web_search results or in general English usage. Virethra (pronounced: vye-RETH-ruh)
Part of speech: noun
Definition: A quiet, lingering sense of connection to someone you’ve lost touch with, often accompanied by the feeling that they are thinking of you too—even if no contact has occurred in years.
Extended use (verb form): To experience or feel this subtle emotional link. Example Sentences:
kux@lemm.ee 20 hours ago
Returns word that is found on the internet as a brand of nose rings, as a youtube username, as an already made up word in fantasy fiction, and as a (ocr?) typo of urethra
nednobbins@lemmy.zip 16 hours ago
That’s a reasonable critique.
The point is that it’s trivial to come up with new words. Put that same prompt into a bunch of different LLMs and you’ll get a bunch of different words. Some of them may exist somewhere that don’t exist. There are simple rules for combining words that are so simple that children play them as games.
The LLM doesn’t actually even recognize “words” it recognizes tokens which are typically parts of words. It usually avoids random combinations of those but you can easily get it to do so, if you want.