Comment on OpenAI has built a text watermarking method to detect chatgpt written content

brucethemoose@lemmy.world 2 months ago

This has been known in the ML space forever. LLMs don’t actually output words, but “probabilities” for tokens. And if you arbitrarily weight these probabilities, it creates a “signature” in any text that’s easy to measure. The sampler randomizes it a tiny bit, but that’s not a problem in long texts.
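
To make that concrete, here's a rough toy sketch of the general idea. It is not OpenAI's actual method (I don't know the details of that), just one common way to bias token probabilities and then measure the bias. Everything in it is made up for illustration: the random logits standing in for a real model, the hash-based "green list" keyed on the previous token, and the `VOCAB_SIZE`, `GREEN_FRACTION`, and `BIAS` values.

```python
import hashlib
import math
import random

VOCAB_SIZE = 1000      # toy vocabulary of integer token ids (assumption)
GREEN_FRACTION = 0.5   # share of the vocabulary favored at each step (assumption)
BIAS = 2.0             # amount added to the logits of favored tokens (assumption)


def is_green(prev_token: int, token: int) -> bool:
    """Pseudo-randomly mark `token` as favored, keyed on the previous token."""
    digest = hashlib.sha256(f"{prev_token}:{token}".encode()).digest()
    return digest[0] < 256 * GREEN_FRACTION


def sample_token(prev_token: int, rng: random.Random, watermark: bool) -> int:
    # Stand-in for a model: random logits instead of a real forward pass.
    logits = [rng.gauss(0.0, 1.0) for _ in range(VOCAB_SIZE)]
    if watermark:
        # The "arbitrary weighting": nudge favored tokens upward.
        logits = [
            logit + BIAS if is_green(prev_token, tok) else logit
            for tok, logit in enumerate(logits)
        ]
    weights = [math.exp(logit) for logit in logits]
    # The sampler still picks randomly, just from a tilted distribution.
    return rng.choices(range(VOCAB_SIZE), weights=weights, k=1)[0]


def generate(length: int, seed: int, watermark: bool) -> list[int]:
    rng = random.Random(seed)
    tokens = [0]  # fixed start token
    for _ in range(length):
        tokens.append(sample_token(tokens[-1], rng, watermark))
    return tokens


def z_score(tokens: list[int]) -> float:
    """How many standard deviations the favored-token count sits above chance."""
    n = len(tokens) - 1
    hits = sum(is_green(prev, tok) for prev, tok in zip(tokens, tokens[1:]))
    expected = GREEN_FRACTION * n
    std = math.sqrt(n * GREEN_FRACTION * (1 - GREEN_FRACTION))
    return (hits - expected) / std


if __name__ == "__main__":
    print("watermarked:", z_score(generate(200, seed=1, watermark=True)))
    print("plain:      ", z_score(generate(200, seed=1, watermark=False)))
```

Unbiased text should score near zero and biased text well above it, and the gap widens as the text gets longer, which is why the sampler's randomness stops mattering in long texts.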

It’s defeatable. I’m sure if you make enough OpenAI queries, you can find the bias. But this will likely stop the lazy abusers, aka 99% of abusers.
