Comment on Stack Overflow bans users en masse for rebelling against OpenAI partnership — users banned for deleting answers to prevent them being used to train ChatGPT
It creates a lot of poisoned data, especially if you, like, edit half your posts with nonsense.
realharo@lemm.ee 9 months ago
That’s trivial to filter if you just look at how much time has passed between posting and editing.
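A rough sketch of what that filter could look like, assuming each post record carries a creation timestamp and an optional last-edit timestamp. The `Post` class, its field names, and the 30-day cutoff are all made up for illustration; no real site's data model is implied.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class Post:
    # Hypothetical record layout, not any real platform's schema.
    body: str
    created_at: datetime
    edited_at: Optional[datetime] = None  # None means never edited


def is_suspect_edit(post: Post, max_edit_delay: timedelta = timedelta(days=30)) -> bool:
    """Flag posts whose last edit came long after they were posted."""
    if post.edited_at is None:
        return False
    return post.edited_at - post.created_at > max_edit_delay


def filter_posts(posts: List[Post], max_edit_delay: timedelta = timedelta(days=30)) -> List[Post]:
    """Keep only posts that were never edited or were edited soon after posting."""
    return [p for p in posts if not is_suspect_edit(p, max_edit_delay)]
```

Note that this only drops the late-edited posts. Getting the original text back would need the edit history to still be stored somewhere.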
Fedizen@lemmy.world 9 months ago
Sure, but the more you fuck with the data, the more it requires curating, and the less valuable it becomes. I’m not entirely sure places like Reddit even retain full edit history for posts over a year old.