My understanding is different from others here. I thought they served the same Captcha to many people at once and use the majority response to decide who is answering correctly.
It knows they’re wrong which is why I don’t really think this article is accurate. Is it training if it already has the answers? Probably not.
AmidFuror@fedia.io 3 months ago
catloaf@lemm.ee 3 months ago
That’s true, or at least it used to be back when they were using it for OCR. I have no reason to believe it’s changed.
voxthefox@lemmy.world 3 months ago
It’s why they ask you to do multiple, 1-2 of them are the control group, they are training on the others
tyler@programming.dev 3 months ago
You’re implying they give you multiple. I hardly ever get multiple, pretty much only if I ‘fail’ the first one.
Miaou@jlai.lu 3 months ago
If they have a good fingerprint on you they don’t need the control group. That’s why you get 5+ captchas when using a VPN/tor.
Rolando@lemmy.world 3 months ago
If they gave two captchas, one which they knew the answer and one which they didn’t, they could use the second for training. (Even if you’re paying someone, you want to do that sort of thing when crowdsourcing data, because you never know if the paid person is just screwing around.)
MajinBlayze@lemmy.world 3 months ago
That’s why it gives you a panel of 9 images. It had a high confidence on some images, and a low confidence on others. When you pick the correct images and don’t pick incorrect ones it uses the ones it’s confident about as “validation” while taking the feedback on low confidence images to update the training data.
What this means is that only ones actually being “graded” are the ones bots can solve anyway.
SkaveRat@discuss.tchncs.de 3 months ago
and it will show the images to multiple people
Petter1@lemm.ee 3 months ago
It seems exactly like that, I experimented with it by trying to leave the one I think it has low confidence unchecked, and it often worked.