Comment

Comment on Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

<- View Parent

tyler@programming.dev ⁨6⁩ ⁨months⁩ ago

It knows they’re wrong which is why I don’t really think this article is accurate. Is it training if it already has the answers? Probably not.

source

Sort:hotnew top

MajinBlayze@lemmy.world ⁨6⁩ ⁨months⁩ ago
That’s why it gives you a panel of 9 images. It had a high confidence on some images, and a low confidence on others. When you pick the correct images and don’t pick incorrect ones it uses the ones it’s confident about as “validation” while taking the feedback on low confidence images to update the training data.

What this means is that only ones actually being “graded” are the ones bots can solve anyway.

source
- SkaveRat@discuss.tchncs.de ⁨6⁩ ⁨months⁩ ago
  and it will show the images to multiple people
  
  source
- Petter1@lemm.ee ⁨6⁩ ⁨months⁩ ago
  It seems exactly like that, I experimented with it by trying to leave the one I think it has low confidence unchecked, and it often worked.
  
  source
AmidFuror@fedia.io ⁨6⁩ ⁨months⁩ ago
My understanding is different from others here. I thought they served the same Captcha to many people at once and use the majority response to decide who is answering correctly.

source
- catloaf@lemm.ee ⁨6⁩ ⁨months⁩ ago
  That’s true, or at least it used to be back when they were using it for OCR. I have no reason to believe it’s changed.
  
  source
voxthefox@lemmy.world ⁨6⁩ ⁨months⁩ ago
It’s why they ask you to do multiple, 1-2 of them are the control group, they are training on the others

source
- tyler@programming.dev ⁨6⁩ ⁨months⁩ ago
  You’re implying they give you multiple. I hardly ever get multiple, pretty much only if I ‘fail’ the first one.
  
  source
  - Miaou@jlai.lu ⁨6⁩ ⁨months⁩ ago
    If they have a good fingerprint on you they don’t need the control group. That’s why you get 5+ captchas when using a VPN/tor.
    
    source
Rolando@lemmy.world ⁨6⁩ ⁨months⁩ ago
If they gave two captchas, one which they knew the answer and one which they didn’t, they could use the second for training. (Even if you’re paying someone, you want to do that sort of thing when crowdsourcing data, because you never know if the paid person is just screwing around.)

source