Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

ForgottenFlux@lemmy.world · 4 months ago

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

tyler@programming.dev · 4 months ago

It knows they’re wrong which is why I don’t really think this article is accurate. Is it training if it already has the answers? Probably not.

MajinBlayze@lemmy.world · edit-2 4 months ago

That’s why it gives you a panel of 9 images. It would have a high confidence on some images, and a low confidence on others. When you pick the correct images and don’t pick incorrect ones it uses the ones it’s confident about as “validation” while taking the feedback on low confidence images to update the training data.

What this does mean in practice is that only ones actually being “graded” are the ones bots can solve anyway.

SkaveRat@discuss.tchncs.de · 4 months ago

and it will show the images to multiple people

Petter1@lemm.ee · 4 months ago

It seems exactly like that, I experimented with it by trying to leave the one I think it has low confidence unchecked, and it often worked.

AmidFuror@fedia.io · 4 months ago

My understanding is different from others here. I thought they served the same Captcha to many people at once and use the majority response to decide who is answering correctly.

catloaf@lemm.ee · 4 months ago

That’s true, or at least it used to be back when they were using it for OCR. I have no reason to believe it’s changed.

Vox@lemmy.world · 4 months ago

It’s why they ask you to do multiple, 1-2 of them are the control group, they are training on the others

tyler@programming.dev · 4 months ago

You’re implying they give you multiple. I hardly ever get multiple, pretty much only if I ‘fail’ the first one.

Rolando@lemmy.world · 4 months ago

If they gave two captchas, one which they knew the answer and one which they didn’t, they could use the second for training. (Even if you’re paying someone, you want to do that sort of thing when crowdsourcing data, because you never know if the paid person is just screwing around.)

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

Forget security – Google's reCAPTCHA v2 is exploiting users for profit | Web puzzles don't protect against bots, but humans have spent 819 million unpaid hours solving them

Google's reCAPTCHAv2 is just labor exploitation, boffins say