• millie@beehaw.org
    English
    arrow-up
    43
    ·
    9 months ago

    It is incredibly obvious that CAPTCHAs are at the very least a way of exploiting distributed labor to train AI.

    • Railcar8095@lemm.ee
      arrow-up
      21
      ·
      9 months ago

      They have been used to help with text recognition for book scanning for more than a decade. It has never been a secret; it was explained on the CAPTCHAs themselves a long time ago.

      This is the logical progression, regardless of your feelings about “AI”.

  • Chozo@fedia.io
    arrow-up
    30
    ·
    9 months ago

    Okay, this “$1 trillion” metric is a bit of a reach, and seems to be based on an arbitrary value assigned to an estimated amount of data Google has collected, and not actually $1,000,000,000,000 in revenue. It does not appear that Google has actually made a trillion dollars from CAPTCHA data.

      • Zaktor@sopuli.xyz
        English
        arrow-up
        17
        ·
        9 months ago

        They don’t seem to actually identify the cookies as tracking cookies (as opposed to cookies that just record that the account can bypass further challenges); they simply assume that any third-party cookie has a monetary tracking value.

        It also appears to be unreviewed and unpublished a few years later. Just being in paper format and up on arXiv doesn’t mean that the contents are reliable science.

      • Kissaki@beehaw.org
        English
        arrow-up
        1
        ·
        9 months ago

        we do so via a large-scale (over 3,600 distinct users) 13-month real-world user study and post-study survey

        results indicate that the website context directly influences (with statistically significant differences) solving time between password recovery and account creation.

        We explore the cost and security of reCAPTCHAv2 and conclude that it has an immense cost and no security. Overall, we believe that this study’s results prompt a natural conclusion: reCAPTCHAv2 and similar reCAPTCHA technology should be deprecated.

  • Kissaki@beehaw.org
    English
    arrow-up
    12
    ·
    9 months ago

    Since Cloudflare published Turnstile I’ve hated Captchas even more, because Turnstile does it so much better. Captchas are such a hassle. One website I occasionally visit does not keep me logged in and then presents one of the worst captcha puzzle systems. Shitty captchas are a huge barrier.

    Turnstile is, in almost all cases, one checkbox to click (I’ve never been challenged beyond that). All captcha puzzles should be replaced with Turnstile or similar simple (for the user to solve) tech.

      • Kissaki@beehaw.org
        English
        arrow-up
        2
        ·
        9 months ago

        The announcement blog post linked at the bottom of the linked Turnstile page has some info on that:

        For Turnstile, the actual act of checking a box isn’t important, it’s the background data we’re analyzing while the box is checked that matters. We find and stop bots by running a series of in-browser tests, checking browser characteristics, native browser APIs, and asking the browser to pass lightweight tests (ex: proof-of-work tests, proof-of-space tests) to prove that it’s an actual browser. The current deployment of Turnstile checks billions of visitors every day, and we are able to identify browser abnormalities that bots exhibit while attempting to pass those tests.
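The quote above mentions proof-of-work tests without saying how they work. As a rough illustration only, here is a hashcash-style sketch in Python. It is not Cloudflare's implementation; the function names, the SHA-256 choice, the challenge string, and the difficulty value are all assumptions made for this example. The idea is that the client has to burn some CPU finding a nonce, while the server can check the answer with a single hash.

```python
import hashlib
from itertools import count


def leading_zero_bits(digest: bytes) -> int:
    """Count the zero bits at the start of a hash digest."""
    bits = 0
    for byte in digest:
        if byte == 0:
            bits += 8
            continue
        # bit_length() of a non-zero byte tells us how many leading zeros it has.
        bits += 8 - byte.bit_length()
        break
    return bits


def solve(challenge: bytes, difficulty: int) -> int:
    """Client side (hypothetical): search for a nonce that meets the difficulty."""
    for nonce in count():
        digest = hashlib.sha256(challenge + str(nonce).encode()).digest()
        if leading_zero_bits(digest) >= difficulty:
            return nonce


def verify(challenge: bytes, nonce: int, difficulty: int) -> bool:
    """Server side (hypothetical): one hash is enough to check the work."""
    digest = hashlib.sha256(challenge + str(nonce).encode()).digest()
    return leading_zero_bits(digest) >= difficulty


if __name__ == "__main__":
    challenge = b"example-challenge"  # assumed: a random value issued per visitor
    difficulty = 12                   # assumed: roughly 4,096 hashes on average
    nonce = solve(challenge, difficulty)
    print(nonce, verify(challenge, nonce, difficulty))
```

Each extra bit of difficulty doubles the expected work for the solver while the verification cost stays the same, which is why this kind of test is cheap for one real browser and expensive for a bot farm doing it at scale.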

  • _cryptagion [he/him]@lemmy.dbzer0.com
    English
    arrow-up
    12
    ·
    9 months ago

    I’m a simple guy. If a website I visit uses any kind of captcha other than Cloudflare’s Turnstile, then I close that website and don’t use it ever again. I’m not interested in wasting five minutes picking which squares have busses in them because ReCaptcha has decided I have to do the captcha 200 times.

    • ooli2@lemm.eeOP
      English
      arrow-up
      10
      ·
      9 months ago

      What is infuriating is that some official government website in my country used Google’s captcha.

      • jagged_circle@feddit.nl
        English
        arrow-up
        6
        ·
        edit-2
        9 months ago

        This happened to me recently. Worse, there’s an error message saying I didn’t solve the CAPTCHA…but I wasn’t prompted for the CAPTCHA!

        I opened a bug report and the gov said “works for me”.

        So, yeah, people breaking laws because they can’t submit legally required data to the gov due to reliance on faulty Google services is real.

  • Powderhorn@beehaw.org
    English
    arrow-up
    11
    ·
    edit-2
    9 months ago

    It’s a lot easier to determine the intent of this hed with the quote being closed somewhere. Just after “service” would have been my guess, but it’s a disservice to remove that and leave people dangling.

    My larger issue is that when I’m faced with traffic lights – or, god forbid, motorcycles – this is performative nonsense wherein I’m supposed to guess percentage coverage on a given square without having been provided parameters.

    At this point, CAPTCHAs feel designed to make sure you can never get through the first time, so you end up training image models several more times before you can just fucking do what you originally came to the site for.

    • jarfil@beehaw.org
      arrow-up
      6
      ·
      edit-2
      9 months ago

      At this point, CAPTCHAs feel designed […] training image models

      It was never a secret:

      The reCAPTCHA program originated with Guatemalan computer scientist Luis von Ahn, and was aided by a MacArthur Fellowship. An early CAPTCHA developer, he realized “he had unwittingly created a system that was frittering away, in ten-second increments, millions of hours of a most precious resource: human brain cycles”

      https://en.m.wikipedia.org/wiki/ReCAPTCHA#Origin