First, OpenAI offered a tool that allowed people to create digital images simply by describing what they wanted to see. Then, it built similar technology that generated full-motion video like something from a Hollywood movie.

Now, it has unveiled technology that can recreate someone’s voice.

The high-profile A.I. start-up said on Friday that a small group of businesses was testing a new OpenAI system, Voice Engine, that can recreate a person’s voice from a 15-second recording. If you upload a recording of yourself and a paragraph of text, it can read the text using a synthetic voice that sounds like yours.

    • Jordan117@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      11
      ·
      7 months ago

      They’re not releasing this yet though, and require even private testers to get explicit consent and not imitate public figures, restrictions I’m sure will carry over if and when it becomes public. They’ve been cautious to a fault. Look at something like ElevenLabs for an example of lax enforcement for cash.

      • jol@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        2
        ·
        7 months ago

        Lol BS. They’ve said that about chat gpt 2 and about every second released product. There’s nothing they can do except ask people not to infring the rules.

        • K3zi4@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          arrow-down
          3
          ·
          7 months ago

          Not OP, But, Stephen Hawking could have kept his voice, as an example.There are probably lots of beneficial uses of this technology. Automatic TTS voice messages would be another 🤷‍♂️

          • starchylemming@lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            arrow-down
            3
            ·
            7 months ago

            the benefits for the average joe are few and in between. the negatives are hefty.

            say goodbye to your jobs soon, people who dub movies. we can have the voice of the original actor say his lines in all languages… why stop there? who needs living actors in movies anyway? fake it all the way with a dead actors likeness. not to mention that it’s getting harder to tell what’s real anymore… and conveniently, if some important shithead does an oopsie, they can claim it’s an elaborate fake and not a public leak of their shitty behaviour.

            it’s good for modding games and other low budget media, i guess

            • K3zi4@lemmy.world
              link
              fedilink
              English
              arrow-up
              6
              arrow-down
              1
              ·
              7 months ago

              Sure, those are good examples of negatives, but that is just the way of it. This happens all the time when new technology emerges. Just think about the audio industry, all of a sudden people could produce music from their spare bedrooms- jobs weren’t needed anymore. But the music industry is now far more saturated than ever as a result, as it is so much more accessible to people, without the need for specialist equipment and stacks of cash.

            • KairuByte@lemmy.dbzer0.com
              link
              fedilink
              English
              arrow-up
              2
              ·
              7 months ago

              This happens with literally every leap in technology. You realize we used to have people whose sole job was to connect phone calls? People whose job it was to light the street lamps? People whose job was to listen for ships/planes approaching?

  • yildolw@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    1
    ·
    7 months ago

    “Mommy, they are going to take my thumbs. Please pay the ransom. It’s $10k in Amazon giftcards. Mommy, please!”

    Why, OpenAI? Why are you trying to destroy the world? We have no defences against voice impersonation

  • AutoTL;DR@lemmings.worldB
    link
    fedilink
    English
    arrow-up
    3
    ·
    7 months ago

    This is the best summary I could come up with:


    First, OpenAI offered a tool that allowed people to create digital images simply by describing what they wanted to see.

    The company said it was particularly worried that this kind of technology could be used to break voice authenticators that control access to online banking accounts and other personal applications.

    “This is a sensitive thing, and it is important to get it right,” an OpenAI product manager, Jeff Harris, said in an interview.

    (The New York Times has sued OpenAI and its partner, Microsoft, on claims of copyright infringement involving artificial intelligence systems that generate text.)

    Businesses can use these technologies to generate audiobooks, give voice to online chatbots or even build an automated radio station DJ.

    In January, New Hampshire residents received robocall messages that dissuaded them from voting in the state primary in a voice that was most likely artificially generated to sound like President Biden.


    The original article contains 579 words, the summary contains 148 words. Saved 74%. I’m a bot and I’m open source!

  • topinambour_rex@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    arrow-down
    1
    ·
    7 months ago

    How they can unveil something which exist since an year ? Am I reading 1984 and soon OpenAI will have invented the wheel ?