Wednesday, July 24, 2024
HomeSony liveAI voice cloning: OpenAI reveals new text-to-speech mannequin with each promise and...

AI voice cloning: OpenAI reveals new text-to-speech mannequin with each promise and peril


OpenAI continues to push the boundaries of AI tech. First, it launched a software that may conjure digital pictures with only a description. Then, it revealed Sora, a know-how that generates Hollywood-quality movement movies. And now, it’s moving into the realm of voice recreation.

The most recent from OpenAI is a function that reads textual content aloud in a remarkably human-like voice. This breakthrough in synthetic intelligence marks a big leap ahead, but it surely additionally raises issues concerning the potential for deepfake manipulation (through Bloomberg).

The corporate has unveiled early outcomes from testing this function, providing demos, which you’ll hearken to right here. Dubbed Voice Engine, this text-to-speech mannequin is at the moment in a restricted trial section with about 10 builders. OpenAI has opted for a cautious strategy slightly than a widespread launch.

Following suggestions from stakeholders like policymakers and educators, OpenAI has determined to cut back its preliminary rollout. The corporate acknowledges the intense dangers of producing human-like speech, particularly throughout delicate occasions like an election 12 months.

The corporate wrote in a weblog put up:

Not like earlier audio initiatives, Voice Engine stands out for its capability to imitate particular person voices with outstanding accuracy, capturing nuances in cadence and intonation. And all it wants is simply 15 seconds to duplicate an individual’s voice.

Amongst OpenAI’s companions is the Norman Prince Neurosciences Institute at Lifespan, the place the know-how is used to assist sufferers in voice rehabilitation. As an example, it was used to revive the speech of a younger affected person who had problem talking clearly attributable to a mind tumor. The AI discovered from earlier recordings for a college challenge.

Along with its functions in healthcare, the customized speech mannequin has caught the eye of corporations like Spotify, which sees potential in translating audio content material, equivalent to podcasts, into a number of languages. Nonetheless, OpenAI emphasizes moral pointers for utilizing the know-how, together with acquiring consent from authentic audio system and disclosing AI-generated content material to listeners.

Additionally, earlier than contemplating a wider launch, OpenAI is soliciting suggestions and urging public consciousness of the challenges posed by superior AI tech. This contains advocating for the phasing out of voice authentication in delicate areas like banking.

OpenAI added in its weblog put up:


Moreover, the corporate provides that it hopes this preview sparks a dialog about addressing the dangers related to AI developments and selling societal resilience.




Please enter your comment!
Please enter your name here

Most Popular

Recent Comments