Social Media

# Meta Showcases New ‘Voicebox’ Speech-to-Textual content Translation Device

Meta Showcases New ‘Voicebox’ Speech-to-Textual content Translation Device

On the floor a minimum of, Meta’s newest AI development doesn’t seem to be a serious step.

Right now, Meta has printed an outline of its new ‘Voicebox’ AI system, which can allow customers to translate textual content to audio, in a variety of kinds and voices.

As offered on this overview clip, the Voicebox system can take textual content inputs and translate them into audio, with completely different voice choices, enabling extra superior text-to-audio translation, however with decreased studying and processing necessities than different, comparable choices.

Although, on the floor a minimum of, it’s not a heap completely different from the text-to-audio instruments that we’re now accustomed to – whether or not we like them or not – on TikTok and different apps.

The Voicebox translations sound fairly comparable – and I’m keen to wager Meta gained’t let me use the voice of Rocket Raccoon or a Transformer in these new translations.

However the Voicebox system can be greater than only a direct text-to-speech translation device.

As defined by Meta:

Voicebox can produce top quality audio clips and edit pre-recorded audio – like eradicating automotive horns or a canine barking – all whereas preserving the content material and elegance of the audio. The mannequin can be multilingual and might produce speech in six languages. Sooner or later, multipurpose generative AI fashions like Voicebox may give natural-sounding voices to digital assistants and non-player-characters within the metaverse. They might permit visually impaired individuals to listen to written messages from associates learn by AI of their voices, give creators new instruments to simply create and edit audio tracks for movies, and rather more.”

As Meta notes, Voicebox additionally lets you use fashions of voice for translation, so you need to use an audio clip of one other particular person to be able to make your text-to-speech translation sound like that particular person is talking, through simply seconds of audio enter.

Which is able to undoubtedly result in a brand new raft of deepfakes – although once more, comparable instruments do exist already. They’re simply not the identical, and Meta says not pretty much as good, as this new course of.

The actual good thing about Voicebox, in a broad-reaching sense, can be in translation, and enabling simplified, native-sounding variations of your textual content inputs in several languages. That would open up new, cross-market alternatives, whereas the superior modeling of the system will even facilitate broader use circumstances and course of, which may present different key advantages.

However Meta can be conscious of the dangers.

At this stage, Meta isn’t releasing the supply code or app to the general public, citing ‘the potential dangers of misuse’. It’s hoping to seek out extra sensible, invaluable use circumstances for the know-how over time – so its announcement at this time is extra of an FYI than a launch, as such.

You may learn extra about Meta’s Voicebox mission right here.


Andrew Hutchinson
Content material and Social Media Supervisor

Supply

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button