OpenAI Announces a New Voice Cloning Model

“OpenAI has announced Voice Engine, an AI model for voice cloning that uses a 15-second audio sample and text input to almost perfectly clone the voice. Voice Engine was first developed in late 2022. It is currently being tested for applications like non-readers and children, content translation, and improving essential service delivery in remote settings.”

OpenAI Teases Again with a New Voice Cloning Model

“OpenAI has been releasing demos of its text-to-video model Sora and we’re eagerly waiting for it to be publicly available. But before the anticipation could settle, OpenAI has announced a new model but it’s not available for use, again. The company has developed Voice Engine, an AI model for voice cloning that uses a 15-second audio sample and text input to almost perfectly clone the voice.

Key Highlights:

Development and Testing: Voice Engine was first developed in late 2022. It is being tested with “trusted partners” for applications like non-readers and children, content translation, and improving essential service delivery in remote settings.
Training and Data Use: The model is trained on a mix of licensed and publicly available data, with details on the training data being closely guarded considering the ramification of copyright issues.
Editing: TheVoice Engine currently doesn’t allow editing the generated output. There are no options for adjusting the tone, pitch, or cadence of the voice.
Pricing: Voice Engine will cost $15 per 1 million characters. It is quite cheap in comparison to the current-best in the industry – Eleven Labs – that charges $11 for 100,000 characters per month but provides editing features also. (Source)”

Through the link is an example of translation from the HeyGen platform that is using OpenAI’s Voice Engine model.

https://unwindai.substack.com/p/ai-chatbot-that-maps-human-emotions

Pro plugin deactivated or invalid

Posted on: April 7, 2024, 11:07 am Category: Uncategorized

By: Stephen Abram

Comments Off on OpenAI Announces a New Voice Cloning Model

OpenAI Announces a New Voice Cloning Model