OpenAI Announces a New Voice Cloning Model
“OpenAI has announced Voice Engine, an AI model for voice cloning that uses a 15-second audio sample and text input to almost perfectly clone the voice. Voice Engine was first developed in late 2022. It is currently being tested for applications like reading assistance for non-readers and children, content translation, and improving essential service delivery in remote settings.”
OpenAI Teases Again with a New Voice Cloning Model
Key Highlights:
- Development and Testing: Voice Engine was first developed in late 2022. It is being tested with “trusted partners” for applications like reading assistance for non-readers and children, content translation, and improving essential service delivery in remote settings.
- Training and Data Use: The model is trained on a mix of licensed and publicly available data. Details of the training data are closely guarded, given the potential copyright ramifications.
- Editing: Voice Engine currently doesn’t allow editing the generated output. There are no options for adjusting the tone, pitch, or cadence of the voice.
- Pricing: Voice Engine will cost $15 per 1 million characters. That is quite cheap compared with the current best in the industry, ElevenLabs, which charges $11 for 100,000 characters per month but also provides editing features. (Source)
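To put the two quoted prices on the same footing, here is a minimal sketch that normalizes each to US dollars per 1 million characters. It uses only the figures quoted above and assumes the ElevenLabs figure can be read as a flat per-character rate; since that figure is a monthly allowance, the comparison is only approximate. The function name `cost_per_million` is made up for this illustration.

```python
def cost_per_million(price_usd: float, characters: int) -> float:
    """Normalize a quoted price to US dollars per 1 million characters."""
    return price_usd * 1_000_000 / characters


# Figures as quoted in the newsletter item above.
voice_engine = cost_per_million(15, 1_000_000)  # $15 per 1 million characters
elevenlabs = cost_per_million(11, 100_000)      # $11 per 100,000 characters (monthly plan)

# Assumption: the monthly allowance is treated as a flat per-character rate.
ratio = elevenlabs / voice_engine

print(f"Voice Engine: ${voice_engine:.2f} per 1M characters")
print(f"ElevenLabs:   ${elevenlabs:.2f} per 1M characters")
print(f"ElevenLabs costs about {ratio:.1f}x as much per character")
```

On these numbers, the ElevenLabs rate works out to $110 per 1 million characters, a bit over seven times the quoted Voice Engine price.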
The link below leads to an example of translation from the HeyGen platform, which uses OpenAI’s Voice Engine model.
https://unwindai.substack.com/p/ai-chatbot-that-maps-human-emotions
