FryAI
| Report: ChatGPT unveiled a host of updates including voice and image recognition, marking a significant stride towards creating a more interactive and intuitive user experience (similar to how we currently interact with Siri & Google Assistant). |
Key Points: |
- Users can now snap photos and have live discussions about them with ChatGPT—a feature particularly handy while traveling, planning meals, or assisting with academic problems.
- Voice interactions are powered by a novel text-to-speech model, which, in collaboration with professional voice actors, generates human-like audio. You’ll also be able to transcribe your own audio to text to streamline interactions.
- Image understanding is supported by BeMyEyes—extending GPT-4’s reasoning abilities to a wide array of images including photos, screenshots, and mixed text-image documents; making it easier than ever to have a discussion with GPT-4.
- The new functionality is being released to Plus and Enterprise users initially, with a broader rollout planned in the near future. Voice features will only be available on iOS and Android, with image capabilities accessible across all platforms.
|
Why you should care: So far ChatGPT has been limited by its scope of accessibility. With the introduction of voice-to-text, vocal responses, image functionality & more, users now have access to a much more intuitive AI model. I’m beyond excited about these updates. |
0 Responses
Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.