Since its debut last year, ChatGPT has made its mark in the toolkits of various companies, taking on a wide range of tasks, from summarizing documents to coding. This has ignited a competition among major tech players eager to launch their own generative AI solutions. Google, for instance, is preparing to unveil its counterpart to ChatGPT, known as Gemini.
Now OpenAI has raised the stakes by introducing two new features to ChatGPT: Voice and Image capabilities. These additions promise a more immersive and intuitive experience than ever before.
Voice and Image: A Game-Changer in Conversations
If you're traveling and feeling a bit lost, try this simple trick: take a picture of a landmark, and you can start a live conversation about all the interesting details.
Now, when you're wondering what to cook for dinner, just snap some photos of your fridge and pantry with your phone. ChatGPT can then provide you with step-by-step recipes tailored to what you have on hand.
And if your child needs help with homework, no problem. Take a picture of that tricky math problem, circle it, and you'll both get helpful hints to work through it together.
With Voice and Image capabilities, ChatGPT fits easily into your daily routine. Read on for a closer look at these exciting new features!
Rolling Out to Plus and Enterprise Users
The good news doesn't stop there! These remarkable Voice and Image features are set to roll out to Plus and Enterprise users over the next two weeks.
Voice functionality will soon be available on iOS and Android once you opt in within your settings.
Meanwhile, Image capabilities will be accessible across all platforms, making ChatGPT even more versatile.
Voice: Crafting Human-Like Conversations
Are you prepared to be amazed? You can now talk to ChatGPT, and it will respond to you! You can use your voice to have conversations, request bedtime stories, or resolve dinner table debates with your AI assistant.
To get started, go to Settings → New Features in the mobile app and enable voice conversations. Then, tap the headphone icon in the top-right corner of the home screen and choose your favorite voice.
You can choose from five distinct voices, with names such as "Juniper," "Breeze," and "Ember."
ChatGPT will then generate audio in the chosen voice, enabling tasks such as reading a story out loud.
The voice feature is powered by a new text-to-speech model that generates human-like audio from just text and a few seconds of sample speech. OpenAI worked with professional voice actors to create the voices, and Whisper, its open-source speech recognition system, transcribes your spoken words into text. Together, these make your chats with ChatGPT feel natural and engaging.
Images: Enhancing Visual Communication
The Image capability opens up a world of possibilities, from troubleshooting everyday challenges, like a stubborn grill that won't start, to planning meals by exploring the contents of your fridge, and even diving into work-related data analysis through complex graphs.
To make visual conversations more accurate, a drawing tool is now available in the mobile app. It lets you highlight a specific part of an image, so ChatGPT knows exactly which area you want to discuss.
For those eager to explore ChatGPT's image capabilities, here are two convenient methods within the ChatGPT mobile app:
First, you can tap the camera icon to the left of the message input bar to capture a new photo directly from your smartphone. Before sharing the image, you can use your finger to highlight the areas you'd like the chatbot to focus on.
Alternatively, you can select existing photos from your device. Smartphone users can choose files saved on their phones, while desktop users can upload saved photos from their computers through the browser. Video uploads are not currently supported, but you can submit multiple images within a single prompt.
According to Wired, ChatGPT gives the best results when you upload clear, well-lit images.
ChatGPT is remarkably good at identifying objects such as orchid plants and international coins, but it is not immune to occasional inaccuracies, so treat its responses with a degree of caution.
For instance, as Wired noted, ChatGPT mislabeled a daily multivitamin as a pill for treating erectile dysfunction. Thus, it's advisable to verify its responses when uncertainty arises.
Safety and Responsibility: A Top Priority
OpenAI is committed to safety and responsibility. They are introducing these capabilities gradually to ensure they are both powerful and secure. This step-by-step approach allows for ongoing improvements and the refinement of safety measures, which is especially important as they explore advanced voice and vision technologies.
Even so, avoid uploading personal or sensitive photos when testing out the image feature in ChatGPT.
If you wish to limit how long OpenAI retains your conversations and whether they are used for chatbot training, follow these steps:
Go to Settings, then select Data Controls, and disable Chat History & Training. With this setting turned off, new conversations are not used for training and are deleted after one month. Please note that you'll need to repeat this step in each web browser you use to access ChatGPT, whether on a PC or a mobile device.