What is Gemini Live? A first look at Google’s new real-time voice AI bot

Google

Following OpenAI’s Spring Update event yesterday, Google demoed its superpowered artificial intelligence voice assistant to rival GPT-4o. Gemini Live is a mobile conversational experience that leverages an improved multimodal AI model to offer users a more natural conversational experience in real time.

Also: Everything announced at Google I/O 2024: Gemini, Search, Android 15, and more

Gemini Live lets you have voice conversations with Gemini that feel natural and intuitive. For example, you can ask Gemini Live questions at your own pace and interrupt the AI bot mid-sentence to have it clarify or adjust how it’s replying, similar to what OpenAI showed off during its GPT-4o demo. Google will offer a variety of voices for users to choose from for their Gemini Live experience, as OpenAI has done with ChatGPT since integrating Whisper in September 2023.

Google plans to add the full multimodal experience to Gemini Live later this year, allowing Gemini to view the world around you when you open the camera during a conversation. This is yet another thing that users will be able to do with ChatGPT over the coming weeks through an update that will be first rolled out to ChatGPT Plus users. In the Gemini app, this will be powered by Google’s Project Astra.

Also: ChatGPT vs. ChatGPT Plus: Is a paid subscription still worth it?

Among this and other updates, Google upgraded Gemini Nano to process text, images, and sounds, no longer limited to text inputs. Gemini Nano with Multimodality will be available first for Pixel smartphones. 

Source Link

LEAVE A REPLY

Please enter your comment!
Please enter your name here