Artificial intelligence took center stage at Google I/O, with the company unveiling a range of AI-powered features. One of the most exciting announcements was the introduction of “Ask Photos” for Google Photos. This innovative feature enables users to search their photo collections and ask questions using natural language input.
Ask Photos, powered by Gemini, offers a more sophisticated search experience than simply searching for photos of a specific person or object. It understands context and can answer complex questions, making it a game-changer for photo organization and discovery. For example, asking Ask Photos for a photo of your child treading water might return a single or multiple images of that exact moment.
With over 6 billion photos uploaded daily to Google Photos, users’ galleries are growing exponentially. As a result, finding specific photos or videos can become a tedious task, involving endless scrolling through pages, even with the right keywords.
Ask Photos revolutionizes this process by allowing users to search in a natural, conversational way. For example, you can ask Google Photos to “Show me the best photo from each national park I’ve visited.” This feature saves you from scrolling through countless photos, quickly providing the desired results and making photo discovery more efficient and enjoyable.
We often take photos to preserve memories or store important information, but this valuable information can get buried among our vast photo collections. Ask Photos solves this problem by extracting relevant details from your photos when you ask questions about your life, such as the location of last year’s camping trip or the expiration date of vouchers.
Leveraging Gemini’s multimodal capabilities, Ask Photos can comprehend the context and subject matter of photos to extract specific details. For instance, you can ask about the themes of Lena’s past birthday parties, and Ask Photos will analyze the decorations in the background or on the birthday cake to provide the answer. This feature makes it easy to uncover hidden information in your photos and relive cherished memories.
Google’s Gemini AI model has shown remarkable sophistication in processing natural language inputs. This feature, though it may seem creepy to some, has the potential to greatly assist users in searching through their vast photo collections on Google Photos. Gemini’s ability to understand context and subject matter makes it a powerful tool for photo discovery.
Ask Photos, a new feature coming to @GooglePhotos, makes it easier to search across your photos and videos with the help of Gemini models. It goes beyond simple search to understand context and answer more complex questions. #GoogleIO pic.twitter.com/OsYXZLo5S1
— Google (@Google) May 14, 2024
Google’s emphasis on AI is a natural response to the growing presence of generative AI models in the tech industry. Google has incorporated AI into various aspects of its services, and its competitors are following suit. OpenAI’s recent unveiling of its Omni model has demonstrated the rapid advancements being made in AI technology, and Apple is set to join the competition by unveiling its own generative AI efforts at the upcoming Worldwide Developer Conference.
Apple’s entry into the AI arena is expected to bring significant changes to its virtual assistant, Siri. The new Siri upgrade, powered by generative AI, will reportedly make the voice assistant more conversational and versatile. While it may not directly compete with commercial chatbots like ChatGPT, Siri’s new capabilities will focus on improving its existing tasks and providing a more private experience for users. Apple’s approach to AI is expected to prioritize on-device processing, emphasizing privacy and faster response times.
Maybe you liked other interesting articles?