Generative AI

Auto Added by WPeMatico

StreetReaderAI: Towards making street view accessible via context-aware multimodal AI

We introduce StreetReaderAI, a new accessible street view prototype using context-aware, real-time AI and accessible navigation controls. Interactive streetscape tools, available today in every major mapping service, have revolutionized how people virtually navigate and explore the world — from previewing routes and inspecting destinations to remotely visiting world-class tourist locations. But to date, screen readers […]

StreetReaderAI: Towards making street view accessible via context-aware multimodal AI Read More »

Toward provably private insights into AI use

We detail how confidential federated analytics technology is leveraged to understand on-device generative AI features, ensuring strong transparency in user data handling and analysis. Generative AI (GenAI) enables personalized experiences and powers the creation of unstructured data, including summaries, transcriptions, and more. Insights into real-world AI use [1, 2] can help GenAI developers enhance their tools

Toward provably private insights into AI use Read More »

Introducing Nested Learning: A new ML paradigm for continual learning

We introduce Nested Learning, a new approach to machine learning that views models as a set of smaller, nested optimization problems, each with its own internal workflow, in order to mitigate or even completely avoid the issue of “catastrophic forgetting”, where learning new tasks sacrifices proficiency on old tasks. The last decade has seen incredible

Introducing Nested Learning: A new ML paradigm for continual learning Read More »

Generative UI: A rich, custom, visual interactive user experience for any prompt

We introduce a novel implementation of generative UI, enabling AI models to create immersive experiences and interactive tools and simulations, all generated completely on the fly for any prompt. This is now rolling out in the Gemini app and Google Search, starting with AI Mode. Generative UI is a powerful capability in which an AI

Generative UI: A rich, custom, visual interactive user experience for any prompt Read More »

AfriMed-QA: Benchmarking large language models for global health

Afrimed-QA, a collection of contextually relevant datasets for evaluation of LLMs on African health question answering tasks, developed in partnership with organizations across Africa. Large language models (LLMs) have shown potential for medical and health question answering across various health-related tests spanning different formats and sources, such as multiple choice and short answer exam questions

AfriMed-QA: Benchmarking large language models for global health Read More »

Towards better health conversations: Research insights on a “wayfinding” AI agent based on Gemini

Google Researchers share user insights from a novel research AI agent that helps people find their way to better health information through proactive conversational guidance, goal understanding, and tailored conversations. The ability to find clear, relevant, and personalized health information is a cornerstone of empowerment for medical patients. Yet, navigating the world of online health

Towards better health conversations: Research insights on a “wayfinding” AI agent based on Gemini Read More »

The anatomy of a personal health agent

Learn about Google Researchers research prototype, an LLM-powered personal health agent that analyzes data from everyday wellness devices paired with health data, such as blood biomarkers, to offer evidence-based health insights and to provide a personalized coaching experience. The rapid advancement of large language models (LLMs), combined with data from wearable devices, presents a transformative opportunity

The anatomy of a personal health agent Read More »

AlphaEvolve: an LLM-based coding agent, to find and verify combinatorial structures that improve results on the hardness of approximately solving certain optimization problems.

Algorithms & Theory Google Researchers invoke AlphaEvolve, an LLM-based coding agent, to find and verify combinatorial structures that improve results on the hardness of approximately solving certain optimization problems. Recently, large language models (LLMs) have demonstrated surprising capabilities in competitive mathematics and competitive programming, demonstrating world-leading performance across both of these fields. However, their successes in mathematical discovery

AlphaEvolve: an LLM-based coding agent, to find and verify combinatorial structures that improve results on the hardness of approximately solving certain optimization problems. Read More »

A collaborative approach to image generation : PASTA

Google Research introduce PASTA, a reinforcement learning agent that refines text-to-image output over multiple turns of interaction with a user by learning their unique preferences. This process is made possible by a novel user simulation technique. You have a perfect image in your mind. You enter a prompt, hit generate, and the result is close

A collaborative approach to image generation : PASTA Read More »