Human-Computer Interaction and Visualization

Towards better health conversations: Research insights on a “wayfinding” AI agent based on Gemini

Google researchers share user insights from a novel research AI agent that helps people find their way to better health information through proactive conversational guidance, goal understanding, and tailored conversations. The ability to find clear, relevant, and personalized health information is a cornerstone of empowerment for medical patients. Yet, navigating the world of online health […]

Introducing interactive on-device segmentation in Snapseed

A novel mobile technology that facilitates real-time image segmentation, thereby improving the user experience for photo editing within Snapseed. The key to elevating a good photo often lies in selective image adjustments: brightening a subject in the foreground, enhancing the sky, or making the color of a jacket pop. Yet, isolating specific elements with

A collaborative approach to image generation: PASTA

Google Research introduces PASTA, a reinforcement learning agent that refines text-to-image output over multiple turns of interaction with a user by learning their unique preferences. This process is made possible by a novel user simulation technique. You have a perfect image in your mind. You enter a prompt, hit generate, and the result is close

XR Blocks: Accelerating AI + XR innovation

XR Blocks is an open-source framework to help you develop immersive experiences for the web, featuring XR realism, XR interaction, and AI + XR applications, with live demos at xrblocks.github.io. The combination of artificial intelligence (AI) and extended reality (XR) has the potential to unlock a new paradigm of immersive intelligent computing. However, a significant gap

Teaching Gemini to spot exploding stars with just a few examples

In a publication in Nature Astronomy, we show how Google’s Gemini model can be transformed into an expert astronomy assistant that classifies cosmic events with high accuracy and explains its reasoning in plain language, achieving 93% accuracy across three datasets by learning from just 15 annotated examples per survey. Modern astronomy is a treasure hunt

StreetReaderAI: Towards making street view accessible via context-aware multimodal AI

We introduce StreetReaderAI, a new accessible street view prototype using context-aware, real-time AI and accessible navigation controls. Interactive streetscape tools, available today in every major mapping service, have revolutionized how people virtually navigate and explore the world — from previewing routes and inspecting destinations to remotely visiting world-class tourist locations. But to date, screen readers

Generative UI: A rich, custom, visual interactive user experience for any prompt

We introduce a novel implementation of generative UI, enabling AI models to create immersive experiences and interactive tools and simulations, all generated completely on the fly for any prompt. This is now rolling out in the Gemini app and Google Search, starting with AI Mode. Generative UI is a powerful capability in which an AI
