Generative AI

The Generative AI Scientist Roadmap 2026

Some people want to “learn AI.” Others want to build the future. If you’re in the second category, bookmark this right now – because the Generative AI Scientist Roadmap 2026 isn’t another cute syllabus. It’s the no-nonsense, industry-level blueprint for turning you from “I know Python loops” into “I can architect agents that run companies.” […]

Is DeepSeek’s V3.2 the Most Powerful Open-source LLM?

If you’ve been watching the open-source LLM space, you already know it has turned into a full-blown race. Every few months, a new model comes out claiming to push the boundary, and some genuinely do. Chinese labs especially have been moving fast, with models like GLM 4.6, Kimi K2 Thinking, Qwen 3 Next, ERNIE-4.5-VL and

Nano Banana Pro vs Grok Imagine for Image Generation and Editing

The AI image world today is split between two giants. One is backed by Google’s Gemini, while the other carries the unmistakable Elon Musk aftertaste. We know the former as Nano Banana Pro – an upgraded, souped-up version of the already-iconic Nano Banana. Challenging it head-to-head is Grok Imagine, the

Google Earth AI: Unlocking geospatial insights with foundation models and cross-modal reasoning

Google Earth AI is our family of geospatial AI models and reasoning agents that provides users with actionable insights, grounded in real-world understanding. Today, we’re sharing our latest Earth AI innovations and expanding access to these new capabilities on Google Earth and Google Cloud. For years, Google has developed AI models that enhance our understanding

How we are building the personal health coach

The personal health coach is built with Gemini models to deliver personalized and adaptive coaching, grounded in science and informed by expert oversight. Historically, health and fitness journeys have been fragmented, generic and inaccessible, whether within existing apps or through general health and fitness journeys outside of apps. For instance, a primary care provider might

StreetReaderAI: Towards making street view accessible via context-aware multimodal AI

We introduce StreetReaderAI, a new accessible street view prototype using context-aware, real-time AI and accessible navigation controls. Interactive streetscape tools, available today in every major mapping service, have revolutionized how people virtually navigate and explore the world — from previewing routes and inspecting destinations to remotely visiting world-class tourist locations. But to date, screen readers

Toward provably private insights into AI use

We detail how confidential federated analytics technology is leveraged to understand on-device generative AI features, ensuring strong transparency in user data handling and analysis. Generative AI (GenAI) enables personalized experiences and powers the creation of unstructured data, including summaries, transcriptions, and more. Insights into real-world AI use [1, 2] can help GenAI developers enhance their tools

Introducing Nested Learning: A new ML paradigm for continual learning

We introduce Nested Learning, a new approach to machine learning that views models as a set of smaller, nested optimization problems, each with its own internal workflow, in order to mitigate or even completely avoid the issue of “catastrophic forgetting”, where learning new tasks sacrifices proficiency on old tasks. The last decade has seen incredible

Generative UI: A rich, custom, visual interactive user experience for any prompt

We introduce a novel implementation of generative UI, enabling AI models to create immersive experiences and interactive tools and simulations, all generated completely on the fly for any prompt. This is now rolling out in the Gemini app and Google Search, starting with AI Mode. Generative UI is a powerful capability in which an AI
