Gemini Updates & Release Notes

Follow

335 updates curated from 278 sources by the Releasebot Team. Last updated: Jun 6, 2026

Get this feed:
  • Jun 5, 2026
    • Date parsed from source:
      Jun 5, 2026
    • First seen by Releasebot:
      Jun 6, 2026
    Google logo

    Gemini by Google

    The latest AI news we announced in May 2026

    Gemini adds Gemini 3.5, Gemini Omni and a wave of new AI experiences across Search, Android, Health and hardware, bringing more proactive agents, smarter shopping, wellness tracking and next-gen devices.

    Here’s a recap of our biggest AI updates from May, including announcements from Google I/O 2026, the Android Show and Google Health.

    For more than 20 years, we’ve invested in machine learning and AI research, tools and infrastructure to build products that make everyday life better for more people. Teams across Google are working on ways to unlock AI’s benefits in fields as wide-ranging as healthcare, crisis response and education. To keep you posted on our progress, we're doing a regular roundup of Google's most recent AI news.

    Here’s a look back at some of our AI announcements from May.

    May 2026 was packed with AI news

    At Google I/O 2026, we officially entered the agentic Gemini era with the launch of Gemini 3.5 — which delivers frontier intelligence for agents and coding — and Gemini Omni, where Gemini’s ability to reason meets the ability to create. The Android Show set the stage with brand-new hardware built specifically for these tools, including the Googlebook from our hardware partners. We also broadened our personal wellness tools with the new Google Health app and Fitbit Air, and launched an initiative to apply advanced quantum science and AI to the life sciences. Ultimately, May was all about making AI more proactive, helpful and integrated into your everyday life.

    Experience Gemini Omni

    We announced Gemini Omni, our new model that can create anything from any input — starting with video. Omni allows you to combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge.

    Simulate real-world places

    By combining Project Genie with Street View, we introduced an experimental new way to simulate and explore highly realistic, interactive 3D environments of real-world places right from your browser.

    Learn how we partner with musical artists

    Through a new partnership between Google Flow Music and Believe, we're equipping artists and producers with a creative AI collaborator to help throughout the creation process, from brainstorming lyrics and melodies to putting final touches on a song.

    Discover what’s possible with Gemini 3.5

    We launched our latest family of models combining frontier intelligence with action. With powerful new action-taking capabilities, Gemini 3.5 is built to help you reliably execute complex, multi-step agentic workflows across your apps.

    Connect with your proactive 24/7 partner

    The Gemini app is becoming a more helpful AI assistant, featuring an intuitive new UI, personalized daily briefs and Gemini Spark. Instead of just answering questions, it acts as a proactive helper — managing your inbox, scheduling appointments and anticipating your daily needs in the background.

    Get more done with our advanced AI models in Search

    Our next era for Search introduces new features that bring together the best of the web with the best of AI. We're launching information agents in Search to intelligently work in the background, 24/7. They'll monitor information on your behalf and send detailed updates with links to dive deeper and take action. We’re also bringing Antigravity and the agentic coding capabilities of Gemini 3.5 Flash right into Search, so Search can build generative UI and interactive visuals tailored to your questions, plus custom experiences like dashboards or mini apps for your ongoing tasks. And we introduced a new, intelligent Search box, marking its biggest upgrade in over 25 years.

    With agentic coding capabilities in Search, you’ll be able to ask Search to code a custom fitness tracker for you, using real-time data like reviews, live maps and weather to keep you on track.

    Stay in sync with Android Halo

    To help you manage your agents, we introduced Android Halo, a new space on your phone that lets you see your agents’ progress and receive contextual assistance without interrupting your flow.

    Simplify your shopping with Universal Cart

    Universal Cart is your new hub for shopping on Google. Universal Cart works across merchants and across services, so you can add things to your cart while you’re browsing Search, chatting with Gemini, watching YouTube or even reading your Gmail.

    Try the all-new Google Health app

    The Google Health app brings all of your health and wellness into one place debuting new and advanced capabilities.

    Meet the all-new Fitbit Air

    Fitbit Air is our smallest tracker yet — a proactive wellness partner that uses high-fidelity sensor technology in a tiny, discreet pebble that enables advanced health and fitness tracking like 24/7 heart rate, heart rhythm monitoring with Afib alerts, SpO2, resting heart rate, heart rate variability, sleep stages and duration, and more.

    Discover the new Googlebook

    Googlebook is our newly designed laptop experience built from the ground up for Gemini Intelligence. It features the Magic Pointer for contextual suggestions, custom widgets to help you organize your tasks, cross-device features with Android phones and powerful AI features to help you get more done. Googlebooks will be made by our hardware partners like Acer, Asus, Dell, HP and Lenovo.

    Upgrade your phone with Gemini Intelligence

    Android is becoming more proactive. With Gemini Intelligence, your advanced phone can better understand your context, turn spoken thoughts into polished text and proactively suggest actions to help you throughout your day.

    Bring next-gen Android to your car

    Gemini is becoming even more helpful on the road. The next generation of Android in the car brings highly conversational voice controls, proactive routing and richer entertainment options so you can do more and have more fun while driving.

    Get ready for new intelligent eyewear

    We revealed our upcoming intelligent eyewear, with new frames and features that let you get directions, send texts, snap photos and more — all without taking out your phone.

    Understand whether content online is AI-generated

    To make it easier to understand how content was created and edited, we're expanding our content transparency and verification tools in Google Search, Gemini, Chrome, Pixel and Cloud.

    Explore Gemini for Science

    We unveiled Gemini for Science, a collection of science tools and experiments designed to expand the scale and precision of scientific exploration.

    See AlphaEvolve’s real world impact

    Find out how AlphaEvolve has been tackling real-world problems — from optimizing complex logistical supply chains and chip design to simulating molecular systems and electrical power grids.

    Discover how AI can help protect our environment

    We launched the Google DeepMind Accelerator program in the Asia Pacific region, dedicated to supporting startups using frontier AI to address critical climate, energy and environmental challenges.

    See how quantum computing and AI could transform life sciences

    We launched the Research Program at the Intersection of Life Sciences & Quantum AI (REPLIQA), an initiative committing $10 million to five universities to apply advanced quantum science and AI to the life sciences, to improve human outcomes.

    Original source
  • Apr 1, 2026
    • Date parsed from source:
      Apr 1, 2026
    • First seen by Releasebot:
      Jun 3, 2026
    Google logo

    Gemini by Google

    April 2026

    Gemini adds project notebooks, global Google app connections, free Lyria 3 Pro music creation, a macOS app, and visual interactive answers for complex questions, expanding productivity and creativity across chat.

    Create a space for your projects and chats in Gemini

    Add your chats and sources to notebooks in Gemini and jump back into projects fast.

    Productivity

    Personal Intelligence is going global

    Connect your Google apps to Gemini for help that’s unique to you.

    Productivity

    Create music with Lyria 3 Pro at no cost

    Mix and customise your sound in entirely new ways with tracks up to three minutes long at no cost.

    Creativity

    Gemini, now on macOS

    Open the Gemini app from any screen on macOS, share what you’re working on, and get instant help.

    Models

    Understand complex concepts visually

    Turn your tough questions into custom interactive visuals directly in your chat with Gemini.

    Creativity

    Original source
  • All of your release notes in one feed

    Join Releasebot and get updates from Google and hundreds of other software products.

    Create account
  • May 29, 2026
    • Date parsed from source:
      May 29, 2026
    • First seen by Releasebot:
      May 31, 2026
    Google logo

    Gemini by Google

    9 demos of Gemini Omni and Gemini 3.5 in action

    Gemini introduces Gemini Omni and Gemini 3.5 Flash, bringing video generation and conversational video editing plus faster agentic coding and multi-step workflow execution. The rollout also expands availability across the Gemini app, Search, Workspace, AI Studio and enterprise tools.

    Gemini Omni

    With Gemini Omni, Gemini’s ability to reason meets the ability to create, while Gemini 3.5 is built to help you execute complex, agentic workflows.

    At Google I/O 2026, we announced our latest models: Gemini Omni and the Gemini 3.5 family of models.

    Gemini Omni is our new model that can create anything from any input, starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge. You can also easily edit your videos through conversation.

    Then there’s Gemini 3.5, our latest family of models combining frontier intelligence with action. This represents a major leap forward in building more capable, intelligent agents. We’re kicking off the series by releasing 3.5 Flash. It delivers frontier performance for agents and coding, excelling at complex long-horizon tasks that deliver real-world utility.

    To give you a clearer understanding of Gemini Omni and Gemini 3.5 Flash, here are 9 demos of what they can help you do.

    Gemini Omni

    Edit your videos through conversation. One capability that makes Omni special is that it gives you an easier way to edit video — with natural language. Every instruction builds on the last. Your characters stay consistent, the physics hold up and the scene remembers what came before. That means you can transform the world around you. Change specific things, or change everything. Your video becomes the starting point for something you never could have filmed yourself.

    Reimagine the action. Take a video you shot and just ask Omni to change what’s happening. Edit the action, add in new characters or objects or transform a moment into something unexpected.

    Refine your videos across multiple turns. Change the environment, angle, style or even specific details, without ever losing the thread of your original scene. Scroll through the carousel to see how edits build on each other.

    Gemini 3.5 Flash

    Take on agentic tasks at scale. 3.5 Flash delivers intelligence that rivals large flagship models on multiple dimensions, at the speeds you have come to expect from the Flash series. This balance of speed and performance makes 3.5 Flash ideal for tackling long-horizon agentic tasks. Here, powered by Antigravity, 3.5 Flash executes multi-step workflows to automatically rename and categorize unstructured assets based on dynamic criteria.

    When coupled with the updated Antigravity harness, 3.5 Flash becomes a powerful engine for deploying collaborative subagents to tackle problems at scale for the most demanding use cases. Under supervision, it can reliably execute multi-step workflows and coding tasks while sustaining frontier performance.

    Create richer, more interactive web UIs and graphics with 3.5 Flash. 3.5 Flash builds on the strong multimodal foundation of Gemini 3. Watch as 3.5 Flash generates different UX approaches for a checkout flow in just 60 seconds on AI Studio.

    Try personal AI agents and new intelligent experiences. 3.5 Flash is now the default model for the Gemini app and AI Mode in Search globally. Its agentic capabilities are powering new features to bring frontier-level intelligence to your daily life.

    The enhanced agentic coding capabilities of 3.5 Flash are delivering even more intelligent experiences in Search, like our new information agents. Operating in the background, 24/7, these agents intelligently reason across information to find exactly what you need at exactly the right moment. They will send a comprehensive update along with links to the web to dive deeper, so you can take action. Information agents will launch first for Google AI Pro & Ultra subscribers this summer.

    Now that we’re bringing the power of Google Antigravity and agentic coding capabilities of Gemini 3.5 Flash right into Search, Search can build the ideal response, in the right format for your question — completely on the fly. So you can get custom generative UI, including visual tools and simulations, tailored precisely to your needs. These generative UI capabilities will be available for everyone in Search this summer, free of charge.

    For your ongoing tasks like planning a wedding or establishing a new fitness routine, Search will also build you custom experiences – like dashboards, trackers or mini apps – that you can keep coming back to. You’ll be able to create your own custom experiences with Antigravity right in Search in the coming months, starting first for Google AI Pro and Ultra subscribers in the U.S.

    Then there’s the new Gemini Spark, your personal AI agent, which runs on Gemini 3.5 and uses the Antigravity harness. It runs 24/7, helping you navigate your digital life, taking action on your behalf while under your direction. It’s deeply integrated with the Workspace tools you rely on daily, like Gmail, Docs, Slides and more. Gemini Spark is now available to all Google AI Ultra subscribers in the U.S.

    Gemini Omni Flash is rolling out to all Google AI Plus, Pro and Ultra subscribers globally through the Gemini app and Google Flow. It’s also rolling out at no cost to users on YouTube Shorts and YouTube Create App. In the coming weeks, we'll also be rolling it out to developers and enterprise customers via APIs.

    Gemini 3.5 Flash is generally available via Google Antigravity, the Gemini API in Google AI Studio and Android Studio, Gemini Enterprise Agent Platform and Gemini Enterprise. It’s also available for everyone in AI Mode in Search and now rolling out to everyone globally in the Gemini app.

    Original source
  • May 28, 2026
    • Date parsed from source:
      May 28, 2026
    • First seen by Releasebot:
      May 28, 2026
    Google logo

    Gemini by Google

    Catch up on 12 major I/O 2026 moments

    Gemini releases a major I/O 2026 wave of AI updates, including Gemini Omni for video generation, Gemini 3.5 Flash, a redesigned app experience, Daily Brief, Spark, Universal Cart, smarter Search agents and new macOS features.

    Relive some of our top onstage moments this year, including the debut of our newest models, updates to Search and more.

    Our biggest, boldest new developments took center stage at Google I/O 2026. We announced technical breakthroughs, like Gemini Omni’s ability to create anything from any input, starting with video. And we shared product updates to help you day-to-day, like the brand new, intelligent Search box that will let you search across modalities, using text, images, files, videos or Chrome tabs as inputs. (And with plenty of other big I/O announcements, there’s a lot more where that came from!)

    In case you missed it, here are some of our most exciting I/O keynote reveals this year.

    1. Gemini Omni

    Gemini Omni is our new model that can create anything from any input — starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge. You can also easily edit your videos through conversation.

    First, we’re launching the first model in the Omni family: Gemini Omni Flash. Gemini Omni Flash is rolling out to all Google AI Plus, Pro and Ultra subscribers globally through the Gemini app and Google Flow. It’s also rolling out at no cost to users on YouTube Shorts and YouTube Create App.

    2. Gemini 3.5 Flash

    Our new Gemini 3.5 family of models combines frontier intelligence with action. We’re kicking off the series by releasing Gemini 3.5 Flash, which delivers frontier performance for agents and coding, excelling at complex long-horizon tasks that deliver real-world utility.

    Gemini 3.5 Flash is generally available via Google Antigravity, the Gemini API in Google AI Studio and Android Studio, Gemini Enterprise Agent Platform and Gemini Enterprise. It’s also available for everyone in AI Mode in Search and now rolling out to everyone globally in the Gemini app. We’re also hard at work on Gemini 3.5 Pro. It’s already being used internally, and we look forward to rolling it out next month.

    3. Information agents in Search

    We’re entering the era of Search agents, where you can easily create, customize and manage multiple AI agents for your many tasks, right in Search. We’re starting with information agents, which operate in the background, 24/7, to intelligently reason across the web, like blogs, news sites and social posts (plus our freshest data, such as real-time info on finance, shopping and sports). Information agents will help you stay updated on whatever matters most to you, sending a comprehensive update with exactly what you need at exactly the right moment, along with helpful links to explore further on the web.

    Information agents are rolling out this summer, starting first with Google AI Pro and Ultra subscribers. Simply add “keep me updated” to your search to create an information agent, and view your active agents via the side panel in AI Mode in Search.

    4. Google Antigravity-powered experiences in Search

    We’re bringing Antigravity and the agentic coding capabilities of Gemini 3.5 Flash right into Search, so Search can build you the ideal format exactly for your question, completely custom, on the fly. You can get dynamic layouts, interactive visuals and entire experiences, all created just for you. These generative UI capabilities will be available for everyone in Search this summer, free of charge.

    Some projects aren’t one-off questions — they're ongoing tasks. Also with Antigravity, Search will also code entire custom experiences, like tools, dashboards or trackers, just for you. It’s like building your own mini apps with Search. They’re especially awesome for those long-running tasks where you want to keep coming back, like planning a wedding or managing your home move. You’ll be able to build custom experiences with Antigravity, right in Search in the coming months, starting first for Google AI Pro and Ultra subscribers in the U.S.

    5. Daily Brief

    Daily Brief in the Gemini app is a new agent that gives you a personalized morning brief and organizes exactly what you need to know to start your day. This personalized digest is designed to be your first stop every morning.

    Once you opt in, Gemini works across your connected apps in the background. It gathers urgent updates from your Gmail inbox, tracks upcoming events from your Calendar and compiles relevant follow-up details into a skimmable briefing. It goes far beyond a simple summary. Daily Brief actively organizes and prioritizes based on your specific goals, even suggesting immediate next steps. You can easily steer it by giving responses a quick thumbs up or down over time.

    Daily Brief is rolling out to all Google AI subscribers (18+) in the Gemini app, starting in the U.S. In order to use Daily Brief, Google AI subscribers must have chosen to connect their Google apps.

    6. Universal Cart

    Our new Universal Cart is a truly intelligent shopping cart and your new hub for shopping on Google. It works across merchants and across services, so you can add things to your cart while you’re browsing Search, chatting with Gemini, watching YouTube or even reading your Gmail. The moment you add a product, your cart goes to work for you in the background. It finds deals and price drops, gives you insights on price history and alerts you when something comes back in stock.

    Universal Cart is rolling out across Search and the Gemini app in the U.S. this summer, with YouTube and Gmail to follow.

    7. Neural Expressive

    We've completely redesigned the Gemini experience from the ground up with Neural Expressive, our stunning new design language you’ll see from the moment you open the Gemini app or visit the site. The interface features fluid animations, vibrant colors, new typography and haptic feedback throughout. Model responses are where Neural Expressive truly comes alive. Instead of a wall of text, Gemini now designs tailored responses in real time — incorporating rich imagery, interactive timelines, narrated videos and dynamic graphics.

    Neural Expressive is now rolling out in the Gemini app on Android, iOS and the web to everyone.

    8. Gemini Spark

    This 24/7 personal AI agent in the Gemini app helps you navigate your digital life, takes action on your behalf and is under your direction. It’s integrated with Google’s suite of tools, like Gmail, Docs, Slides and more, and because it’s a cloud-based agent, it’s able to keep working in the background, even when you close your laptop or lock your phone. With Spark, you can set recurring tasks, teach it new skills and create complete workflows. You choose whether to turn it on and what apps it connects to, and it’s designed to ask you first before performing high-stakes actions like spending money or sending emails.

    Gemini Spark is rolling out to trusted testers, and we’re also rolling it out as a Beta for Google AI Ultra subscribers in the U.S.

    9. Gemini app for macOS

    We’re working on big updates to the Gemini app for macOS. We’ll be bringing Gemini Spark to the Gemini desktop app this summer so it can help with tasks involving your local files and automate workflows across your desktop.

    We’re also innovating on new voice experiences in the macOS app, similar to what we previewed at The Android Show. You won’t have to worry about all the “ums” or “what abouts” that happen as you think aloud. Using the context from your screen, Gemini can turn your free-flowing speech into precise drafts, instantly reformatting the text to capture your intent, right where your cursor is.

    The macOS app is available to download for all users, with Gemini Spark and the new voice features will roll out later this summer.

    10. Intelligent eyewear

    Our next big milestone for Android XR is intelligent eyewear. There will be two types of intelligent eyewear: audio glasses that offer spoken help in your ear, and display glasses that show you the information you need, right when you need it.

    Audio glasses are launching later this fall, and at I/O 2026, we revealed the first two designs. These glasses let you stay hands-free and heads-up for things like listening to music, taking photos, making calls, placing your usual coffee order or tapping into your phone apps without reaching into your pocket.

    11. SynthID

    Three years ago, we introduced SynthID, our industry-leading digital watermarking technology that embeds imperceptible signals into AI-generated content. Since then, we've integrated SynthID into our generative media models and products, watermarking over 100 billion images and videos and 60,000 years of audio assets, and brought SynthID verification to the Gemini app. We’re now expanding this verification capability to Search and also to Chrome in the coming weeks.

    Companies like OpenAI, Kakao and ElevenLabs are adopting SynthID to watermark more of their own AI-generated content. We’re also launching a new AI content detection API on Google Cloud’s Gemini Enterprise Agent Platform, giving businesses a robust tool to identify synthetic media across their operations.

    Additionally, we’re expanding Content Credentials across products. Pixel 10 was the first smartphone to provide Content Credentials for images in its native camera app, and we are expanding this technology to video on Pixel 8, 9 and 10 phones in the coming weeks. We’re also adding Content Credentials verification to the Gemini app, and to Search and Chrome in the coming months. This will show you if the origin of the content was AI or a camera, and if it’s been edited with generative AI tools.

    12. Gemini for Science

    Gemini for Science is a new collection of science tools and experiments designed to expand the scale and precision of scientific exploration. Building on the deep reasoning and research capabilities of Gemini as well as Deep Think and Deep Research, it includes new experiments on Labs as well as Science Skills to connect agentic platforms like Google Antigravity to over 30 major life science databases and tools.

    You can express interest to try Gemini for Science experiments on Google Labs, and Science Skills is available today on GitHub and directly in Google Antigravity.

    Original source
  • May 20, 2026
    • Date parsed from source:
      May 20, 2026
    • First seen by Releasebot:
      May 22, 2026
    Google logo

    Gemini by Google

    100 things we announced at I/O 2026

    Gemini launches a major I/O 2026 wave of AI upgrades, including Gemini 3.5 Flash, Gemini Omni, a redesigned Gemini app, new Search agents, Daily Brief, Universal Cart, and expanded tools for builders, creators, and researchers across Google’s products.

    We've been busy! Here’s a rundown of the top announcements, launches and demos at I/O 2026.

    This week at Google I/O 2026, we unveiled new models, agents and tools to help you build, search, create, discover, shop and get more done. You can dig into our I/O announcements — including an edited transcript of Google CEO Sundar Pichai’s remarks from the stage. And for a TL;DR (of sorts), keep scrolling for our annual list of 100 highlights from the event.

    Create and build with our most advanced models

    Gemini 3.5

    1. We launched Gemini 3.5 Flash: the first in our latest series of models combining frontier intelligence with action.
    2. Gemini 3.5 Flash is generally available today via our agent-first development platform Google Antigravity, the Gemini API in Google AI Studio and Android Studio.
    3. Gemini 3.5 Flash delivers intelligence that rivals large flagship models at speeds you expect from the Flash series. It outperforms Gemini 3.1 Pro on challenging coding and agentic benchmarks like Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%).
    4. Landing in the top-right quadrant of the Artificial Analysis index, 3.5 Flash delivers frontier-level intelligence at exceptional speed — proving you no longer have to trade quality for latency.
    5. Gemini 3.5 Flash is ideal for tackling long-horizon agentic tasks. What used to take a developer days or an auditor weeks, 3.5 Flash can now help complete in a fraction of the time, often at less than half the cost of other frontier models. It rapidly plans, builds and iterates to solve real-world problems, whether it’s developing new applications, maintaining codebases or helping to prepare financial documents. Building on the strong multimodal foundation of Gemini 3, 3.5 Flash generates richer, more interactive web UIs and graphics.
    6. We’re also hard at work on Gemini 3.5 Pro. It’s already being used internally and we look forward to rolling it out next month.

    Gemini Omni

    1. Gemini Omni is our new model that can create anything from any input — starting with video. It combines Gemini’s intelligence with the best of our generative media models for a new level of world understanding, multimodality and editing. We’re starting with video outputs now, but over time, Gemini Omni will be able to generate any output from any input.
    2. Gemini Omni combines an intuitive understanding of physics with Gemini's knowledge of history, science and culture, bridging the gap from photorealism to meaningful storytelling. It has an improved understanding of forces like gravity, kinetic energy and fluid dynamics, allowing you to create more realistic scenes.
    3. Videos created with Omni include our imperceptible SynthID digital watermark. You can easily verify content through the Gemini app, Gemini in Chrome and Search.
    4. You can reference anything. Gemini Omni turns any reference — image, text, video or audio — into a single, cohesive output. While only voice references will be supported for audio to start, we’ll roll out other types of audio inputs soon.

    Gemini Omni in the Gemini app, Google Flow and YouTube

    1. Gemini Omni Flash is rolling out now to all Google AI Plus, Pro and Ultra subscribers globally through the Gemini app and Google Flow. It’s also available today in YouTube Shorts Remix and the YouTube Create app to users (18+) at no cost.
    2. Creating, remixing and editing a video is easier than ever. Gemini Omni in the Gemini app offers a fluid, conversational way to create and edit videos — like applying cinematic zooms or changing backgrounds with a simple prompt.
    3. You can upload any photo or video from your camera roll, apply built-in templates with a single click and experience the magic without needing expensive equipment or technical jargon. You can even drop yourself into the action by creating a custom AI avatar that looks and sounds like you.
    4. For creatives using Google Flow, Omni Flash allows you to blend real-world inspiration with generated content and iterate conversationally. Gemini Omni Flash also improves character consistency, meaning identity and voice are preserved across every scene.
    5. And you can try the new Gemini Omni model at no cost in YouTube Shorts Remix, with an exciting new upgrade that lets you step directly into your favorite Shorts. Just select an eligible Short, prompt what you want changed — like adding yourself or any visual reference — and get a fresh new version with your edits.

    Search, shop and discover exactly what you’re looking for

    AI Search

    1. AI Mode is our most powerful AI Search, and it has surpassed more than 1 billion monthly users. And today we’re upgrading the experience with Gemini 3.5 Flash as the new default model, globally. We’re seeing incredible momentum, with AI Mode queries more than doubling every quarter since launch. And last quarter, we saw Search queries reach an all-time high.
    2. Today we’re launching the biggest upgrade to our Search box in over 25 years — a new, intelligent Search box, now completely reimagined with AI. You can search using text, images, files, videos and Chrome tabs and Search reasons across them all. You’ll continue to get a range of results from Search, just like you do today.
    3. We’re also making it even easier to continue the conversation with Search, bringing AI Overviews and AI Mode into one, seamless AI Search experience. You can flow effortlessly from your question, to a search results page with an AI Overview, to a follow-up in AI Mode, all with links to learn more. The new seamless AI Search experience is live today across desktop and mobile, worldwide.

    Information agents

    1. We’re entering the era of Search agents, where you can easily create, customize and manage multiple AI agents for your many tasks, right in Search. And we’re starting with information agents, which operate in the background 24/7 so you can stay updated on any topic, task or project that matters to you.
    2. Your agent will intelligently look across everything on the web, like blogs, news sites and social posts, plus our freshest data, such as real-time info on finance, shopping and sports, to monitor for changes related to your specific question. And your agent will send you an intelligent, synthesized update, with the ability to take action. You can spin up multiple information agents in Search simultaneously to get updated and make progress on all the things that matter to you.
    3. You’ll be able to put information agents to work for you this summer, rolling out first to Google AI Pro and Ultra subscribers.

    Generative UI and Antigravity in Search

    1. With the power of Google Antigravity and the agentic coding capabilities of Gemini 3.5 Flash, Search can build you the ideal format exactly for your question, completely custom, on the fly.
    2. You can get custom generative UI, with Search designing custom layouts, assembling components – like interactive visuals, tables, graphs or simulations – in real time – helping you better understand complicated topics. Generative UI with Antigravity is rolling out to Search this summer for everyone, free of charge.
    3. Some projects aren’t one-off questions — they’re ongoing tasks, like planning a wedding or managing a home move. For these, Search can go a step further — helping you build entire custom experiences, like dashboards or trackers that you can continue coming back to. You can think of these like mini apps for your own specific tasks.
    4. You’ll be able to build custom experiences like mini apps with Antigravity right in Search in the coming months, starting with subscribers.

    Personal Intelligence

    1. We’re expanding Personal Intelligence in AI Mode to more people in nearly 200 countries and territories across 98 languages — no subscription required.
    2. With AI Mode in Search, you can securely connect apps like Gmail and Google Photos, and soon Google Calendar. Personal Intelligence was designed with transparency, choice and control at its core. You’re always in control — you choose if and when you want to connect apps like Gmail and Google Photos.

    Universal Cart

    1. We’re introducing Universal Cart: a truly intelligent shopping cart and your new hub for shopping on Google. You’ll be able to add things to your cart while you’re browsing Search, chatting with Gemini, watching YouTube or even reading your Gmail. The moment you add a product, your cart goes to work for you in the background. It finds deals and price drops, gives you insights on price history and alerts you when something comes back in stock. The Universal Cart runs on our Gemini models, so your cart gets smarter as the models improve.
    2. It also uses intelligent reasoning to anticipate your needs and help solve problems before they arise. It’ll proactively flag any product incompatibilities and suggest alternatives. And since the cart was built on Google Wallet, it understands your payment method perks, loyalty information and merchant offers so it can intelligently help you choose between payment methods.
    3. Universal Commerce Protocol (UCP) makes checkout from your cart super smooth. For many of your favorite brands, you can check out right on Google in just a few taps with Google Pay, or transfer items straight to the retailer’s site and buy there.
    4. We’re rolling out the Universal Cart across Search and the Gemini app this summer, with YouTube and Gmail to follow.

    Streamline your day with the Gemini app

    Gemini Spark

    1. Gemini Spark is your 24/7 personal AI agent that helps you navigate your digital life, takes action on your behalf and is under your direction. It works in the background on your phone or laptop even while they’re turned off.
    2. Spark runs on Gemini 3.5 and is built on the Google Antigravity platform. It operates autonomously, under your direction. You choose to turn it on and it’s designed to check with you before taking major actions on your behalf.
    3. Gemini Spark is very early in its product journey, and we’re prioritizing safety in this first release — that’s why we’re rolling it out to trusted testers, and we’re planning on bringing the Beta to Google AI Ultra subscribers in the U.S. next week.
    4. Down the line, we’ve got a packed roadmap of features for Google Spark that we’ll be shipping throughout the summer. For example: You’ll be able to text or email Spark directly, create custom sub-agents and even authorize payments while specifying the budget and merchants.

    Daily Brief

    1. Daily Brief is our new out-of-the-box agent that organizes and prioritizes your day ahead with a personalized digest based on your goals, and suggests next steps.
    2. With Daily Brief, Gemini works overnight for you, gathering info for your day ahead. It analyzes your inbox, calendar, and tasks to find the most important things for you. It’s concise and insightful, connecting the dots across your life. It’s actionable, anticipating your needs and suggesting next steps. And it learns over time, remembering your preferences, dates and times.
    3. Daily Brief is rolling out starting today to all Google AI subscribers (18+) in the Gemini app, starting in the U.S. In order to use Daily Brief, Google AI subscribers must have chosen to connect their Google apps.

    Neural Expressive

    1. We’ve completely redesigned the Gemini experience from the ground up. From the moment you open the app or visit the site, you’re greeted with a new design language we call Neural Expressive, which has fluid animations, vibrant colors, new typography and haptic feedback throughout.
    2. We’ve simplified and streamlined everything, unifying our tools menu and making it easier to discover and generate gorgeous images, videos and music, with built-in templates you can instantly remix.
    3. The moment you send your prompt, Neural Expressive truly comes alive. You won’t see a wall of text anymore. Instead, Gemini will carefully lay out its response in real time, just for you. As you scroll, you might see interactive images that you can zoom in on and explore the information on a completely new level, or timelines that you can quickly skim, or embedded visuals.
    4. We’ve also completely transformed the Gemini Live experience — it now opens immediately and inline. Gemini Live is also using a new model that’s smarter, faster and less distracted by background noise.
    5. Soon you’ll even be able to pick out a regional dialect that resonates with you. We’ll be rolling these out in the coming weeks.

    Uplevel your building with agents

    Google Antigravity

    1. Google Antigravity is our agent-first development platform that allows anyone to be a builder. And today, Antigravity is massively expanding its suite of agentic capabilities, surfaces, integrations and product features.
    2. Google Antigravity 2.0 is a new, standalone desktop application that acts as a central home for agent interaction. You can orchestrate multiple agents to execute tasks in parallel, such as having one agent code a website while another generates brand assets.
    3. Antigravity CLI is for those who prefer to stay in the terminal. It’s a lightweight, high-velocity product surface that lets you create new agents instantly without a graphical user interface.
    4. Antigravity SDK gives you programmatic access to the same agent harness powering Google’s products that’s co-optimized for our Gemini models. This SDK lets you customize agent behavior and host them on your own infrastructure.
    5. We’re also launching native voice support for Gemini audio models, as well as integration with many surfaces and platforms, like Android, Firebase and Google AI Studio.
    6. The new Antigravity is unabashedly agent-first, focusing on the core agent conversations, agent-produced artifacts and multi-agent orchestration. The Antigravity agent harness, the invisible framework for Gemini to perform real-world tasks, has become much more powerful, with new core primitives such as subagents, hooks and asynchronous task management. Underpinning all of this are the Gemini models, with Gemini 3.5 Flash having been co-optimized with the Antigravity agent harness.
    7. Multi-day engineering efforts are collapsing into hours, if not minutes. This is made possible by the new subagent teamwork capability. We’re bringing this to you as an early research preview in Antigravity.
    8. We are unifying on Antigravity as the only platform you need for agent-first development. We took what we learned from how you used Gemini CLI and rolled those insights into the Antigravity CLI. We encourage users to migrate to Antigravity CLI and have published a guide to help you port your custom skills over. You’ll now get the same harness as Antigravity 2.0 and a unified agentic experience across all your surfaces.
    9. For enterprises, we’re allowing Google Antigravity to be connected directly to your Google Cloud projects, applying the same enterprise terms that you’d expect. For our existing Gemini Enterprise customers, you’ll soon see Antigravity rolled out in the coming months.

    Google AI Studio

    1. Coming soon, the new Google AI Studio app lets you capture an idea on the go and have a working prototype ready by the time you get to your desk.
    2. Google Workspace is now directly accessible from the apps you build within AI Studio. With this integration, you can build dashboards on top of your Sheets data, create tools that organize your users’ Drive or spin up apps that work with the documents and data your team already lives in. All without leaving AI Studio.
    3. Starting today, you can now build native Android apps right in the build tab. Just select “Build an Android app” and begin prompting.
    4. We’re also introducing support for the Google Play Console in AI Studio for developers to publish apps directly to the test track. You can preview your app on an Android Emulator running in the browser or install your app on an Android test device using the Android Debug Bridge (ADB). And you can connect your Google Play Developer account in AI Studio to publish your Android app to Google Play’s Internal Test Track with a single click.
    5. Builders who are just getting started with AI Studio can now deploy their first two apps to Google Cloud at no cost, no credit card required.
    6. If you want local development for faster iteration, you can now export directly to Google Antigravity. Your conversation history, project files and secrets all come with you, so you can pick up exactly where you left off, bring in your wider team and start scaling your development workflow.
    7. Within AI Studio, you now have more customization options to design how your app looks and feels. The AI Studio Build agent can automatically generate custom images on the fly using Nano Banana. This helps you build tailored interfaces or mock up specialized use cases without needing external placeholder assets. Our new edit tool lets you annotate directly in the preview window. You can draw on your app, tweak components and generate new visuals to iterate on your build, right in the flow.

    Managed Agents

    1. We’re launching Managed Agents in the Gemini API. With Managed Agents, a single API call to the Antigravity agent provisions a remote Linux environment where the agent can reason, plan and call tools using the harness; execute code and manage files in an isolated sandbox; and browse the web to fetch and process live data. Managed Agents are powered by the new Antigravity agent, built with Gemini 3.5 Flash and available via the Interactions API and in Google AI Studio.
    2. You can extend the Antigravity agent with your own instructions and skills. Instead of writing complex orchestration code, you can define everything in markdown files like AGENTS.md and SKILL.md and register them as a named agent.
    3. We announced the Build with Gemini XPRIZE Hackathon, a new global competition with a $2 million prize pool — the largest ever for a hackathon. We're asking developers to use Gemini to build real applications that solve some of the world's most pressing challenges.

    WebMCP and Chrome Dev Tools

    1. We’re giving you a first look at WebMCP, a proposed open web standard that allows you to expose structured tools like JavaScript functions and HTML forms to browser-based agents.
    2. Modern Web Guidance, now available in early preview, is a set of evergreen and expert-vetted skills that guide your AI coding tools across many common use cases to build modern web experiences that are also accessible, performant and secure.
    3. Scale your workflow with Chrome DevTools for agents, which provides your AI agent with the visibility it needs to verify, debug, and optimize code in real time - available for Google Antigravity and more than 20 other coding agents now.

    Subscriptions

    1. We’re launching a new $100 AI Ultra plan, specifically tailored for developers, technical leads, knowledge workers and advanced creators. It includes 5X higher usage limits in the Gemini app and Antigravity than our AI Pro plan, plus 20TB of cloud storage and more features to help accelerate your dev cycles and bring the frontier of intelligence into your workflow.
    2. Google AI Pro paid subscriptions now include the YouTube Premium Lite individual plan at no extra charge. This adds $8.99 in value monthly and lets you watch most YouTube and YouTube Kids videos ad-free, offline and in the background for an enhanced entertainment experience.

    Get more done at work and elevate your creativity

    AI Inbox

    1. Earlier this year, we introduced AI Inbox as a new view in Gmail that intelligently surfaces what matters most. It helps you prioritize your to-dos and provides updates on what’s important. It’s currently available for Google AI Ultra subscribers, and we’re starting to roll it out to all Google AI Plus and Pro subscribers in the U.S.
    2. AI Inbox now generates personalized draft replies based on contextual information so you can review and respond in seconds.
    3. Starting now, in AI Inbox, if a task requires reviewing a Google Doc, Sheet or Slide, the relevant link surfaces right next to your to-do.
    4. We announced more ways to streamline your task management in AI Inbox. Keep your view clutter-free by marking individual tasks as done, dismissing unhelpful suggestions or marking all emails in a given topic as read with a single click.
    5. Starting this summer, Google AI Pro and Ultra subscribers will be able to talk to your inbox with Gmail Live — asking specific queries without having to dig through threads.

    Google Pics

    1. We’re introducing Google Pics, our new image creation and editing tool, built on our latest Nano Banana model, that helps you create just about anything – from party flyers to infographics, with the creative controls you want. Whether you’re building a design from a blank canvas or editing an existing photo, Pics takes the hassle out of complex image generation. The new tool includes features including object segmentation (so you can select and edit specific elements with precision), text editing and translation, and integrations with Workspace.
    2. Google Pics is launching today to a limited group of trusted testers. This summer, it will roll out globally to Google AI Pro and Ultra subscribers and in preview for Google Workspace business customers.

    Docs Live and Talk to Keep

    1. Docs Live is a new feature that lets you create docs and edit them with your voice. Just talk, and Docs Live handles the heavy lifting — organizing your thoughts, structuring your document, and, with your permission, pulling relevant details from your Gmail, Drive, Chat and the web. Google AI Pro and Ultra subscribers will be able to talk to Docs starting this summer.
    2. You will soon be able to “brain dump” with Keep. Keep not only understands your rambles, it will get to work in the background, turning your stream of thoughts into organized notes and lists at the speed of your voice. This feature is rolling out this summer for Google AI Pro and Ultra subscribers and in preview to Google Workspace business customers.

    Google Flow

    1. On last year’s I/O stage, we introduced Google Flow. Since then we expanded Flow into an AI creative studio, with new capabilities in video and image generation and editing, and launched in over 140 countries around the world. Gemini Omni Flash is now available in Google Flow to Google AI subscribers globally.
    2. For creatives using Google Flow, Gemini Omni Flash allows you to blend real-world inspiration with generated content and iterate conversationally. Gemini Omni Flash also improves character consistency, meaning identity and voice are preserved across every scene.
    3. We’re also introducing Google Flow Agent. Until today, Flow could only execute one prompt at a time. Now, your agent can take on multi-step tasks. Your agent in Google Flow is your creative partner that can plan and reason through complex tasks with your inputs, under your control.
    4. Built with Gemini models, Google Flow Agent brings expertise and a deep understanding of your project to help with early brainstorming, creating and editing. For example, the agent can be a sounding board for dialogue between characters in a specific scene, or even make plot recommendations. When you’re deeper in a project, your agent can help create multiple variations at one time to give you more options, and even batch edit, so your tweaks are reflected across all your assets. Once you have your assets, your agent can organize them into collections, and it can even intuitively rename them. Google Flow Agent is now available to all Google Flow users globally.
    5. We’re also introducing Google Flow Tools. Now you can vibe code any creative tool you can think of, right in Google Flow, custom built by you for your unique creative process – like designing video effects, hand-drawn animations, or layering text.
    6. You can use natural language to create bespoke tools and workflows in Google Flow. Whether you’re looking for a particular image editor, video resizer or custom shaders, you can now easily develop them, no coding experience required. And if you create something you think others might like, you can easily share your tool with other Flow users, who can remix it into their own. All Google Flow users globally are able to use existing Tools, while Google AI subscribers can also create and remix them.

    Google Flow Music, Pomelli and Stitch

    1. With Google Flow Music, you’re also able to use Gemini Omni to work conversationally with the agent to direct shareable music videos. New refinement capabilities let you edit specific portions of your song with increased granularity. For example, you can change a section of lyrics to a different language, change the genre, adjust the instruments or fine-tune just about anything else you can imagine.
    2. Pomelli is adding new ways to build brand content and design websites.
    3. With real-time design and steering, designing with Stitch is now a more natural and intuitive collaboration. You can now describe what you want through text, or say it aloud, and Stitch works alongside you to build out and reflow your ideas. You can also import existing codebase and design files, ensuring your builds are on-brand.

    Drive scientific breakthroughs and discoveries

    Gemini for Science

    1. We introduced Gemini for Science, a new collection of science tools and experiments designed to expand the scale and precision of scientific exploration at every stage of the research process. It includes three new experimental tools on Google Labs that can streamline daily scientific tasks, whether staying on top of new published papers, transforming research goals into usable code or generating new hypotheses. Those tools are:
    2. Hypothesis Generation, built with Co-Scientist, simulates the scientific method. It collaborates with researchers to define a research challenge, then uses a multi-agent “idea tournament” to generate, debate and evaluate hypotheses. To ensure absolute rigor, claims are deeply verified and supported by clickable citations.
    3. Computational Discovery, an agentic research engine, built with AlphaEvolve and Empirical Research Assistance (ERA), which generates and scores thousands of code variations in parallel. This allows scientists to test novel modeling approaches — for complex fields like solar forecasting or epidemiology — that would take months to navigate manually.
    4. And Literature Insights, built with NotebookLM, which searches scientific literature and structures results into tables with custom, searchable attributes for side-by-side analysis. Researchers can use chat to uncover nuances grounded in their curated corpus, and create high-fidelity artifacts such as reports, slide decks, infographics, and audio and video overviews.
    5. We’re gradually opening access to these three experimental tools from 19 May. Users can visit labs.google/science to register interest.
    6. As part of Gemini for Science, we’re also launching Science Skills, a specialized bundle that integrates insights from over 30 major life science databases and tools including UniProt, AlphaFold Database, AlphaGenome API and InterPro. Using these skills on agent-first agentic platforms like Google Antigravity allows researchers to perform complex and often manual workflows like structural bioinformatics and genomic analyses in minutes rather than hours. Science Skills is available from May 19 on Github and for all Google Antigravity users.
    7. We’ve also created dedicated pilots with leading scientific conferences like ICML, STOC and NeurIPS to develop pioneering tools for agentic peer review and scientific validation such as our experimental Paper Assistant Tool (PAT) and ScholarPeer.

    Transform the way you learn and explore

    Ask YouTube

    1. We’re reimagining how users can search and discover content they’re interested in with our new conversational search experience, Ask YouTube. With Ask YouTube, you can ask more complex search queries, such as wanting tips on how to teach your kid to ride a bike. Ask YouTube will compile the most relevant videos across all of YouTube’s catalogue — including long-form videos and Shorts — and provide an interactive, structured response.
    2. Ask YouTube will begin rolling out this month on desktop as an experiment to a subset of users searching in English in the US.

    Android XR

    1. The next big milestone for Android XR is intelligent eyewear. There will be two types of intelligent eyewear: audio glasses that offer spoken help in your ear, and display glasses that show you the information you need, right when you need it.
    2. Our first audio glasses, made in partnership with Gentle Monster, Warby Parker, and Samsung, will arrive this fall and will be compatible with Android and iOS devices.

    SynthID

    1. Three years ago, we launched SynthID, our industry-leading watermarking technology that embeds imperceptible signals into AI-generated content. Our goal is to make it easier to learn more about the content you encounter online. That’s why we recently added SynthID verification for image, video and audio to the Gemini app. Already, it’s been used 50 million times globally, and we’re expanding this verification capability to Search today and Chrome over the coming weeks.
    2. You can learn about an image's origin by using Search features like Lens, AI Mode, and Circle to Search, as well as Gemini in Chrome. Just ask, "Is this made with AI?” or “Is this AI generated?”
    3. We’re also adding verification for C2PA Content Credentials, to easily check if content is an unaltered original from a camera or if it has been modified, and by what tools. This feature is rolling out in the Gemini app starting today, and it will come to Search and Chrome in the coming months.
    4. Because digital media travels across many platforms, industry-wide partnership and adoption of robust, interoperable tools is essential. More content across the web will soon carry these imperceptible watermarks, as companies like OpenAI, Kakao and ElevenLabs are bringing SynthID technology to more of their AI-generated content.
    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 20, 2026
    Google logo

    Gemini by Google

    Making it easier to understand how content was created and edited

    Gemini expands content transparency and verification with SynthID and C2PA checks across Search, Chrome, Pixel and Cloud, plus a new AI Content Detection API. It helps users and businesses identify how media was created, edited, and whether it was AI generated.

    We're expanding our tools to help you understand how content was created and edited across the web.

    As generative media becomes more advanced and accessible, it’s helpful to know where content comes from, and whether it’s been altered. Today, we’re expanding our content transparency and verification tools in Search, Gemini, Chrome, Pixel and Cloud, and deepening our partnership with the broader industry.

    Scaling our technology

    Three years ago, we introduced SynthID, our industry-leading digital watermarking technology that embeds imperceptible signals into AI-generated content. Since then, we've integrated SynthID into our generative media models and products, watermarking over 100 billion images and videos and 60,000 years of audio.

    Across a growing number of our generative media tools, we use C2PA Content Credentials, the industry standard that shows how media was created and modified, with or without AI. Pixel 10 was the first smartphone to provide Content Credentials for images in its native camera app, and we are expanding this technology to include video on Pixel 8, 9 and 10 phones in the coming weeks.

    By using this technology at the point of capture, Pixel documents when content has been captured by a camera. In an era of generative media, we believe that identifying authentic, unedited content can be just as important as knowing when a file was made or edited using AI.

    Providing more ways to verify content

    Our goal is to make it easier to learn more about the content you encounter online. That’s why we recently added SynthID verification for image, video and audio to the Gemini app. Already, it’s been used 50 million times globally, and we’re expanding this verification capability to Search today and Chrome over the coming weeks.

    You can learn about an image's origin by using Search features like Lens, AI Mode and Circle to Search, as well as Gemini in Chrome. Just ask, "Is this made with AI?” or “Is this AI generated?”

    We’re also adding verification for C2PA Content Credentials, to easily check if content is an unaltered original from a camera or if it has been modified, and by what tools. This feature is rolling out in the Gemini app starting today, and it will come to Search and Chrome in the coming months. This builds on features like the labels on YouTube that identify AI-generated content and our work with trusted testers on Backstory to make detection tools faster and more reliable.

    Partnering across the industry

    Because digital media travels across many platforms, industry-wide partnership and adoption of robust, interoperable tools is essential. More content across the web will soon carry these imperceptible watermarks, as companies like OpenAI, Kakao and ElevenLabs are bringing SynthID technology to more of their AI-generated content. This builds on our ongoing work to make content origins clearer, like open-sourcing our SynthID text watermarking technology and partnering with NVIDIA to watermark AI-generated video from their Cosmos world foundation models.

    To help more organizations identify AI-generated media, we’re launching a new AI Content Detection API on Google Cloud’s Gemini Enterprise Agent Platform. This gives businesses a powerful way to spot AI content made by both Google and other popular models, helping them decide how to evaluate and manage media across their own platforms — whether that’s for backend operations like sorting feeds and preventing insurance fraud, or for user-facing content like fact-checking and labeling synthetic media. We’re launching with a group of trusted partners and will continue to refine the API based on their feedback.

    We also continue to advocate for global standards for provenance technology as a member of the C2PA steering committee. This ensures that transparency tools built into our devices work seamlessly across the platforms you use every day. For example, Meta — a fellow C2PA Steering Committee member — will start labeling camera-captured media with Content Credentials on Instagram. This means authentic photos and videos shot natively on Pixel phones will soon be recognized and labeled as such when you share them on Instagram.

    We’ve long invested in providing helpful context about the information you find online. Content transparency is a complex challenge, but we’ll keep developing ways to push the technology forward and set a high bar for the industry. Our goal is to empower you with the tools needed to determine the history of any content you encounter.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 20, 2026
    Google logo

    Gemini by Google

    The Gemini app becomes more agentic, delivering proactive, 24/7 help

    Gemini introduces a redesigned AI assistant experience with a new UI, Daily Brief and Gemini Spark for proactive help, plus Gemini Omni for cinematic video creation and a macOS app with new voice features. It also expands connected apps and rolls out Gemini 3.5 Flash.

    Gemini is becoming a more helpful AI assistant, with an intuitive new UI, proactive daily briefs and Gemini Spark, an agent to help you get things done around the clock.

    It’s been a banner year for the Gemini app. Last year at Google I/O, Gemini was serving 400 million users. Today, more than 900 million people across 230 countries and more than 70 languages turn to Gemini for help every month.

    In time for Google I/O 2026, here’s what's new:

    • Gemini 3.5 Flash: The first in our next generation of models that combines frontier intelligence with lightning-fast action.
    • Neural Expressive: A vibrant, dynamic and completely reimagined design language for Gemini.
    • Gemini Omni: Our new model that can seamlessly transform text, images and video prompts into cinematic, high-quality video outputs.
    • Daily Brief: A new agent that gives you a personalized morning brief and organizes exactly what you need to know to start your day.
    • Gemini Spark: A 24/7 personal AI agent designed to proactively manage tasks and help you navigate your digital life, all under your direction.
    • MacOS app: Our desktop app will be integrating Gemini Spark so it can operate on your local machine, and it will also add powerful new voice features.

    Neural Expressive: A new design language for the AI era

    We’ve redesigned the entire Gemini experience from the ground up, introducing a stunning new design language we call Neural Expressive. The interface now features fluid animations, vibrant colors, new typography and haptic feedback.

    We’ve also integrated the Gemini Live conversational experience directly into Gemini. Now, you can seamlessly switch from typing a quick question to diving deep into a free-flowing conversation — and back again — without missing a beat. We also re-engineered the mic so you can tap and talk through a complex idea at your own pace without getting cut off mid-thought. And soon, we’ll start offering regional dialects, allowing you to choose a voice that truly resonates with you.

    Finally, we’re using the power of our Gemini models to make responses more engaging and easier to understand. Instead of throwing a wall of text at you, Gemini now designs tailored responses in real time — incorporating rich imagery, interactive timelines, narrated videos and dynamic graphics.

    Neural Expressive is rolling out globally today across the web, Android and iOS for everyone.

    Gemini Omni: Turning your ideas into cinematic videos

    To unlock your creative potential, we're introducing Gemini Omni, a model designed to turn your imagination into reality. By seamlessly combining text, images and video inputs, Gemini Omni allows you to generate stunning, high-quality video outputs effortlessly.

    With Gemini Omni, video editing becomes a fluid, natural conversation. You can apply cinematic zooms or swap out backgrounds with a simple prompt. Just upload footage from your camera roll, apply built-in templates with a single tap and create polished content without expensive equipment or specialized technical jargon. You can even drop yourself directly into the action by creating a custom AI avatar that looks and sounds exactly like you.

    Gemini Omni begins rolling out today to Google AI Plus, Pro and Ultra subscribers worldwide.

    Daily Brief: Start your day on the right foot

    We’re introducing Daily Brief, an agent that gives you a personalized morning digest that’s designed to be your first stop every day. Built on the success of our recent Google Labs experiment CC, Daily Brief gives you a seamless, intuitive entry point into the world of AI agents.

    Once you opt in, Gemini works across your connected apps in the background. It gathers urgent updates from your Gmail inbox, tracks upcoming events from your Calendar and compiles relevant follow-up details into a skimmable briefing.

    It goes far beyond a simple summary. Daily Brief actively organizes and prioritizes based on your specific goals, even suggesting immediate next steps. You can easily steer it by giving responses a quick thumbs up or down over time.

    Daily Brief begins rolling out today to Google AI Plus, Pro and Ultra subscribers, starting in the U.S.

    Gemini Spark: From information to action

    We’re also introducing Gemini Spark, a 24/7 personal AI agent that helps you navigate your digital life. Spark represents a big shift for Gemini, transforming it from an assistant that can answer your questions into an active partner that does real work on your behalf and under your direction.

    Gemini Spark runs on Gemini 3.5 and uses the Antigravity harness. It’s deeply integrated with the Workspace tools you rely on daily, like Gmail, Docs, Slides and more. Even better, because it is a cloud-based agent, Spark keeps working in the background even when you close your laptop or lock your phone. That combination means Spark is ready to take complex tasks off your plate so you can be more present for what matters most.

    With Gemini Spark, you can:

    • Set recurring tasks or triggers: Automatically parse monthly credit card statements to flag new or hidden subscription fees.
    • Teach it new skills: Direct it to check your inbox for ongoing updates from your kids' school, extract critical deadlines and send a consolidated daily digest to you and your partner.
    • Create complete workflows: Ask it to synthesize raw meeting notes across emails and chats, create polished Google Docs with its findings and even draft the companion email kicking off a project.

    This is just the beginning. We’ve got a packed roadmap of features shipping over the summer. We’re expanding our list of Gemini connected apps with new MCP connections to Canva, OpenTable and Instacart launching today, and a full list of more partners are integrating now. In the coming weeks, Spark will be able to use these MCP connections to get things done for you. We'll also be adding new abilities, including texting and emailing Spark, creating custom sub-agents and operating your local browser.

    Spark operates under your direction. You choose whether to turn it on and what apps it connects to, and it’s designed to ask you first before performing high-stakes actions like spending money or sending emails.

    Gemini Spark will roll out to trusted testers this week, and we're planning to roll it out as a Beta for U.S. Google AI Ultra subscribers next week.

    Gemini app for macOS: Take control of your desktop

    We’re working on big updates to the Gemini app for macOS. We’ll be bringing Gemini Spark to the Gemini desktop app this summer so it can help with tasks involving your local files and automate workflows across your desktop.

    We’re also innovating on new voice experiences in the macOS app, similar to what we previewed at The Android Show. You won’t have to worry about all the “ums” or “what abouts” that happen as you think aloud. Using the context from your screen, Gemini can turn your free-flowing speech into precise drafts, instantly reformatting the text to capture your intent, right where your cursor is.

    The macOS app is available to download today for all users, with Gemini Spark and the new voice features will roll out later this summer.

    All of today’s updates get us closer to our vision of a truly universal assistant that’s personal, proactive and powerful. So whether you’re a busy student, parent or small business owner, we look forward to what you can do with Gemini.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 20, 2026
    Google logo

    Gemini by Google

    I/O 2026: Welcome to the agentic Gemini era

    Gemini adds a major wave of AI features across Search, the Gemini app, Docs, Flow and more, including Gemini 3.5 Flash, Gemini Spark, Ask YouTube, Docs Live and new image and transparency tools. The update pushes Gemini deeper into agentic, conversational and creative experiences.

    Here’s how we’re helping you get more done with Gemini.

    Editor’s note: Below is an edited transcript of Google CEO Sundar Pichai’s remarks at Google I/O 2026, adapted to include more of what was announced on stage. See all the announcements in our collection.

    It’s been an extraordinary year since our last I/O, a period of relentless shipping, technology advances and hyper progress. We’re now in the part of the AI cycle where people want to see the value in the products they use every day. We’ve been really focused on that, and you’ll see that in the products and features we’re announcing today at I/O.

    Ten years since we pivoted the company to be AI-first, we still see AI as the most profound way to advance our mission and improve people’s lives at scale. That’s why we’ve been taking a differentiated, full-stack approach to AI innovation, from our custom silicon and secure foundation, to our world-class research and models, to our products and platforms that touch billions of people. This approach enables us to iterate and innovate faster in ways that are lighting up every part of the company.

    What’s incredible is how people are using AI, whether it’s students prepping for final exams with the Gemini app, musicians and artists using generative AI models like Lyria and Veo as part of their creative flow, or developers coding and bringing their ideas to life.

    AI momentum across the full stack

    These stories of how people are using AI are the best measure of progress. To understand the scale at which people are adopting AI, there is another great proxy — tokens, the fundamental units of data our models process, many representing a problem being solved.

    Two years ago, we were processing 9.7 trillion tokens a month across our surfaces — a huge number. Last year at I/O, that grew to roughly 480 trillion tokens. Fast forward to today, that number jumped 7x to over 3.2 quadrillion per month.

    It tells an important story about our products and how others are building as well — especially developers and enterprises:

    • Over 8.5 million developers are now building new apps and experiences with our models monthly.
    • Our model APIs are now processing roughly 19 billion tokens per minute.
    • Over the past 12 months, over 375 Google Cloud customers each processed more than one trillion tokens, representing incredible demand for AI from across industries.

    Momentum with our products

    Today we have 13 products with over a billion users each. Five of those have more than 3 billion users.

    Our Gemini models are a big reason more people are using our products, and why they're using our products more.

    It all starts with Search, which is bringing the benefits of generative AI to more people than any other product in the world. AI Overviews now has over 2.5 billion monthly active users. And AI Mode has been a revelation, our biggest upgrade to Search ever. People love it, and in just a year, it’s already surpassed 1 billion monthly active users.

    When people use our AI-powered features in Search, they use Search more. Search has become less about individual queries and feels more like an ongoing conversation, giving you deeper insights and connecting you with the vastness of the web.

    Another place where we’ve been rapidly innovating is in the Gemini app. Last year at I/O, the Gemini app had 400 million monthly active users. Today, we’ve surpassed 900 million, more than doubling in a year. In that same time, daily requests have grown over seven times.

    We’ve been adding a lot of unique features like Personal Intelligence, which make responses more customized and helpful. And to date more than 50 billion images have been generated with our Nano Banana image generation models. It was a breakout star this past year, showing how much latent creativity there is in the world.

    Natural, conversational AI in products

    There’s also a lot of latent productivity to be unlocked. Over the last year, we’ve been bringing the ability to have more natural conversations with Gemini directly inside our products. Recently, Maps got its biggest upgrade in a decade, including a new feature called Ask Maps. People are using Ask Maps for more complex, and much longer questions.

    Now we’re bringing more natural conversational AI to more products.

    Ask YouTube

    People come to YouTube everyday to ask a lot of questions. There’s a lot of great videos, but sometimes it’s hard to know where to start.

    Ask YouTube entirely reimagines the experience, making information much more digestible and easy to navigate. You’ll see videos that best match your interest, and most importantly, it jumps right to the part of the video most relevant to you.

    We’re starting to test Ask YouTube now, and it will roll out broadly in the U.S. this summer.

    Voice-powered Docs Live

    There are a lot of times I want to get things done at the speed of my voice. That is much more possible today thanks to technical leaps in our audio models.

    A new feature called Docs Live takes this to another level. To create a doc with Gemini before, you had to type out a precise prompt. With Docs Live, you can just verbally “brain dump” whatever is on your mind, and let Gemini do the rest. Here’s a demo in real-time:

    In the future, you’ll be able to create new docs and edit them directly, all with your voice. Docs Live is rolling out for subscribers this summer, and powerful voice capabilities will come to Gmail and Keep then too.

    Infrastructure supporting innovation at scale

    It’s incredible to see the pace of innovation rolling out across our products. Supporting all of this scale for our users, while also serving enterprises and developers around the world, requires massive investments in infrastructure. We’ve been investing for now and for the future. In 2022, we were spending $31 billion annually in capex. This year, we expect that number to be about six times that, approximately $180 to $190 billion. A key part of this investment is our custom silicon.

    A decade ago, we announced our very first commercial tensor processing unit, or TPU, on the I/O stage. Since then, we have transformed how the industry builds for AI. We recently announced our 8th generation of TPUs at Cloud Next. For the first time, we’ve taken a dual chip approach with specialized architectures for training and inference: TPU 8t and 8i.

    • TPU 8t is optimized for large-scale pretraining, and it’s nearly three times the raw computing power of our previous generation. We’ve taken a fundamentally different approach with our training infrastructure. With JAX and Pathways, our training is no longer constrained by the limits of a single, massive data center. Instead, we can now seamlessly distribute training across multiple sites, scaling training across more than 1 million TPUs globally. This gives us the ability to create the largest training cluster in the world. For model builders, this means training larger, more capable models in weeks rather than months.
    • TPU 8i is designed for inference. We have dramatically improved speed at every step. Because if we learned anything in 27 years of working on Search, it's that latency matters.

    In addition to speed, we’re also thinking about scaling sustainably. Both chips are more energy efficient, delivering up to two times better performance-per-watt.

    Gemini Omni

    This progress with TPUs is how we can make compute advances across models, coding and agents. With world models, AI is moving from predicting text to simulating reality. We have been working to push the boundaries of what these models can do.

    Gemini Omni is our new model that is capable of generating samples in any output modality from any input. We’re starting with video outputs, and over time we’ll enable image and text. This new model combines Gemini’s intelligence with our generative media models — a huge leap forward in world understanding. We’re launching the first model in the Omni family: Gemini Omni Flash.

    Gemini Omni Flash is available starting today. You will be able to try it on the Gemini app, Google Flow, and on YouTube Shorts. We'll also be rolling it out to developers and enterprise customers via APIs in the coming weeks.

    New SynthID updates and partners

    As generative AI gets better, so does the need for greater transparency. Research shows people can correctly identify high-quality deepfake videos only about a quarter of the time. Three years ago, we launched SynthID, our watermark that is invisible to the naked eye. Since launch, SynthID has now watermarked over one hundred billion images and videos, along with sixty thousand years of audio assets.

    Millions of people are using our SynthID detector in the Gemini app to verify AI-generated content. And now we’re going a step further and adding Content Credentials verification across products. This will show you if the origin of the content was AI or a camera, and if it’s been edited with generative AI tools. We want more people to have easy access to these tools, so we’re expanding both Content Credentials and SynthID verification to Search and Chrome.

    Of course, this only works at scale if more partners decide to watermark their own AI-generated content.

    Nvidia signed on to SynthID last year. And today, we are thrilled to announce that OpenAI, Kakao and Eleven Labs are adopting SynthID, too. It’s great to see the cross-industry collaboration. We’re looking forward to expanding to more partners and setting the standard of transparency for the AI era.

    Gemini 3.5 Flash

    Gemini 3 launched a few months ago, with a full family of models. It’s our most adopted series yet. We've loved seeing developers use Flash as their daily driver, and build incredible experiences with Pro's deep reasoning and multimodal capabilities. We’ve been hard at work on improving these models, especially focused on agentic coding, long-horizon tasks and real-world workflows.

    Today, we’re introducing Gemini 3.5 Flash, our first in a series of models combining frontier intelligence with action. Two things I’d highlight:

    • When compared to 3.1 Pro, 3.5 Flash is better across almost all benchmarks. It’s made huge progress in coding — and look at the extraordinary jump in GDPVal. This captures many real-world economically valuable tasks.
    • Gemini 3.5 Flash is a very capable model, at the frontier and comparable to the best models, but it’s still very fast. Which is why when you look at the intelligence versus output speed, it’s in a league of its own in the top right quadrant. When looking at output tokens per second, it is four times faster than other frontier models.

    The new model has been a game changer for us internally at Google. We’ve been using 3.5 Flash with a reimagined version of our agent-first development platform Antigravity, and it’s dramatically accelerated how we build. In March we were processing half a trillion tokens a day internally across our AI developer tools, and we’ve been doubling every few weeks. Now, we’re processing more than three trillion tokens a day. This scale created a powerful feedback loop helping us improve 3.5.

    What’s amazing about Flash is how it delivers frontier-level capabilities at less than half the price of comparable frontier models. We’ve heard that many companies are already blowing through their annual token budgets, and it’s only May. If companies used a mix of Flash and other frontier models they could save a lot of money. To put this in perspective, top companies are processing about 1 trillion tokens a day. If they shifted 80% of their workloads from other frontier models to 3.5 Flash, they’d save over $1 billion dollars annually. That is real savings they can pour back into their company.

    Gemini 3.5 Flash is available for everyone today across our products and APIs. We’re also excited for Gemini 3.5 Pro. We are using it internally, it’s showing great improvements, and it will be coming next month.

    Antigravity 2.0

    We’re also bringing 3.5 Flash to developers in Antigravity.

    Antigravity is expanding beyond the coding environment, turning it into a platform to develop and manage cohorts of autonomous AI agents. This includes Antigravity 2.0, a new standalone desktop application that acts as a central home for agent interaction, where anyone can orchestrate agents for all sorts of tasks. And we developed an even more optimized version of Flash: not just 4x but 12x faster than other frontier models.

    Users in Antigravity can get a taste of this experience starting today.

    Gemini Spark is your 24/7 agent

    Gemini 3.5 and Antigravity are unlocking a new world of agents and agentic capabilities. We’ve been bringing agents to developers and enterprises for a while. Now we are super focused on bringing the power of agents, safely and securely, to consumers so that it works for everyone. You’ll see agentic experiences across many of our products today.

    I’m particularly excited for Gemini Spark, your personal AI agent in Gemini app that helps you navigate your digital life, taking action on your behalf and under your direction.

    • It runs on dedicated virtual machines on Google Cloud. And it’s 24/7 so you don’t need to keep your laptop open.
    • It’s powered by Gemini 3.5 and the Google Antigravity harness, which allows it to perform long-horizon tasks easily in the background.
    • Spark will integrate seamlessly with tools, starting with our own, and in the coming weeks with third-party tools through MCP.
    • And you can work with Spark however is most convenient: in the Gemini app or soon, through email and chat.
    • On Android, you will be able to view live updates and task progress of agents like Spark through a new UI space called Android Halo, coming later this year. Later this summer, Spark will operate directly within Chrome, acting as your agentic browser across the web.

    We’re starting to roll out Gemini Spark to trusted testers this week and the Beta is coming to Google AI Ultra subscribers in the U.S. next week.

    Search in the agentic era

    Gemini Spark is the first experience made possible by 3.5 models and Antigravity. This combination gives us new ways to accelerate our mission and transform our products to be radically more helpful.

    As we enter this agentic era, Search will be more helpful and powerful than ever. Today, we’re introducing information agents in Search. These are personalized AI agents you can set up to work in the background, 24/7, to find what you need at exactly the right moment, and help you take action. Information agents are rolling out this summer starting with Google AI Pro and Ultra subscribers.

    Another way we’re building a truly agentic Search is by infusing it with agentic coding capabilities. With the power of Gemini 3.5 Flash and Google Antigravity, Search will build custom experiences just for your individual questions, like dynamic layouts and interactive visuals. These generative UI capabilities will be available for everyone in Search this summer, free of charge.

    And for longer running tasks that you need to keep coming back to, Search can go a step further — building persistent, custom dashboards or trackers that you can return to and make progress on. You can think of these like mini apps for your own specific tasks. You’ll be able to build custom experiences with Antigravity, right in Search in the coming months, starting first for Google AI Pro and Ultra subscribers in the U.S.

    More from our agentic Gemini era

    Here’s a look what else we shared at I/O:

    • Daily Brief is another out-of-the-box agent coming to the Gemini app. It gives you a personalized digest and synthesizes information from your inbox, calendar and tasks to find the most important things to be aware of. And it’s not just summarizing data: it’s prioritizing, organizing and suggesting the next steps, so it’s easy for you to take action. All in this super concise morning digest that’s built for skimming.

    • Google Flow is rolling out a new agent today to everyone that can plan and reason through complex tasks with your inputs, under your control. Built with Gemini models, it brings expertise and a deep understanding of your project to help with early brainstorming, creating and editing. You can also vibe code any creative tool, right in Flow — like tools for designing video effects, hand-drawn animations or layering text.

    • Google Pics is our new AI image creation and editing tool, built on our latest Nano Banana model, that helps you create just about anything with the creative controls you want. Whether you’re building a design from a blank canvas or editing an existing photo, Pics treats every element as an individual object rather than a flat, static image. This allows you to create, swap or perfect specific details, so you can bring your exact vision to life. Google Pics is available now to trusted testers and will be rolling out later this summer to Google AI Pro and Ultra subscribers in Workspace.

    • We also shared more about our intelligent eyewear, which we first gave a glimpse of last year, including audio glasses that offer spoken help in your ear and display glasses that show you the information you need, right when you need it. Both let you stay hands-free and heads up, with help from Gemini just by asking. Audio glasses are launching first, coming later this fall.

    • Gemini for Science brings together a number of AI tools to help accelerate scientific research. Building on the deep reasoning and research capabilities of Gemini as well as Deep Think and Deep Research, it includes new experiments on Labs as well as Science Skills to connect agentic platforms like Google Antigravity to over 30 major life science databases and tools. Users can express interest to try Gemini for Science experiments on Google Labs, and Science Skills is available today on Github and directly in Antigravity.

    As we look across the full stack of innovation, from the infrastructure behind TPU 8i to the frontier capabilities of Gemini 3.5 and Antigravity, it’s clear we’re firmly in our agentic Gemini era. I’m excited to see how it will unlock new ways to accelerate our mission and transform our products to be radically more helpful, for everyone everywhere.

    See everything we announced here.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 20, 2026
    Google logo

    Gemini by Google

    2026.05.19

    Gemini adds Omni video creation, launches 3.5 Flash, integrates Live into chat, and brings richer interactive responses plus a new Google AI Ultra plan.

    Gemini Omni is your creative partner for video creation.

    • What: Gemini Omni helps you to create and edit videos as easily as having a conversation. It's like Nano Banana for videos. Blend any combination of text, photos and video to create high-quality video. You can even drop yourself right into the action by creating a custom AI avatar that looks and sounds like you. Start from scratch, remix your camera roll or try out a premade template. Just chat with Gemini naturally to add details, make edits and watch your ideas come to life.
      Gemini Omni is rolling out today in the Gemini app to all Google AI subscribers globally aged 18 and over. Feature availability varies by region.
    • Why: We want to empower users to create, regardless of their technical skill set or access to complex software. With Gemini Omni, your natural language is the only tool that you need to direct, refine and produce captivating videos.

    3.5 Flash delivers frontier-level intelligence for your daily life.

    • What: Gemini 3.5 Flash is our best model yet for getting challenging tasks done quickly and efficiently, giving you frontier-level intelligence at speed. Whether you need help with everyday tasks, like analyzing multiple documents quickly or multi-step projects, like prototyping or vibe coding, 3.5 Flash is designed to navigate real-world complexity and help you take action.
      Rolling out to everyone globally. To access, select '3.5 Flash' in the model drop-down.
    • Why: We want to make Gemini the most helpful AI assistant for everyone. With Gemini 3.5 Flash, you can now tackle your everyday tasks with greater confidence, without choosing between speed and quality.

    The next chapter of Gemini Live.

    • What: Gemini Live is now being integrated into chat, allowing you to switch seamlessly between talking and typing. Gemini now connects directly with your favourite apps to handle the heavy lifting – from comparing products to catching up on emails. While you talk, Gemini will even show you information like real-time maps and weather cards, or you can show Gemini what you're seeing to dream up new images with Nano Banana. Just talk and let Gemini handle the rest.
      Rolling out globally to Android and iOS users.
    • Why: We're making Gemini a more complete hands-free assistant. By working directly with your favourite tools, Gemini provides frictionless help through natural conversation.

    Gemini responses are now even richer, more dynamic and interactive.

    • What: Instead of just reading a wall of text, for specific topics Gemini can show more interactive explorations, so you can zoom deep into high-resolution images to spotlight layer-by-layer information or generate engaging narrated video overviews of 30–60 seconds. These new responses can also include interleaven images, timelines and more. For every prompt, Gemini will reason through the best way to organise information to ensure that you get the right information in a format that works for you.
      Interactive multi-layer images and Video Overviews are available in English globally for specific topics when using the 'Pro' model.
    • Why: As part of our new Neural Expression design language, we're moving past a plain wall of text to beautifully crafted, immersive interactions directly in your Gemini main chat. We want to make sure that your AI assistant is more intelligent, adaptive and naturally works with whatever modality is best for you.

    Introducing a new $100 USD Google AI Ultra plan

    • What: Get the best of Gemini with Google AI Ultra subscriptions now starting at $100 USD/month. Subscribers get higher access to our advanced Gemini models and powerful features like video generation with Gemini Omni and more. AI Ultra $100 USD/month also includes additional benefits such as 20 TB of cloud storage and YouTube Premium.
      Available globally.
    • Why: We're committed to bringing you Google's latest AI innovations faster. Google AI Ultra is designed for our most dedicated users, giving them the best of Gemini and Google AI wherever they are. In addition, Ultra subscribers get priority access to our newest AI advancements, helping them to shape the future of AI.
    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Introducing Gemini Omni

    Gemini launches Omni Flash, a new multimodal video model that creates and edits videos from text, images, audio and video. It brings conversation-based editing, stronger physics and world knowledge, avatar creation, and SynthID watermarking to Gemini app, Google Flow and YouTube Shorts.

    Gemini Omni Flash is a model that can create anything from any input – starting with video.

    Last year, Nano Banana brought Gemini's intelligence to image generation and editing. Since then, it’s helped millions of people restore old photos, design from sketches and visualize ideas in ways that weren’t possible before. From the start we built Gemini to be natively multimodal from the ground up, and now we’re taking the next step.

    We’re introducing Gemini Omni, where Gemini’s ability to reason meets the ability to create. Omni is our new model that can create anything from any input — starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini's real-world knowledge. You can also easily edit your videos through conversation.

    Today, we’re rolling out the first model in the Omni family: Gemini Omni Flash, to the Gemini app, Google Flow and YouTube Shorts. In time we will support output modalities like image and audio. Here’s some of what makes Omni special:

    Edit your videos through conversation

    Gemini Omni gives you an easier way to edit video — with natural language. Every instruction builds on the last. Your characters stay consistent, the physics hold up and the scene remembers what came before.

    Transform the world around you.

    Change specific things, or change everything. Your video becomes the starting point for something you never could have filmed yourself.

    Reimagine the action.

    Take a video you shot and just ask Omni to change what’s happening. Edit the action, add in new characters or objects, or transform a moment into something unexpected.

    Refine your videos across multiple turns.

    Change the environment, angle, style or even specific details, without ever losing the thread of your original scene. Scroll through the carousel to see how edits build on each other.

    Bring ideas to life, grounded in Gemini’s world knowledge

    Gemini Omni doesn't just build scenes that look real, it reasons about what should happen next. It combines an intuitive understanding of physics with Gemini's knowledge of history, science and cultural context, bridging the gap from photorealism to meaningful storytelling.

    Create visuals with more accurate physics.

    Omni has an improved intuitive understanding of forces like gravity, kinetic energy and fluid dynamics, allowing you to create more realistic scenes.

    Blend knowledge and creativity.

    Omni draws on Gemini's knowledge to connect language, imagery and meaning in ways that go far beyond pattern matching.

    Complex ideas made visual.

    Omni can create compelling explainers from short prompts, generating visuals that break down more complex ideas.

    Create videos from any combination of inputs

    Reference anything.

    Omni turns any reference — image, text, video or audio — into a single, cohesive output. While only voice references will be supported for audio to start, we’ll roll out other types of audio inputs soon.

    Start from what you have.

    With input references, you can use images of characters, scenes or drawings to create in a way that matches your vision.

    Apply styles, motion or effects.

    Define the visual language by using input references, or just describe it with natural language. Omni blends the input references to create a cohesive clip.

    Create videos with your own digital avatar

    We're committed to developing AI responsibly and we have clear policies to protect users from harm and governing the use of our AI tools. To start, you can create videos with your own voice by using Avatars, which create a digital version of yourself so you can generate videos that look and sound like you. Beyond the avatar feature, in terms of editing videos to change audio and speech, we are still working to test this and better understand how we can bring this capability to users responsibly.

    All videos created with Omni include our imperceptible SynthID digital watermark. You can easily verify that videos were generated with Gemini Omni through the Gemini app, Gemini in Chrome and Google Search. You can find out more about how we're expanding our content transparency and verification tools to help you understand how content was created and edited across the web in our blog post.

    Try Gemini Omni now

    Today, we’re launching the first model in the Omni family — Gemini Omni Flash. Gemini Omni Flash is rolling out today to all Google AI Plus, Pro and Ultra subscribers globally through the Gemini app and Google Flow. It’s also rolling out at no cost to users on YouTube Shorts and YouTube Create App starting this week.

    In the coming weeks, we'll also be rolling it out to developers and enterprise customers via APIs.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Gemini 3.5: frontier intelligence with action

    Gemini launches Gemini 3.5 Flash, bringing frontier-level agentic and coding performance to the Gemini app, AI Mode in Search, developer tools, and enterprise platforms. It also introduces new personal AI agent experiences and stronger safety safeguards.

    Gemini 3.5 is built to help you execute complex, agentic workflows.

    Today, we’re introducing Gemini 3.5, our latest family of models combining frontier intelligence with action. This represents a major leap forward in building more capable, intelligent agents. We’re kicking off the series by releasing 3.5 Flash. It delivers frontier performance for agents and coding, excelling at complex long-horizon tasks that deliver real-world utility.

    3.5 Flash is available today to billions of people globally

    • For everyone via the Gemini app and AI Mode in Google Search
    • For developers in our agent-first development platform Google Antigravity and Gemini API in Google AI Studio and Android Studio
    • For enterprises in Gemini Enterprise Agent Platform and Gemini Enterprise.

    We’re also hard at work on 3.5 Pro. It's already being used internally, and we look forward to rolling it out next month.

    3.5 Flash: frontier performance for agents and coding

    Gemini 3.5 Flash delivers intelligence that rivals large flagship models on multiple dimensions, at the speeds you have come to expect from the Flash series. It’s our strongest agentic and coding model yet, outperforming Gemini 3.1 Pro on challenging coding and agentic benchmarks like Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%), and leading in multimodal understanding (84.2% on CharXiv Reasoning). When looking at output tokens per second, it is 4 times faster than other frontier models.

    Landing in the top-right quadrant of the Artificial Analysis index, 3.5 Flash delivers frontier-level intelligence at exceptional speed — proving you no longer have to trade quality for latency.

    3.5 Flash: agentic tasks at scale

    This balance of speed and performance makes 3.5 Flash ideal for tackling long-horizon agentic tasks. What used to take a developer days or an auditor weeks, 3.5 Flash can now help complete in a fraction of the time, often at less than half the cost of other frontier models. It rapidly plans, builds and iterates to solve real-world problems, whether it’s developing new applications, maintaining codebases or helping to prepare financial documents.

    When coupled with the updated Antigravity harness, 3.5 Flash becomes a powerful engine for deploying collaborative subagents to tackle problems at scale for the most demanding use cases. Under supervision, it can reliably execute multi-step workflows and coding tasks while sustaining frontier performance.

    Building on the strong multimodal foundation of Gemini 3, 3.5 Flash generates richer, more interactive web UIs and graphics.

    3.5 Flash: real-world impact

    3.5 Flash’s real-world agentic capabilities are already driving meaningful progress for our developers and enterprises alike. In developing the 3.5 model series, we worked closely with industry partners to understand where toil and complexity arose in their workflows. Partners are seeing meaningful impact — from banks and fintechs automating multi-week workflows to data science teams unearthing insights amidst complex data environments.

    Personal AI agents: built with 3.5 Flash

    3.5 Flash is now the default model for the Gemini app and AI Mode in Search globally. At I/O today, we showed how its agentic capabilities are powering new features to bring frontier-level intelligence to your daily life.

    The new Gemini Spark, your personal AI agent, uses 3.5 Flash. It runs 24/7, helping you navigate your digital life, taking action on your behalf while under your direction. We’re starting to roll out Gemini Spark to trusted testers today, and we’re planning on bringing the Beta to Google AI Ultra subscribers in the US next week.

    The enhanced agentic coding capabilities of 3.5 Flash are also delivering even more intelligent experiences across Search, from introducing new information agents that work for you 24/7 to unlocking more dynamic generative UI experiences.

    Gemini 3.5: built with frontier safeguards

    Gemini 3.5 was developed in accordance with our Frontier Safety Framework. We have strengthened our cyber and CBRN safeguards, which means it's less likely to generate harmful content, and to mistakenly refuse to answer safe queries. We achieve this with new, more advanced safety training and mitigations, including interpretability tools that help check and understand the AI's inner reasoning before it provides a response.

    3.5 Flash is available today

    Gemini 3.5 Flash is generally available via Google Antigravity, the Gemini API in Google AI Studio and Android Studio, Gemini Enterprise Agent Platform and Gemini Enterprise. It’s also now available to everyone in the Gemini app and AI Mode in Search. On behalf of the entire Gemini team, we can’t wait to see what you build.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Gemini for Science: AI experiments and tools for a new era of discovery

    Gemini launches Gemini for Science with new Science Skills in Google Antigravity and experimental Google Labs tools that speed hypothesis generation, computational discovery and literature insights, while gradually opening access to researchers and expanding enterprise R&D support.

    A force multiplier for human ingenuity

    Explore the future of discovery with new Science Skills in Google Antigravity and three new experimental tools on Google Labs. These tools are designed to help accelerate core steps of the scientific method, built with Co-Scientist, Alpha Evolve, Empirical Research Assistance and NotebookLM.

    For centuries, the scientific method has been the greatest engine of human progress. At Google, our mission is deeply rooted in building tools to accelerate it. We believe that a new era of discovery won’t come from narrow, specialized models, but general agents that empower researchers across every scientific field.

    That’s why we are introducing Gemini for Science, a collection of science tools and experiments designed to expand the scale and precision of scientific exploration.

    Today science faces a paradox: our collective knowledge is growing so fast that it’s becoming harder for individual scientists to see the full picture. Scientific breakthroughs often rely upon making creative connections between data, but the time required to do this manually can take weeks or even months. AI can help eliminate this bottleneck and serve as a force multiplier for scientific work by handling complex tasks. This allows researchers to focus on identifying and tackling the most impactful scientific problems and directions that would drive progress.

    Gemini for Science experimental tools on Google Labs include three primary prototypes designed to handle such tasks.

    1. Hypothesis Generation, built with Co-Scientist: Ideation is the heartbeat of science, but no human can synthesise the millions of papers published annually. Hypothesis Generation bridges this gap by simulating the scientific method: it collaborates with researchers to define a research challenge, then uses a multi-agent “idea tournament” to generate, debate and evaluate hypotheses. To ensure absolute rigor, claims are deeply verified and supported by clickable citations.

    2. Computational Discovery, built with AlphaEvolve and ERA (Empirical Research Assistance): Scientific progress is often limited by the number of hypotheses we can realistically test with computational experiments. Computational Discovery, an agentic research engine, is a prototype that solves this by generating and scoring thousands of code variations in parallel. This allows scientists to test novel modeling approaches — for complex fields like solar forecasting or epidemiology — that would take months to navigate manually.

    3. Literature Insights, built with Google NotebookLM: Understanding scientific literature is a core part of all research journeys. Literature Insights searches scientific literature and structures results into tables with custom, searchable attributes for side-by-side analysis. Researchers can use chat to uncover nuances grounded in their curated corpus, and create high-fidelity artifacts such as reports, slide decks, infographics and audio and video overviews. With the power of NotebookLM, Literature insights helps synthesize findings across papers, identify research gaps and uncover areas of opportunity.

    Starting today, we’ll begin gradually opening access to these experiments. Visit labs.google/science to register your interest.

    Beyond the individual experiments, we’re also bringing these advanced AI capabilities to enterprise organizations through Google Cloud. Our enterprise-grade solutions for scientific and industrial R&D are already being used by a range of partners in private preview to drive real-world impact. Companies like BASF are using AlphaEvolve to optimize their supply chains, and Klarna is leveraging it to enhance their machine learning models. In parallel, organizations like Daiichi Sankyo, Bayer Crop Science and the U.S. National Labs (as part of the U.S. Department of Energy's Genesis Mission) are using Co-Scientist to accelerate their research and tackle fundamental scientific challenges. These enterprise-grade tools are demonstrating significant value in their current preview phase. We are excited about the breakthroughs our partners are unlocking and look forward to expanding access to more organizations in the coming months.

    Several validation papers have been already published based on these and other tools. The ERA and Co-Scientist research papers are published today in Nature.

    A scientific workbench on your desktop

    As part of Gemini for Science, we are also launching Science Skills, a specialized bundle that integrates insights from over 30 major life science databases and tools including UniProt, AlphaFold Database, AlphaGenome API and InterPro. Using these skills on agentic platforms like Google Antigravity allows researchers to perform complex and often manual workflows like structural bioinformatics and genomic analyses in minutes rather than hours.

    Our research teams using Science Skills have already seen this speedup in practice. In early testing, our team used Science Skills to perform a complex analysis that normally takes hours in minutes. This led to novel insights about potential mechanisms for a rare genetic disease caused by mutations in the AK2 gene.

    To learn more on how to use Science Skills in Google Antigravity visit antigravity.google/use-cases/science.

    A collaborative effort with the scientific community

    Our commitment to responsibly develop and deploy tools for science begins with the scientific ecosystem. We are collaborating with over 100 institutions — including Stanford University on liver fibrosis, Imperial College London on antimicrobial resistance and a multi-year effort with The Crick Institute — to validate our new systems and tools. To ensure the integrity of AI-generated insights, we’ve built a trusted tester community — ranging from PhD students to industry researchers to Nobel laureates — to stress test our systems against complex real-world challenges.

    In addition, we’ve also created dedicated pilots with leading scientific conferences like ICML, STOC and NeurIPS to develop pioneering tools for agentic peer review and scientific validation such as our experimental Paper Assistant Tool (PAT) and ScholarPeer.

    All of this work builds on a long history of AI advancements. Our specialized AI models are already accelerating progress: AlphaFold has helped over 3 million researchers tackle malaria vaccines and plastic-eating enzymes; and AlphaGenome is helping scientists identify the drivers of disease. These sit alongside everyday tools researchers rely on — from Google Scholar and Earth Engine to Colab, MedGemma, Earth AI and Gemini Deep Research. With our latest Gemini Deep Think release, we continue to improve our core model capabilities on complex scientific tasks. Together, these tools have already become essential parts of the scientific ecosystem, helping researchers organize information and perform complex data analysis at scale.

    As we explore the future of agentic research together, we continue to work towards a future where AI accelerates scientific progress and helps solve our most pressing societal challenges.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Co-Scientist: A multi-agent AI partner to accelerate research

    Gemini launches Co-Scientist, a collaborative AI partner for researchers that generates, debates, and refines scientific hypotheses. The experimental Hypothesis Generation tool is rolling out soon, with early life sciences use cases and future access planned for more researchers and enterprise partners.

    Introducing a collaborative AI partner for researchers to develop new hypotheses in life sciences and beyond.

    Every great scientific breakthrough begins with a single, transformative idea. The spark of discovery relies on a researcher's ability to connect disparate facts and formulate the right hypothesis to test. But in an era of information overload and increasingly complex challenges, the search for these needle-in-a-haystack ideas has become a significant bottleneck for progress.

    We believe AI can help dramatically accelerate the pace of breakthroughs by serving as a dedicated partner in the generation and refinement of breakthrough scientific hypotheses.

    Today, in Nature we published our latest Co-Scientist research, introducing a new multi-agent AI system built with Gemini that iteratively generates, debates, and evolves novel hypotheses for complex scientific problems.

    We are making the Co-Scientist system available to individual researchers through Hypothesis Generation, a new experimental tool jointly developed across Google DeepMind, Google Research, Google Cloud and Google Labs. We’ll begin rolling out in the coming weeks and researchers can register their interest at labs.google/science.

    Since sharing our early research last year, we’ve been developing and testing Co-Scientist together with teams who are leveraging it to tackle challenging problems - from antimicrobial resistance and plant immunity to liver fibrosis. We’re excited to share some of the ways it is already being applied across fundamental biology, the natural sciences, and engineering.

    How Co-Scientist works: A multi-agent system built with Gemini

    Scientific discovery is rarely a straight line; it is a cycle of ideation and hypothesis generation, critique, and refinement. Scientists often reach their most profound insights only after wrestling with a complex problem for days, months, or even years. The core research question behind Co-Scientist was: How can an AI system engage in this rigorous structured thinking for scientific discovery?

    The Co-Scientist AI system is made of a collaborative coalition of specialized agents based on the Gemini model, which we can group into three different phases:

    Generate ideas:

    • Generation agent - Proposes initial focus areas and novel hypotheses grounded in scientific literature and data.
    • Proximity agent - Maps and clusters generated hypotheses to help ensure a diverse, comprehensive exploration of the research space.

    Debate ideas:

    • Reflection agent - Acts as a "virtual peer reviewer," critically evaluating hypotheses for correctness, quality, and novelty.
    • Ranking agent - Orchestrates an “idea tournament”, using pairwise comparisons and simulated scientific debates to prioritize the most promising paths and hypotheses.

    Evolve ideas:

    • Evolution agent - Continuously refines, combines, and builds upon the top-ranked hypotheses in the tournament to help iteratively improve their quality.
    • Meta-review agent - Synthesizes insights from the debates and idea tournament to continuously optimize the system and generates the final research proposal for the scientist to review.

    Orchestrating the agent coalition is a supervisor agent acting as an adaptive planner. Unlike AI models that think linearly, this freeform planner breaks down high-level research goals into executable steps, coordinating agents to run in parallel and explore multiple avenues simultaneously.

    Tournament of ideas: How our system verifies, refines, and ranks hypotheses

    Co-Scientist can explore thousands of research directions. To help find the most impactful ones, we developed the ‘tournament of ideas’. The approach draws from principles used in AlphaGo and AlphaStar - but instead of playing a game, our AI agents hold scientific debates to generate, refine and rank ideas.

    To push the boundaries of novelty while ensuring the hypotheses are robust and testable, the majority of the system's computation is dedicated to verifying these hypotheses. By deeply cross-checking claims against scientific literature and data, the system ensures that claims remain grounded, factually accurate, and logically coherent. The system currently integrates web search and specialized databases like ChEMBL and UniProt to incorporate additional knowledge. It can also leverage advanced specialized models as tools like AlphaFold, which we are testing in select research collaborations.

    This combination of these capabilities helps make Co-Scientist one of the first examples of a reliable multi-agent system for structured scientific thinking, enabling it to deliver tangible results in novel hypothesis generation for complex scientific problems.

    Validating Co-Scientist in the lab, starting with life sciences

    Over the past year, we have collaborated with global experts to evaluate Co-Scientist on complex problems in the life sciences. We have also been previewing an enterprise-grade version with a number of organizations including Daiichi Sankyo, Bayer Crop Science, and the US National Laboratories as part of the Genesis Mission.

    Case studies:

    • Uncovering repurposed medicines to fight liver fibrosis: Co-Scientist helped accelerate Gary Peltz’s search for liver fibrosis treatments, highlighting overlooked drug-repurposing candidates, including one that successfully blocked 91% of a scarring-linked response in lab tests. Published in Advanced Science.
    • Uniting biological toolkits for a new approach to ALS: Co-Scientist helped unite Ritu Raman and Ryan Flynn’s labs around ALS, proposing testable ideas and sparking collaboration on RNA-based approaches.
    • Fast-tracking genetic leads to reverse cellular aging: Biologists Omar Abudayyeh and Jonathan Gootenberg use Co-Scientist to speed research on reversing cellular aging, synthesizing decades of literature and reducing analysis time from months to days.
    • Accelerating discovery of liver disease mechanisms: For Filippo Menolascina, Co-Scientist turned literature overload into high-quality hypotheses for metabolic liver disease, highlighting promising mechanisms and drug combinations.
    • Finding the molecular switches behind new infectious diseases: Clare Bryant uses Co-Scientist to identify proteins causing severe disease in pathogens like flu and COVID-19, narrowing experimental focus.
    • Opening new paths in aging research: At Calico Life Sciences, Matt Onsum and Katherine Labbé use Co-Scientist to tackle aging biology, generating novel hypotheses confirmed in the lab.

    Developing agentic tools with the scientific community

    Co-Scientist was developed in collaboration with researchers from over 100 institutions to test its capabilities and ensure it is a high-quality, useful tool for the scientific community.

    As part of our responsible AI approach, Co-Scientist underwent extensive internal and external safety evaluations, including misuse evaluations in Chemical, Biological, Radiological and Nuclear (CBRN) domains. Custom safety classifiers were developed to flag unethical research goals and mitigate unsafe information.

    We will continue to iterate and develop the tool alongside feedback and collaboration with the scientific community and are excited to be making Co-Scientist available to individual researchers through Gemini for Science. We also look forward to expanding access to more Google Cloud enterprise partners soon.

    We have been deeply inspired by the scientists who have built up our understanding of the world today. And we hope that AI can help researchers to usher in and accelerate a new era of scientific progress.

    Note: Co-Scientist is intended to be a partner in research, not a replacement for scientific or clinical expertise, and users are responsible for any decisions they make using the outputs as they continue their scientific journey.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Introducing Google Antigravity 2.0

    Gemini launches Antigravity 2.0, a standalone agent-first desktop app for macOS, Linux, and Windows. It brings stronger agents, dynamic subagents, async task management, scheduled tasks, JSON hooks, live voice transcription, and a polished new workflow for complex work.

    Google Antigravity 2.0

    Google Antigravity 2.0 is a new, standalone desktop application that fully delivers on a truly agent-optimized experience, available on macOS, Linux, and Windows (download here). Users interact with powerful agents both synchronously and asynchronously, and there is no IDE. While it retains many of the core principles of the Antigravity IDE’s Agent Manager surface, it is a completely separate desktop application. It is available to enterprises, powered by the latest Gemini models, and orchestrates agents capable of completing complex tasks.

    Let us walk through the high level of the new features and capabilities. For more demos and deep dives, check out this blog post.

    At the core is still an agent. You can synchronously have a conversation with an agent, view the artifacts it produces, and provide feedback directly on the artifacts to guide towards your desired outcomes:

    Agent-first layout of Antigravity 2.0.

    These agents are more powerful than before. Some new capabilities include:

    • Dynamic subagents: the main agent can dynamically choose to define and invoke subagents to complete focused subtasks, thereby not polluting the main agents’ context window and allowing for parallelism of work.
    • Asynchronous task management: Tasks and commands are managed and can run asynchronously to not block the main agent from continuing its work.
    • JSON hooks: You can now define hooks in a simple JSON format, allowing you to intercept and control the Antigravity agent’s behavior.

    A brand new way of interacting with agents in Antigravity 2.0 is through Scheduled Tasks, where you can define crons to trigger the invocation of Antigravity agents on a predefined schedule. No longer do you need to manually invoke every agent:

    Set recurring schedules or one-off timers using the /schedule command or Scheduled Tasks.

    We’ve also removed the tight coupling between agent and repository. Instead of agent conversations being grouped by “workspace” (i.e. repository), they are now grouped by “project,” which can correspond to multiple folders and enforce its own agent settings and permissions. This allows you to let agents access more information and tackle more complex tasks, while still providing the knobs to put appropriate, specific guardrails.

    There is a whole list of new fun slash commands:

    • /goal: Run until the specified task is completely finished, not asking for intermediate input from the user.
    • /grill-me: Before starting to implement, ask questions back to align on the specific details of the plan.
    • /schedule: Run an instruction as a one-time timer in the future or on some recurring schedule (via Scheduled Tasks)
    • /browser: We heard the feedback that the agents were still not capable enough to determine exactly when to be using the browser. So for now, we’ve made it such that an explicit slash command controls these behaviors. When used, the agent diligently uses the browser primitives, and when not, it will ignore.

    One favorite new feature is that the voice input (mic icon next to text input boxes) now does live transcription of your words, as opposed to collecting the raw audio file to pass into the model:

    Live voice transcription.

    And of course, there is a long list of UI polish and performance improvements to make Antigravity 2.0 the most powerful and intuitive way to work with agents: sidebar organization, standalone conversations, a sleek review flow for changes, new UI elements for all of the new agent capabilities, and much more.

    Again, check out this deep-dive into a number of the new features in Antigravity 2.0.

    Why a 2.0?

    When we launched the Google Antigravity IDE in November 2026, there was no agent-first GUI surface in the market. We wanted to prove that such a surface worked, at least for software development. So, while the core of the Antigravity IDE was a familiar agent-powered IDE, we introduced the Agent Manager, a second surface that stripped away much of the “IDE” UI. This allowed users to focus on the agent conversations themselves, the artifacts the agents produced, and multi-agent management. Since that launch, millions of developers have adopted the Antigravity IDE, and this agent-first paradigm has become standard across the industry.

    However, we knew the entire time that:

    • At some point, coding would expand to knowledge work, both because the models would become better and because naturally there is a ceiling to the overall value we can provide users by accelerating just coding.
    • In such a world, tying together an IDE and agent-first surface in a single application would be confusing and potentially daunting to those less familiar with code and IDEs. Even without this separation, we have been pleasantly surprised how many people have adopted the Agent Manager in the Antigravity IDE for such non-development tasks, but it is not particularly intuitive.
    • The product, agent harness, and model layers all had to be co-optimized and co-developed, and while agentic coding is a necessary step towards general model intelligence, it is not sufficient.

    We’ve been busy in the last few months so that we could:

    • Integrate the Antigravity product agent harness with the Gemini training and evaluation stacks
    • Rearchitect the product to be agent-first from the ground up, independent of an IDE or other dev-specific concepts such as repositories
    • Round out the Antigravity platform with more surfaces and tooling to be a complete offering (see the launches for the Antigravity CLI, the Antigravity SDK, and more).

    Getting Started with Antigravity 2.0

    If you are brand new to Google Antigravity, visit our download page.

    If you already have installed the Antigravity IDE, when that application next updates, it will automatically update to Antigravity 2.0. At this point, you will be asked if you would like to still keep the Antigravity IDE, which is recommended for developers:

    Update screen from the Antigravity IDE to Antigravity 2.0.

    These two applications will be differentiated on your machine’s dock via icon background. Antigravity 2.0 has the logo on a white background while Antigravity IDE has the logo on top of a black grid:

    Antigravity IDE and Antigravity 2.0 logos, respectively.

    The Antigravity IDE

    Although Antigravity 2.0 is the future, we won’t disrupt your workflows right away. For now, both the Antigravity IDE application itself and the Agent Manager in the Antigravity IDE will remain available. In an upcoming release, we will remove the Agent Manager from the Antigravity IDE, turning the IDE into a purely agent-powered IDE.

    We recommend dual-wielding Antigravity 2.0 with your IDE of choice, whether it is the Antigravity IDE or otherwise. Googlers have already been dual wielding Antigravity 2.0 with a whole host of IDEs! We will have compatible extensions and plugins into other popular IDEs shortly.

    Looking Forward

    Alongside Antigravity 2.0, we have introduced a CLI, SDK, API, and more. We have built integrations with other Google products and tech stacks. And we have the momentum of the model and agent harness being co-optimized.

    This is just the beginning. We look forward to launching remote control, more product integrations, cloud-deployed agents, and much more.

    Original source
  • May 19, 2026
    • Date parsed from source:
      May 19, 2026
    • First seen by Releasebot:
      May 19, 2026
    Google logo

    Gemini by Google

    Making it easier to understand how content was created and edited

    Gemini expands content transparency and verification tools across Search, Chrome, Pixel and Cloud, adding SynthID and C2PA checks, broader media origin verification, and a new AI Content Detection API for businesses.

    We're expanding our tools to help you understand how content was created and edited across the web.

    As generative media becomes more advanced and accessible, it’s helpful to know where content comes from, and whether it’s been altered. Today, we’re expanding our content transparency and verification tools in Search, Gemini, Chrome, Pixel and Cloud, and deepening our partnership with the broader industry.

    Scaling our technology

    Three years ago, we introduced SynthID, our industry-leading digital watermarking technology that embeds imperceptible signals into AI-generated content. Since then, we've integrated SynthID into our generative media models and products, watermarking over 100 billion images and videos and 60,000 years of audio.

    Across a growing number of our generative media tools, we use C2PA Content Credentials, the industry standard that shows how media was created and modified, with or without AI. Pixel 10 was the first smartphone to provide Content Credentials for images in its native camera app, and we are expanding this technology to include video on Pixel 8, 9 and 10 phones in the coming weeks.

    By using this technology at the point of capture, Pixel documents when content has been captured by a camera. In an era of generative media, we believe that identifying authentic, unedited content can be just as important as knowing when a file was made or edited using AI.

    Providing more ways to verify content

    Our goal is to make it easier to learn more about the content you encounter online. That’s why we recently added SynthID verification for image, video and audio to the Gemini app. Already, it’s been used 50 million times globally, and we’re expanding this verification capability to Search today and Chrome over the coming weeks.

    You can learn about an image's origin by using Search features like Lens, AI Mode and Circle to Search, as well as Gemini in Chrome. Just ask, "Is this made with AI?” or “Is this AI generated?”

    We’re also adding verification for C2PA Content Credentials, to easily check if content is an unaltered original from a camera or if it has been modified, and by what tools. This feature is rolling out in the Gemini app starting today, and it will come to Search and Chrome in the coming months. This builds on features like the labels on YouTube that identify AI-generated content and our work with trusted testers on Backstory to make detection tools faster and more reliable.

    Partnering across the industry

    Because digital media travels across many platforms, industry-wide partnership and adoption of robust, interoperable tools is essential. More content across the web will soon carry these imperceptible watermarks, as companies like OpenAI, Kakao and ElevenLabs are bringing SynthID technology to more of their AI-generated content. This builds on our ongoing work to make content origins clearer, like open-sourcing our SynthID text watermarking technology and partnering with NVIDIA to watermark AI-generated video from their Cosmos world foundation models.

    To help more organizations identify AI-generated media, we’re launching a new AI Content Detection API on Google Cloud’s Gemini Enterprise Agent Platform. This gives businesses a powerful way to spot AI content made by both Google and other popular models, helping them decide how to evaluate and manage media across their own platforms — whether that’s for backend operations like sorting feeds and preventing insurance fraud, or for user-facing content like fact-checking and labeling synthetic media. We’re launching with a group of trusted partners and will continue to refine the API based on their feedback.

    We also continue to advocate for global standards for provenance technology as a member of the C2PA steering committee. This ensures that transparency tools built into our devices work seamlessly across the platforms you use every day. For example, Meta — a fellow C2PA Steering Committee member — will start labeling camera-captured media with Content Credentials on Instagram. This means authentic photos and videos shot natively on Pixel phones will soon be recognized and labeled as such when you share them on Instagram.

    We’ve long invested in providing helpful context about the information you find online. Content transparency is a complex challenge, but we’ll keep developing ways to push the technology forward and set a high bar for the industry. Our goal is to empower you with the tools needed to determine the history of any content you encounter.

    Original source
Releasebot

Curated by the Releasebot team

Releasebot is an aggregator of official product update announcements from hundreds of software vendors and thousands of sources.

Our editorial process involves the manual review and audit of release notes procured with the help of automated systems.

Similar to Gemini with recent updates: