- Nov 17, 2025
- Parsed from source:Nov 17, 2025
- Detected by Releasebot:Nov 18, 2025
November 17, 2025
Generative AI on Vertex AI
v1
Veo video generation
Veo 3.1 is Generally Available, and introduces the following models:
- Veo 3.1
- Veo 3.1 Fast
For more information, see the following:
- Generate videos with Veo on Vertex AI
- Generate Veo videos from text prompts
- Generate Veo videos from an image
- Generate Veo videos using first and last frames
- Veo video generation API
- Nov 13, 2025
- Parsed from source:Nov 13, 2025
- Detected by Releasebot:Nov 15, 2025
November 13, 2025
Generative AI on Vertex AI v1
Updated Prompt Caching for Anthropic Claude Models
Prompt caching for Anthropic Claude models now supports a one-hour Time To Live (TTL).
For more information, see Prompt caching.Kimi K2 Thinking is available in Model Garden. This model is a thinking model that excels at complex problem-solving and deep reasoning. Kimi K2 Thinking is available as a managed API in Model Garden. To learn more, see Kimi K2 Thinking.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Nov 12, 2025
- Modified by Releasebot:Nov 15, 2025
November 11, 2025
Generative AI on Vertex AI v1
Anthropic's Claude 3.7 Sonnet
Anthropic's Claude 3.7 Sonnet is deprecated as of November 11, 2025 and will be shut down on May 11, 2026. For more information, see Partner model deprecations.
Original source Report a problem - Nov 10, 2025
- Parsed from source:Nov 10, 2025
- Detected by Releasebot:Nov 12, 2025
- Modified by Releasebot:Nov 15, 2025
November 10, 2025
Colab Enterprise
The default latest Python version is now 3.12. See Python versions.
Original source Report a problem - Nov 7, 2025
- Parsed from source:Nov 7, 2025
- Detected by Releasebot:Nov 11, 2025
- Modified by Releasebot:Nov 15, 2025
November 07, 2025
Vertex AI Agent Engine and Agent Builder debut with Preview observability, testing playground, Gen AI evaluation, Memory Bank revisions, and IAM-based agent identities. GA adds Express mode and a new free tier, boosting production use. Powerful upgrades for testing, deployment, and access control.
Generative AI on Vertex AI v1
Vertex AI Agent Engine
The following features are now available in Preview:
- Configure, manage, and view observability features such as sessions, traces, logs, and events for your agent in the Google Cloud console.
- Use the playground to test and interact with your agent in the Google Cloud console.
- Evaluate your agents using the Gen AI evaluation service's GenAI Client in Vertex AI SDK.
- Create and manage memory revisions for Memory Bank.
- Use Identity Access Management (IAM) to create an agent identity to manage access and authentication when using agents on Vertex AI Agent Engine Runtime.
The following features are now available in GA:
- Express mode support for Vertex AI Agent Engine Runtime.
- Use the new free tier with Vertex AI Agent Engine Runtime. For more information, see Pricing.
Vertex AI Agent Builder
Vertex AI Agent Engine
The following features are now available in Preview:
- Configure, manage, and view observability features such as sessions, traces, logs, and events for your agent in the Google Cloud console.
- Use the playground to test and interact with your agent in the Google Cloud console.
- Evaluate your agents using the Gen AI evaluation service's GenAI Client in Vertex AI SDK.
- Create and manage memory revisions for Memory Bank.
- Use Identity Access Management (IAM) to create an agent identity to manage access and authentication when using agents on Vertex AI Agent Engine Runtime.
The following features are now available in GA:
- Express mode support for Vertex AI Agent Engine Runtime.
- Use the new free tier with Vertex AI Agent Engine Runtime. For more information, see Pricing.
- Nov 4, 2025
- Parsed from source:Nov 4, 2025
- Detected by Releasebot:Nov 6, 2025
- Modified by Releasebot:Nov 15, 2025
November 04, 2025
Generative AI on Vertex AI v1
MiniMax M2
MiniMax M2 is available in Model Garden. This model is is built for end-to-end development workflows and has strong capabilities in planning and executing complex tool-calling tasks. The model is optimized to provide a balance of performance, cost, and inference speed. MiniMax M2 is available as a managed API in Model Garden. To learn more, see MiniMax M2.
Original source Report a problem - Oct 23, 2025
- Parsed from source:Oct 23, 2025
- Detected by Releasebot:Nov 2, 2025
- Modified by Releasebot:Nov 15, 2025
October 23, 2025
Generative AI on Vertex AI v1
The following models are available through Model Garden:
- DeepSeek-OCR
- Qwen3-VL
- Earth AI
- Oct 21, 2025
- Parsed from source:Oct 21, 2025
- Detected by Releasebot:Nov 2, 2025
October 21, 2025
Vertex AI API fixes a streaming misrouting bug where responses could be delivered to the wrong request due to Expect: 100-continue handling. The issue is resolved and Google Gemini models were not affected. This is a user-facing bug fix and stability improvement.
On September 23, 2025, we discovered a technical issue in the Vertex AI API that resulted in a limited amount of responses being misrouted between recipients for certain third-party models when using streaming requests. This issue is now resolved. Google models, e.g. Gemini, were not impacted.
Some internal proxies did not properly handle HTTP requests that have an Expect: 100-continue header, resulting in a desynchronization in a streaming response connection, where a response intended for one request was instead delivered as the response for a subsequent request.
For more information, see Security bulletins.
Original source Report a problem - Oct 20, 2025
- Parsed from source:Oct 20, 2025
- Detected by Releasebot:Nov 2, 2025
- Modified by Releasebot:Nov 12, 2025
October 20, 2025
Colab Enterprise
Visualization cells: You can use visualization cells to generate interactive and editable visualizations from within a Colab Enterprise notebook. You can configure the chart type, aggregation, colors, labels, and other aspects of the visualization to help you explore data and discover insights. For more information, see Use visualization cells.
Original source Report a problem - Oct 16, 2025
- Parsed from source:Oct 16, 2025
- Detected by Releasebot:Nov 2, 2025
- Modified by Releasebot:Nov 15, 2025
October 16, 2025
Generative AI on Vertex AI v1
vLLM TPU, a highly-efficient serving framework for large language models (LLM) that's optimized for Cloud TPU hardware, is available through Model Garden.
Mistral's Codestral 2
You can use Mistral's Codestral 2 in Model Garden.
Original source Report a problem