Leaderboard Updates & Release Notes
26 updates curated from 1 source by the Releasebot Team. Last updated: May 25, 2026
- May 25, 2026
- Date parsed from source:May 25, 2026
- First seen by Releasebot:May 25, 2026
May 25, 2026
Leaderboard adds qwen3.7-max-20260517 to the Code leaderboard.
qwen3.7-max-20260517 has been added to the Code leaderboard.
Original source - May 22, 2026
- Date parsed from source:May 22, 2026
- First seen by Releasebot:May 23, 2026
May 22, 2026
Leaderboard adds recraft-v4.1-pro and recraft-v4.1-utility-pro to the Text-to-Image leaderboard.
recraft-v4.1-pro and recraft-v4.1-utility-pro has been added to the Text-to-Image leaderboard.
Original source All of your release notes in one feed
Join Releasebot and get updates from Arena AI and hundreds of other software products.
- May 21, 2026
- Date parsed from source:May 21, 2026
- First seen by Releasebot:May 22, 2026
- Modified by Releasebot:May 23, 2026
May 21, 2026
Leaderboard adds hidream-o1-image to Text-to-Image and grok-imagine-image-quality to Image Edit leaderboards.
hidream-o1-image has been added to the Text-to-Image leaderboard.
grok-imagine-image-quality (20260519) has been added to the Image Edit leaderboard.
Original source - May 19, 2026
- Date parsed from source:May 19, 2026
- First seen by Releasebot:May 19, 2026
May 19, 2026
Leaderboard adds gemini-3.5-flash to Text and Code leaderboards.
gemini-3.5-flash has been added to Text and Code leaderboards.
Original source - May 14, 2026
- Date parsed from source:May 14, 2026
- First seen by Releasebot:May 14, 2026
- Modified by Releasebot:May 19, 2026
May 14, 2026
Leaderboard adds qwen3.7-max-preview, granite-4.1-8b, trinity-large-thinking and gpt-5.5-xhigh to its leaderboards.
qwen3.7-max-preview has been added to the Text and Vision leaderboards.
granite-4.1-8b has been added to the Text and Code leaderboards.
trinity-large-thinking has been added to the Text leaderboard.
gpt-5.5-xhigh (codex-harness) has been added to the Code Arena leaderboards.
Original source - May 12, 2026
- Date parsed from source:May 12, 2026
- First seen by Releasebot:May 13, 2026
May 12, 2026
Leaderboard adds Battles in Direct votes to its leaderboards, with backfilled March to May data coming over the next month. The update boosts daily vote volume, steadies ranks faster, and corrects two voting biases across text, vision, search, and document arenas.
Votes collected from Battles in Direct now count toward leaderboards.
Since March 2026, we've been running an experiment where 10% of direct-chat sessions on Arena are converted into battles between two randomly sampled models. Starting today, votes collected this way will be included in leaderboards. Over the next month, we'll also gradually backfill Battles in Direct votes collected from March through May, so models active during that time period may see vote counts increase and scores shift slightly.
This change meaningfully increases daily vote volume, which tightens confidence intervals and stabilizes ranks faster. The distribution of prompts also shifts: direct battles arrive with prior conversational context and skew toward longer queries, harder prompts, coding, and instruction following. Direct battles where the user votes skip are not used, so every direct-battle vote is a decisive A-or-B preference.
We observed two new biases in the Battles in Direct voting data and corrected for them in the Bradley-Terry fit for the text, vision, search, and document arenas. The first is a position bias favoring Model A (the model on the left); the second is an advantage given to models that share an organization with the prior turns of context. To correct each bias, we added a corresponding feature to the model: is_direct_battle and same_org_indicator, respectively. The model fitting learns coefficients for these features that absorb the biases.
Original source - May 8, 2026
- Date parsed from source:May 8, 2026
- First seen by Releasebot:May 10, 2026
May 8, 2026
Leaderboard adds gpt-5.5-instant to Text, Vision, and Document leaderboards and ernie-5.1 to Search.
gpt-5.5-instant has been added to the Text, Vision, and Document leaderboard.
ernie-5.1 has been added to Search leaderboard.
Original source - May 7, 2026
- Date parsed from source:May 7, 2026
- First seen by Releasebot:May 10, 2026
May 7, 2026
Leaderboard adds Gemma 4 and preview Qwen and Hunyuan models to its Vision and Code leaderboards.
gemma-4-31b and gemma-4-26b-a4b has been added to the Vision leaderboard.
qwen3.6-max-preview and hunyuan-hy3-preview has been added to Code leaderboard.
Original source - May 6, 2026
- Date parsed from source:May 6, 2026
- First seen by Releasebot:May 10, 2026
May 6, 2026
Leaderboard adds grok-imagine-image-quality and gpt-5.5-instant to multiple AI leaderboards.
grok-imagine-image-quality has been added to the Text-to-Image leaderboard and Image edit leaderboard.
gpt-5.5-instant has been added to the Vision, Text and Document leaderboards.
Original source - May 5, 2026
- Date parsed from source:May 5, 2026
- First seen by Releasebot:May 10, 2026
May 5, 2026
Leaderboard adds uni-1.1 models, updates gpt-image-2 scores, and expands Code Arena with gemma-4 models.
uni-1.1-max and uni-1.1 has been added to the Text-to-Image leaderboard and Image Edit leaderboard.
gpt-image-2 (medium) has been updated on the Image leaderboards. The updated score reflects performance across Arena's full user base. As public usage scales, scores stabilize to reflect a larger sample of real-world use.
gemma-4-26b-a4b & gemma-4-31b have been added to the Code Arena leaderboard.
Original source - May 1, 2026
- Date parsed from source:May 1, 2026
- First seen by Releasebot:May 10, 2026
May 1, 2026
Leaderboard adds mimo-v2-omni, grok-4.3 and trinity-large-thinking to its Vision, Text, Search and Code leaderboards.
mimo-v2-omni has been added to the Vision leaderboard.
grok-4.3 has been added to the Text, Search, Code and Vision leaderboards.
trinity-large-thinking has been added to the Text and Code leaderboards.
Original source - Apr 30, 2026
- Date parsed from source:Apr 30, 2026
- First seen by Releasebot:May 10, 2026
April 30, 2026
Leaderboard adds qwen3.6-max-preview, hunyuan-hy3-preview and glm-5v-turbo to its Text and Vision leaderboards.
qwen3.6-max-preview has been added to the Text leaderboard.
hunyuan-hy3-preview has been added to the Text leaderboard.
glm-5v-turbo has been added to the Vision leaderboard.
Original source - Apr 29, 2026
- Date parsed from source:Apr 29, 2026
- First seen by Releasebot:May 10, 2026
April 29, 2026
Leaderboard adds ernie-5.1-preview to the Text leaderboard.
ernie-5.1-preview has been added to the Text leaderboard.
Original source - Apr 27, 2026
- Date parsed from source:Apr 27, 2026
- First seen by Releasebot:May 10, 2026
April 27, 2026
Leaderboard adds gpt-5.5-high to Code, Text, Expert, Search, Document and Vision, plus claude-opus-4-7 to Search.
gpt-5.5-high has been added to Code, Text, Expert, Search, Document, and Vision leaderboards.
claude-opus-4-7 has been added to the Search leaderboard.
Original source - Apr 23, 2026
- Date parsed from source:Apr 23, 2026
- First seen by Releasebot:May 10, 2026
April 23, 2026
Leaderboard adds qwen-image-2.0-pro-2026-04-22 to Text-to-Image and Image Edit, plus DeepSeek models to Text and Code leaderboards.
qwen-image-2.0-pro-2026-04-22 has been added to the Text-to-Image and Image Edit leaderboards.
deepseek-v4-pro, deepseek-v4-pro-thinking, and deepseek-v4-flash-thinking has been added to the Text and Code leaderboards.
Original source
Curated by the Releasebot team
Releasebot is an aggregator of official product update announcements from hundreds of software vendors and thousands of sources.
Our editorial process involves the manual review and audit of release notes procured with the help of automated systems.
Similar to Leaderboard with recent updates:
- Claude updates93 release notes · Latest May 28, 2026
- Slack updates89 release notes · Latest May 21, 2026
- Polestar 2 updates21 release notes · Latest May 5, 2026
- Notion updates107 release notes · Latest May 27, 2026
- Rippling updates23 release notes · Latest Apr 28, 2026
- Claude Code updates327 release notes · Latest May 29, 2026