Meta Release Notes
178 release notes curated from 145 sources by the Releasebot Team. Last updated: May 7, 2026
Meta Products
- May 6, 2026
- Date parsed from source:May 6, 2026
- First seen by Releasebot:May 7, 2026
May 6, 2026
Instagram Platform brings multiple image sending out of beta to all accounts.
Sending multiple images is now out of beta and available to all accounts.
Original source - May 6, 2026
- Date parsed from source:May 6, 2026
- First seen by Releasebot:May 6, 2026
19.2.6 (May 6th, 2026)
React improves Server Components with type hardening and performance boosts.
React Server Components
Type hardening and performance improvements
(#36425 by @eps1lon and @unstubbable)
Original source All of your release notes in one feed
Join Releasebot and get updates from Meta and hundreds of other software products.
- May 6, 2026
- Date parsed from source:May 6, 2026
- First seen by Releasebot:May 6, 2026
19.1.7 (May 6th, 2026)
React adds Server Components type hardening and performance improvements.
React Server Components
Type hardening and performance improvements
(#36425 by @eps1lon and @unstubbable)
Original source - May 6, 2026
- Date parsed from source:May 6, 2026
- First seen by Releasebot:May 6, 2026
19.0.6 (May 6th, 2026)
React ships Server Components type hardening and performance improvements.
React Server Components
Type hardening and performance improvements
(#36425 by @eps1lon and @unstubbable)
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Segment Anything 2 Demo
Meta AI launches Segment Anything 2 demo for video cutouts and effects with a few clicks.
- April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
FAIRChem v2
Meta AI reports FAIRChem v2 introduces UMA, a universal machine learning potential with state-of-the-art accuracy.
FAIRChem v2 introduces the UMA model — a universal machine learning potential for atoms. This is a breaking change from v1 and is not compatible with previous pretrained models.
UMA is trained on 500M+ DFT calculations across molecules, materials, and catalysts — achieving state-of-the-art accuracy with energy conservation and fast inference.
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Seamless Communication
Meta AI releases Seamless Communication, a suite of AI translation models that aims to make cross-language speech more natural, expressive and fast. It includes SeamlessExpressive, SeamlessStreaming and SeamlessM4T v2, and is publicly releasing the models, data and tools.
AI research by Meta
Seamless Communication
A significant step towards removing language barriers through expressive, fast and high-quality AI translation
A family of AI research models that enable more natural and authentic communication across languages
The Seamless Communication models
SeamlessExpressive
A model that aims to preserve expression and intricacies of speech across languages.
SeamlessStreaming
A model that can deliver speech and text translations with around two seconds of latency.
SeamlessM4T v2
A foundational multilingual and multitask model that allows people to communicate effortlessly through speech and text.
Seamless
A model that merges capabilities from SeamlessExpressive, SeamlessStreaming and SeamlessM4T v2 into one.
Preserving prosody
SeamlessExpressive
Translations should capture the nuances of human expression. While existing translation tools are skilled at capturing the content within a conversation, they typically rely on monotone, robotic text-to-speech systems for their output. SeamlessExpressive aims to preserve intricacies of speech; such as pauses and speech rate, in addition to vocal style and emotional tone.
Try the SeamlessExpressive demo
English input: whisper
Please keep the volume down. We just put the baby to sleep.
Spanish output: non-expressive
Spanish output: expressive
English input: sad
Please, don't leave. I hate being here alone.
French output: non-expressive
French output: expressive
Near real-time translation
SeamlessStreaming
SeamlessStreaming is the first massively multilingual model that delivers translations with around two-seconds of latency and nearly the same accuracy as an offline model. Built upon SeamlessM4T v2, SeamlessStreaming supports automatic speech recognition and speech-to-text translation for nearly 100 input and output languages, in addition to speech-to-speech translation for nearly 100 input languages and 36 output languages.
Foundational model for universal translation
SeamlessM4T v2
In August 2023, we introduced the first version of SeamlessM4T, a foundational multilingual and multitask model that delivered state-of-the-art results for translation and transcription across speech and text. Built upon this work, our improved model, SeamlessM4T v2, serves as the foundation for our new SeamlessExpressive and SeamlessStreaming models. It features a new architecture with a non-autoregressive text to unit decoder that delivers improved consistency between text and speech output.
More model details
Learn more about the research behind Seamless Communication
Try the SeamlessExpressive demo
Try the SeamlessExpressive demo to hear how you sound in a different language while maintaining elements of your expression and tone.
Our approach to research
Open innovation
We believe in the power of collaboration and open research to break down communication barriers. To enable our fellow researchers to build upon this work, we’re publicly releasing the full suite of Seamless Communication models, along with metadata, data and tools.
Safety and responsibility
We’re dedicated to promoting a safe and responsible AI ecosystem. We have taken a number of steps to improve the safety of our Seamless Communication models; significantly reducing the impacts of hallucinated toxicity in translations, and implementing a custom watermarking approach for audio outputs from our expressive models.
Resources
More on Seamless Communication
Explore additional resources, including the research paper, model details and more.
Technical overview
More details on how we developed the suite of Seamless Communication models.
Seamless research paper
Methodology, benchmarks, research findings and more from the Seamless Communication project.
AI at Meta blog
Read the full post about the journey, research and milestones achieved.
Download the models
Get access to our suite of publicly available models.
SeamlessExpressive Demo
Hear how you sound in a different language while maintaining elements of your expression and tone.
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Meta Video Seal
Meta AI introduces Video Seal, an open-source video watermarking model that embeds durable, invisible watermarks and hidden messages to help verify video origin even after editing.
Introducing Meta Video Seal
A state-of-the-art, open-source model for video watermarking
With AI-generated content on the rise, verifying video origins is crucial. Video Seal is a neural watermarking model that embeds durable, invisible watermarks - even after video editing.
Imperceptible watermarks
Video Seal embeds an invisible watermark into videos, with the option to include a hidden message.
Robust and Resilient
Video Seal's watermarks are resilient, withstanding distortion efforts such as flipping and blurring.
Origin Verification
The watermark and hidden message can be revealed to verify the video's origin.
How the demo works
- Choose a video from the library to explore the model, or upload your own to get started.
- Embed up to a 6-character hidden message and watermark in your video.
- Use the comparison slider to view an enhanced X-ray visualization of the watermark on the video.
- Stress test the watermark by distorting the video and verifying if the watermark and hidden message remain detectable.
- April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Introducing Meta Motivo
Meta AI releases Meta Motivo, a behavioral foundation model for zero-shot control of a virtual physics-based humanoid. It also adds a new humanoid benchmark, training code, and a demo, with strong whole-body task performance across motion tracking, pose reaching, and reward optimization.
A Meta FAIR release
Introducing Meta Motivo
A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
Try the demo
Download the model
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models
Meta Motivo is a behavioral foundation model pre-trained with a novel unsupervised reinforcement learning algorithm to control the movements of a complex virtual humanoid agent. At test time, our model can be prompted to solve unseen tasks such as motion tracking, pose reaching, and reward optimization without any additional learning or fine-tuning.
Read the research paper
Physics-based environment
The model has learned to control the agent, subject to the physics of its body and environment. Its behaviors are robust to variations and perturbations.
Different prompts for behaviors
The model can be prompted with motions to track, poses to reach, and rewards to optimize.
Zero-shot capability
The model computes the best behavior for each prompt without any additional learning or fine-tuning.
Explore the Research
We are releasing the pre-trained model together with the new humanoid benchmark and the training code. We hope this will encourage the community to further develop research towards building behavioral foundation models that can generalize to more complex tasks, and potentially different types of agents.
Key takeaways
- We introduce a new algorithm grounding the forward-backward unsupervised reinforcement learning method with an imitation objective leveraging a dataset of unsupervised trajectories.
- With this new approach, we train Meta Motivo, a behavioral foundation model that controls a high-dimensional virtual humanoid agent to solve a wide range of tasks.
- We evaluated our model using a new humanoid benchmark across motion tracking, pose reaching, and motion tracking tasks. Meta Motivo achieved competitive performance with task-specific methods, while outperforming state-of-the-art unsupervised RL and model-based baselines.
The Algorithm
Forward-Backward representations with Conditional Policy Regularization (FB-CPR) is a novel algorithm combining unsupervised forward-backward representations [1, 2, 3] with an imitation learning loss regularizing policies to cover states observed in a dataset of unlabeled trajectories. Our algorithm is trained online through direct access to the environment and it crucially learns a representation that aligns the embedding of states, motions, and rewards into the same latent space. As a result, we can train models whose policies are grounded towards useful behaviors, while being capable of zero-shot inference across a wide range of tasks, such as goal-based RL, imitation learning, reward optimization, and tracking.
The final model includes two components: 1) an embedding network that receives as input the state of the agent and it returns its embedding; 2) a policy network parameterized with the same embedding that receives an input the state and returns the action to take.
Inference from various types of prompts
Our algorithm learns a representation that aligns states, rewards, and policies into the same latent space. We can then leverage this representation to perform zero-shot inference for different tasks
Motion tracking
Pose reaching
Reward optimization
Performance improvement during pre-training
Meta Motivo is a behavioral foundation model trained on a SMPL-based humanoid simulated with the Mujoco simulator using a subset of the AMASS motion capture dataset and 30 million online interaction samples.
The videos below illustrate the behaviors corresponding to one motion tracking task (a cartwheel motion), one pose reaching task (an arabesque pose), and one reward optimization task (running) at different stages of the pre-training process. Despite the model not being explicitly trained to optimize any of these tasks, we see the performance improving during training and more human-like behaviors emerge.
Motion tracking
Pose reaching
Reward optimization
Evaluation Results
For evaluation, we have developed a new humanoid benchmark including motions to track, stable poses to reach, and reward functions to optimize. We consider several different baselines including 1) methods that are retrained for each task separately; 2) behavioral foundation models and model-based algorithms. We are releasing the code with the specification files needed to use the simulator and evaluate the model performance on the tasks that are used in the paper.
Quantitative
Our model achieves between 61% to 88% of the performance of top-line methods retrained for each task, while outperforming all other algorithms except for the tracking: in this case it is second best behind Goal-TD3, which cannot be used for reward-based tasks.
Results
Motion tracking
Pose reaching
Reward optimization
Qualitative
To further analyze the performance gap in reward-based and goal-based tasks between Meta Motivo and single-task TD3, we ran a human evaluation with the objective of having a qualitative assessment of the learned behaviors in terms of human-likeness. This evaluation reveals that policies purely optimized for performance (TD3) produce much less natural behaviors than Meta Motivo, which better trades off performance and qualitative behaviors.
Results
Pose reaching
Reward optimization
Understanding the behavioral latent space
One of the crucial aspects of our new algorithm is that it uses the same representation to embed states, rewards, and motions in the same space. We have then investigated the structure of the learned behavioral latent space.
Visualization
Interpolation
In the image above, we visualize the embedding of motions classified by their activity (e.g., jumping, running, crawling) and reward-based tasks. Not only does the representation capture semantically similar motions in similar clusters, but it creates a latent space where rewards and motions are well aligned.
Limitations
Meta Motivo is our first attempt to train behavioral foundation models with zero-shot capabilities across several different prompt types. While the model achieved strong quantitative and qualitative results, it still suffers from several limitations.
Motion tracking
Pose reaching
Reward optimization
Fast movements and motions on the ground are poorly tracked. The model also exhibits unnatural jittering.
Try it yourself
Control the behavior of an embodied virtual agent through various prompts, including creating your own! See how the agent adjusts to changes in physics and environmental conditions, like gravity and wind.
Try the demo
References
- Ahmed Touati, Yann Ollivier, Learning One Representation to Optimize All Rewards, NeurIPS 2021
- Ahmed Touati, Jérémy Rapin, Yann Ollivier, Does Zero-shot Reinforcement Learning Exist?, ICLR 2023
- Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati, Alessandro Lazaric, Yann Ollivier, Fast Imitation via Behavior Foundation Models, ICLR 2024
- Matthew Loper, Naureen Mahmood, Javier Romero, Gerard Pons-Moll, and Michael J. Black, SMPL: a skinned multi-person linear model, ACM Transactions on Graphics 2015.
- MuJoCo - Advanced physics simulation
- Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje, Gerard Pons-Moll, and Michael J. Black. AMASS: archive of motion capture as surface shapes, ICCV 2019.
- https://github.com/facebookresearch/humenv
Acknowledgements
Research Authors
Andrea Tirinzoni, Ahmed Touati, Jesse Farebrother, Mateusz Guzek, Anssi Kanervisto, Yingchen Xu, Alessandro Lazaric, Matteo Pirotta
Project Contributors (alphabetical)
Claire Roberts, Dominic Burt, Jiemin Zhang, Leonel Sentana, Maria Ruiz, Matt Hanson, Morteza Behrooz, Ryan Winstead, Spaso Ilievski, Vincent Moens, Vlad Bodurov, William Ngan
© 2024 Meta
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
DINOv3
Meta AI releases DINOv3, a self-supervised vision foundation model that brings stronger universal backbones, dense image features, and broad performance across detection, segmentation, depth estimation, and tracking. It also expands the model suite with efficient options for diverse deployment needs.
INTRODUCING DINOV3
Self-supervised learning for vision at unprecedented scale
DINOv3 scales self-supervised learning (SSL) for images to produce our strongest universal vision backbones, enabling breakthrough performance across diverse domains.
Download DINOv3
Read the research paper
DINOV3 OVERVIEW
Cutting-edge image representations, trained without human supervision
We scaled unsupervised training to 7B-parameter models and 1.7B image datasets, using a fraction of compute compared to weakly-supervised methods. Despite keeping backbones frozen during evaluation, they achieve absolute state-of-the-art performance across diverse domains.
Read the research paper
Exceptional performance across visual domains
SSL unlocks domains where annotations are scarce or costly. Backbones enable state-of-the-art results for tasks including object detection in web imagery, but also canopy height mapping in satellite and aerial imagery.
Versatile backbone with powerful dense image features
High-resolution dense features from a single DINOv3 backbone enable leading performance across vision tasks, including object detection, depth estimation, and segmentation, without any finetuning.
Efficient model sizes and architectures
We release a comprehensive model suite addressing a wide range of use cases, including broad coverage of ViT sizes and efficient ConvNeXt models for on-device deployment.
PERFORMANCE
Evaluating DINOv3's Performance
DINOv3 sets a new standard in vision foundation models. For the first time, a model trained with SSL outperforms weakly-supervised models on a broad range of probing tasks, from fine-grained image classification, to semantic segmentation, to object tracking in video.
APPLICATIONS
DINO in action
From challenging annotation scenarios to efficiency-critical deployments, see how researchers and developers use DINO to build breakthrough applications.
Download DINOv3
World Resources Institute
WRI measures tree canopy heights with DINO, helping civil society organizations worldwide monitor reforestation.
Learn more
NASA JPL
NASA JPL uses DINO for Mars exploration robots, enabling multiple vision tasks with minimal compute.
Learn more
Orakl Oncology & CentraleSupelec
Orakl Oncology & CentraleSupelec pre-trains DINO on organoid images, producing a backbone to power prediction of patient responses to cancer treatments.
Learn more
APPROACH
Self-supervised pre-training unlocks simple task adaptation
Pre-training data is curated from a large unlabeled dataset. During pre-training, the model learns general-purpose visual representations, matching features between different augmented views of the same image. In post-training, the model is distilled into more efficient models.
A pre-trained DINOv3 model can be easily tailored by training a lightweight adapter on a small amount of annotated data.
DINO Evolution
DINOv3 marks a new milestone in self-supervised training at scale. It builds upon the scaling progress of DINOv2, further increasing the model size x6, and training data x12.
DINO
Initial research proof-of-concept, with 80M-parameter models trained on 1M images.
Read the research paper
Download the model
DINOv2
First successful scaling of a SSL algorithm. 1B-parameter models trained on 142M images.
Read the research paper
Download the model
DINOv3
An order of magnitude larger training compared to v2, with particular focus on dense features.
Read the research paper
Download the model
Explore additional resources
Read the AI at Meta blog
Read the research paper
Download DINOv3
DINOv3 on Hugging Face
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Introducing Meta Segment Anything Model Audio (SAM Audio)
Meta AI launches SAM Audio, a multimodal sound separation model that uses text, visual, and span prompts to isolate target audio from complex mixes. It also adds PE-AV to Perception Encoder and releases a new OSS evaluation set with a judge model.
With SAM Audio, you can use simple text prompts to accurately separate any sound from any audio or audio-visual source.
SAM AUDIO CAPABILITIES
SAM Audio separates target and residual sounds from any audio or audiovisual source—across general sound, music, and speech.
Text prompts
SAM Audio enables you to use text-based prompts to describe the specific target audio they want to separate.
Visual prompts
SAM Audio lets you pick out and separate sounds by clicking on the part of the video where you hear them.
Span prompts
SAM Audio is the first model to introduce span prompting, selecting the desired point in the timespan that contains the target audio.
Multi-modal prompts
SAM Audio provides you flexibility with three unifying prompt modalities (text, visual, timespan).
A NEW WAY TO EXPERIENCE SOUND
State-of-the-art model for all sound
SAM Audio is a state-of-the-art, unified multimodal model that sets a new standard for audio separation, enabling users to isolate general sounds, music, and speech from complex mixtures using intuitive prompts.
PERFORMANCE
State-of-the-art model performance
SAM Audio achieves beyond state-of-the-art performance for all prompting capabilities.
OUR APPROACH
Model architecture
SAM Audio is a generative separation model that extracts both target and residual stems from an audio mixture using text, visual, or temporal prompts. It is powered by a flow-matching Diffusion Transformer and operates in a DAC-VAE latent space, enabling high-quality joint generation of target and residual audio.
OUR APPROACH
Audiovisual Perception Encoder
PE-AV is a new open source model, bringing audio capabilities to Meta's Perception Encoder.
THE SAM AUDIO EVALUATION DATASET
A first-of-its-kind audio separation OSS evaluation set
SAM Audio is releasing a first-of-its-kind OSS evaluation set for prompted audio separation and a judge model highly correlated with human subjective evaluation.
Real world opportunities
"Artificial Intelligence has been a game changer for the disabled community and the use cases for AI-focused start-ups in our ecosystem are vast. By incorporating open source models like SAM Audio into their work, 2GI’s cohort participants can advance their missions while gaining competitive advantage, showcasing that disabled founders are on the cutting edge of technology."
- Diego Mariscal, CEO of 2gether-International
2gether-International empowers disabled founders with resources to launch high-impact startups. In partnership with Meta’s AI for Good team, 2GI leverages open AI models like SAM Audio to accelerate innovation for early-stage, founder-led AI companies.
"For years, Starkey has led the industry in applying artificial intelligence to revolutionize hearing technology. Our ground-breaking work continues to elevate what hearing aids can achieve, particularly in challenging listening situations like noisy environments and overlapping speech. With open models like SAM audio, we see tremendous opportunity to build on our innovations and further our mission to help people hear better and live better."
- Achin Bhowmik, Chief Technology Officer and Executive Vice President of Engineering at Starkey
Starkey is the global leader in hearing technology and the only global American-owned hearing aid manufacturer. Using AI, Starkey transforms hearing aids into smart health and communication devices—delivering innovative, connected solutions that enhance lives
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Introducing Meta SAM 3D
Meta AI introduces SAM 3D, a new single-image 3D reconstruction system that brings objects and humans to life with accurate shape, pose, geometry, texture, and full scene context. It includes SAM 3D Body and SAM 3D Objects and is aimed at practical 3D applications.
AI RESEARCH FROM META
Introducing
Meta SAM 3D
SAM 3D can bring any 2D image to life, accurately reconstructing objects and humans, including their shape and pose.
SAM 3D CAPABILITIES
Accurately reconstruct objects and bodies
Object reconstruction
SAM 3D enables precise 3D reconstruction of objects from real images, while accurately reconstructing their geometry and texture.
Body pose & shape estimation
SAM 3D allows for accurate 3D reconstruction of human body shape and position from a single image.
Scene reconstruction
SAM 3D works on real images in-the-wild, maintaining strong fidelity and quality.
Real world 3D perception
SAM 3D enables full scene reconstructions, placing objects and humans in a shared context together.
The SAM 3D models
SAM 3D contains two state-of-the-art models that enable 3D reconstruction of objects and humans from a single image.
SAM 3D Objects
Single image input
Detailed 3D reconstruction of any masked objects, including geometry and texture
Independent, posed 3D models, suitable for manipulation & interaction
Reconstructions are robust to occlusion in the input image
Position multiple objects into a scene, jointly with SAM 3D Body reconstructions
SAM 3D Body
Single image input
Reconstructs body shape and pose, including unique positions and partial visibility
Suitable for manipulation and interaction
Promptable with joint reconstructions
Position multiple people into a scene, jointly with SAM 3D Objects reconstructions
Designed for practical 3D applications
Enhancing Facebook Marketplace shopping
Place a 3D AR overlay of home decor, like a lamp or a table, from Marketplace in your room to visualize the style and fit within your space before purchasing.
Experiment with SAM 3D today
OUR APPROACH
Model architecture
SAM 3D is a suite of two models: SAM 3D Body and SAM 3D Objects:
- The SAM 3D Body model architecture uses a transformer-based encoder-decoder architecture to predict 3D human pose and mesh parameters directly from images, enabling accurate and interactive pose regression.
- The SAM 3D Objects model employs two stages of DiTs—first generating 3D object shape and pose, then refining texture and details—to deliver high-fidelity, realistic 3D reconstructions.
BENCHMARKS
State-of-the-art performance
SAM 3D achieves beyond state-of-the-art performance across a series of benchmarks for both its models.
THE SAM 3D ARTIST OBJECT DATASET
A dataset of diverse and high-quality 3D meshes
A new first-of-its-kind evaluation set for visually grounded 3D reconstruction in real-world images, with diverse images and objects that are significantly more challenging than existing 3D benchmarks. This represents a new way to measure research progress in 3D, and pushes the field away from curated images/synthetic assets and towards real-world perception and common-sense 3D understanding.
More from Segment Anything
SAM 3
With SAM 3, you can use text and visual prompts to precisely detect, segment and track any object in an image or video.
Original source - April 2026
- No date parsed from source.
- First seen by Releasebot:Apr 23, 2026
Introducing Meta Segment Anything Model 3 (SAM 3)
Meta AI adds SAM 3, a promptable segmentation model that uses text, exemplars and visual clicks to identify, segment and track objects in images and videos, with state-of-the-art performance and upcoming support for Instagram Edits and Vibes.
With SAM 3 you can use text and visual prompts to precisely identify, segment, and follow any object in images or videos—coming soon to Instagram Edits and Vibes on the Meta AI app.
SAM 3 CAPABILITIES
Advanced features, simple prompts
Using open vocabulary text or visual prompts, SAM 3 can detect, segment and track all matching objects in images and videos.
Text prompts
You can prompt SAM 3 with words and short phrases, to mask all objects matching the text description.
Exemplar Prompts
With exemplar prompts, you can simply draw a box around an example of the object you want to segment, and SAM 3 will mask all objects matching the outlined example.
Visual prompts
With all the capabilities of SAM 2, SAM 3 allows you to segment objects using positive and negative clicks.
Interactivity
If SAM 3 ever misses an object or makes a mistake, you can easily add follow-up prompts to help further guide the model.
BENCHMARKS
State-of-the-art performance
SAM 3 is state-of-the-art across all text and visual segmentation tasks in both images and videos. The model additionally maintains all the performance and functionality of SAM 2.
Designed for real-world applications
Edits is the new video creation app by Instagram that helps creators make great videos on their phones. Creators will soon be able to use SAM 3 in Edits to quickly apply effects to people or objects in their videos, helping their creations stand out.
ENHANCED CAPABILITIES
Evolution of SAM
The Segment Anything models build on each other, offering increasingly advanced capabilities for developers and researchers to create, experiment and uplevel media workflows.
SAM 3
Detect, segment and track every example of any object category in an image or video, using text or examples
Segment an object from a click
Track segmented objects in videos
Refine prediction with follow up clicks
Detect and segment matching instances from text
Refine detection with visual examples
SAM 2
Segment and track any object in any image or video using click, box or mask prompts
Segment an object from a click
Track segmented objects in videos
Refine prediction with follow up clicks
SAM 1
Segment any object in any image with as little as a single click
Segment an object from a click
Refine prediction with follow up clicks
Try SAM 3 today
Experiment with SAM 3 in the Segment Anything Playground.
OUR APPROACH
New unified architecture
SAM 3 is built as a unified, promptable model that enables segmentation with language, exemplars and visual prompts across images and videos. It leverages a large-scale, diverse training dataset and a powerful perception encoder backbone to achieve state-of-the-art performance in segmentation and tracking using open-vocabulary short text phrases and visual prompts.
More from Segment Anything
SAM 3D enables precise reconstruction and analysis of 3D people and objects, providing new opportunities for spatial understanding and applications.
Original source - Apr 22, 2026
- Date parsed from source:Apr 22, 2026
- First seen by Releasebot:Apr 22, 2026
April 22, 2026
Instagram Platform adds richer media engagement metrics, expanded views reporting, collaborative media access, paid partnership labels at publish time, and new like and comments actions through the IG User API to better measure, publish, and manage content engagement.
New Media Engagement Fields
Applies to all versions.
Three new fields are now available on the IG Media endpoint for apps using Facebook Login, giving deeper insight into how content is being distributed:
- reposts_count — Number of times the media has been reposted by other users
- saved_count — Number of times the media has been saved
- shares_count — Number of times the media has been shared
Endpoint
- GET /{ig_media_id}?fields=reposts_count,saved_count,shares_count
Aggregated Metrics
Applies to all versions.
Introducing three new aggregated metrics that provide comprehensive engagement totals across all surfaces, including boosted media and Facebook crossposted content. These values match what users see in the Instagram Insights Dashboard and are available through both the IG Media and Insights endpoints:
- total_like_count / total_likes — Aggregated likes across Instagram, Facebook, and promoted media
- total_comments_count / total_comments — Aggregated comments across all surfaces
- total_views_count / total_views — Aggregated views across all surfaces (video media only)
Endpoints
- GET /{ig_media_id}?fields=total_like_count,total_comments_count,total_views_count
- GET /{ig_media_id}/insights?metric=total_likes,total_comments,total_views
Enhanced Views Metrics
Applies to all versions.
- facebook_views expansion: The facebook_views metric now supports Feed post, Reels, and Story media types, and coverage has expanded to include both crossposted plays and cross-app recommended plays.
- Cross-platform view_count: The view_count field now returns combined Instagram and Facebook views for crossposted video content when looking up another user's public media via the Business Discovery API.
Endpoints
- GET /{ig_media_id}/insights?metric=facebook_views
- GET /{ig_user_id}?fields=business_discovery.username({username}).media{view_count}
Collaborative Media API
Applies to all versions.
Introducing the Collaborative Media API on the IG User. The API lets your app retrieve all media where your app user is an accepted collaborator, making it easier to track and measure performance of collaborative content across partnerships.
Endpoints
- GET //collaborative_media — Fetch all collaborative media for the app user
- GET /?fields=collaborative_media_search.media_id() — Search for a specific collaborative media
Partnership Ads Label
Applies to all versions.
The Content Publishing API now supports adding the "Paid partnership" disclosure label at publish time, eliminating the need to manually add partnership labels after publishing. Two new parameters have been added to the create media endpoint:
- branded_content_sponsor_ids — Tag up to 2 brand partners by their Instagram user ID
- is_paid_partnership — Enable the "Paid partnership" label with or without naming specific brands
Endpoint
- POST //media with branded_content_sponsor_ids and is_paid_partnership parameters
See the updated post-level permissioning guidance for details on brand approval flows.
Like Media and Comments API
Applies to all versions.
Introducing the Like Media and Comments API on the IG User. The API lets your app like and unlike Instagram media and comments on behalf of your app users, enabling engagement workflows such as responding to comments on business posts or engaging with collaborative content.
Endpoints
- POST //likes with media_id={media_id} or comment_id={comment_id} — Like a Feed post, Reel, comment, or reply
- DELETE //likes with media_id={media_id} or comment_id={comment_id} — Unlike a Feed post, Reel, comment, or reply
The new instagram_manage_engagement permission is required. Stories and content from private accounts are not supported.
Original source - Apr 20, 2026
- Date parsed from source:Apr 20, 2026
- First seen by Releasebot:Apr 18, 2026
- Modified by Releasebot:May 6, 2026
[email protected] (April 16, 2026)
React adds ESLint v10 support and performance improvements by skipping compilation for non-React files. It also brings compiler lint upgrades with better set-state-in-effect detection, improved ref validation, and clearer error reporting for faster debugging.
This release adds ESLint v10 support, improves performance by skipping compilation for non-React files, and includes compiler lint improvements including better set-state-in-effect detection, improved ref validation, and more helpful error reporting.
Add ESLint v10 support
- Add ESLint v10 support. (@azat-io in #35720)
- Skip compilation for non-React files to improve performance. (@josephsavona in #35589)
- Fix exhaustive deps bug with Flow type casting. (@jorge-cab in #35691)
- Fix useEffectEvent checks in component syntax. (@jbrown215 in #35041)
- Improved set-state-in-effect validation with fewer false negatives. (@jorge-cab in #35134, @josephsavona in #35147, @jackpope in #35214, @chesnokov-tony in #35419, @jsleitor in #36107)
- Improved ref validation for non-mutating functions and event handler props. (@josephsavona in #35893, @kolvian in #35062)
- Compiler now reports all errors instead of stopping at the first. (@josephsavona in #35873–#35884)
- Improved source locations and error display in compiler diagnostics. (@nathanmarks in #35348, @josephsavona in #34963)
Curated by the Releasebot team
Releasebot is an aggregator of official release notes from hundreds of software vendors and thousands of sources.
Our editorial process involves the manual review and audit of release notes procured with the help of automated systems.
Similar to Meta with recent updates:
- xAI release notes69 release notes · Latest May 6, 2026
- Cursor release notes76 release notes · Latest May 11, 2026
- Perplexity release notes24 release notes · Latest May 11, 2026
- OpenAI release notes648 release notes · Latest May 9, 2026
- Notion release notes125 release notes · Latest May 11, 2026
- Anthropic release notes557 release notes · Latest May 13, 2026