AI Image and Video Release Notes
Release notes for AI image, text-to-video, image editing, and generative media platforms
Products (11)
Latest AI Image and Video Updates
- May 11, 2026
- Date parsed from source:May 11, 2026
- First seen by Releasebot:May 12, 2026
How Real-Time Video Generation Is Changing Online Interaction
Runway AI launches Runway Characters, an audio-driven real-time video model built on GWM-1 that creates expressive conversational characters from a single image. It ships as an API and is live in the web app for interactive support, storytelling and branded experiences.
For most of the internet's history, the interaction model has been the same: you type something, and a result comes back. Web searches. Emails. Product pages. Chatbots. Large language models made the exchange more fluid and conversational, but at its core, it's still text in a box.
We think that era is ending.
The future of online interaction is real-time video, generated on the fly – responsive, personalized, alive.
What Real-Time Video Generation Actually Means
Real-time video generation refers to AI models that synthesize video frame by frame, live, in response to user input – rather than producing a completed output all at once.
Pre-rendered video is static: it was made once, and you watch it. Generative tools like Gen-4.5 let you create video from scratch, but the output is still an artifact you produce and then share. Today's top generative models are limited by their architecture: no matter how complex the prompt, or how sophisticated the output, the model is still predicting and generating the output you want all at once.
Real-time video generation is interactive. The model generates what you see as you see it, responding to what you say and what you do. Every frame is synthesized in the moment, conditioned on the current context of the interaction.
This is only possible because of a fundamental shift in how we think about video models. At sufficient scale, video models go beyond generating plausible-looking footage – they begin to develop an internal representation of how the world works. These micro-interactions—how faces move when people speak, how expressions change when emotions shift, how physics propagates when forces act on objects—drive much of how we experience the world around us.
GWM-1, which we launched last December, is our first general world model family – an autoregressive model that generates frame by frame, runs in real time and can be controlled interactively with actions: camera pose, speech, robot commands. It comes in three variants today: Runway Characters for conversational characters, GWM-Worlds for explorable environments and GWM-Robotics for robotic manipulation. These are distinct post-trained models now; we're working toward unifying them under a single base.
What This Unlocks
The near-term applications of real-time video generation touch almost every domain where digital interaction matters.
Gaming and Interactive Entertainment
NPCs in games today are largely static, with branching dialogue trees, pre-recorded voice lines and scripted behaviors. Real-time video generation makes it possible to build characters that actually listen and respond, holding genuine conversations with players about the world they inhabit. Imagine a guide who can answer any question about lore, or a sports simulation responding live to your choices.
Beyond traditional gaming, real-time video generation opens up new territory for fan platforms, creator experiences and interactive narrative.
Learning and Education
The case for interactive video in education is straightforward: a personalized tutor who reacts to confusion, adjusts explanations in real time and responds to where you actually are in your understanding is categorically more effective than a static lesson. Real-time video generation makes it possible to deploy that kind of experience at scale, across languages, grade levels, subjects and time zones.
There's also an access dimension. A real-time video experience is available at 3am, in any language, with infinite patience. For a student who needs to work through a concept 50 times without embarrassment, or one who's far from any formal support infrastructure, that matters.
Training and Simulation
Some of the most consequential conversations people have in the workplace can't be fully prepared for in a classroom. Real-time video generation enables realistic practice for high-stakes scenarios: an upset customer who escalates, a nervous interviewee who needs to be put at ease, a manager who pushes back on your proposal. For use cases like sales coaching, clinical simulation or law enforcement de-escalation training, real-time video generation is key.
Customer Experience and Brand
The current state of the art for AI customer support is a text chatbot with a company logo on it. Real-time video generation clears that bar significantly by presenting a responsive, expressive presence that engages customers more like a human interaction and less like a form submission. For brands with existing characters or mascots, the opportunity is especially interesting: IP that's existed as a static asset can become genuinely interactive.
Characters: Real-Time Video Generation, Available Now
The most tangible example of real-time video generation we have today is Runway Characters: an audio-driven interactive video generation model, built on GWM-1, that produces fully expressive conversational characters from a single reference image.
The model handles what makes a face feel alive: natural eye movements, lip-sync, facial expressions, gestures during speaking and listening. It sustains quality across extended conversations. And because it ships as an API, developers can create a branded character that can pull from your product catalog, open a support ticket and escalate to a human agent. Companies like BBC, R/GA, Silverside and Supersonik are already building with Runway Characters.
Characters is live now for developers at dev.runwayml.com and available in the Runway web app for anyone who wants to experience it directly.
Nothing quite like this has existed before, which means the questions around responsible deployment are ones we're actively working through. We've written about our approach to identity, consent and transparency—and what responsible deployment looks like—here.
What Comes Next
We wrote last year that we expect to achieve human-scale world simulation within half a decade. Within a decade, we expect to simulate physics and biology accurately enough to meaningfully address a significant percentage of today's scientific challenges.
That's a long arc, but the near-term steps are already visible, and real-time generation will continue to improve. The consistency across extended interactions will deepen. The action spaces these models can respond to will expand.
Enterprises can build with real-time video generation today. To learn more, visit runwayml.com/enterprise or contact our sales team.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
Announcing Avatar V: The most realistic AI avatar model in the world
HeyGen launches Avatar V, its most advanced AI avatar model, bringing highly realistic, identity-consistent video generation from a simple 15-second recording. It adds lifelike motion, multi-angle stability, long-form performance, and studio-quality results.
Summary
Introducing Avatar V, HeyGen’s most advanced AI avatar model. It delivers unmatched realism and identity consistency. Create studio-quality videos from a simple 15-second recording with lifelike motion, multi-angle stability, and long-form performance.
Every few months, a new AI model ships with a bold claim about realism. The demos look impressive, the side-by-side comparisons are compelling, and the launch post makes it sound like everything before it was a rough draft. Then you actually use it, and that familiar feeling sets in: the slight uncanny quality, the face that drifts, the avatar that starts as you and quietly stops being you twenty seconds in.
We've seen this too. We built around it.
Today, we're introducing Avatar V, HeyGen's most advanced AI avatar model and the most realistic in the world.
What is Avatar V?
Avatar V is HeyGen's next-generation avatar model and the foundation everything else in HeyGen runs on now.
Most avatar systems optimize for a single impressive moment: the screenshot, the short clip, the controlled demo environment where everything is working in the model's favor. They look great in two seconds and fall apart in twenty. Avatar V was built to do something harder.
It was built to hold.
What that means in practice is that one short recording from you generates studio-quality video that maintains your face, your voice, and your presence across angles, looks, and runtime. Not just for the opening shot, but for the whole thing, from the first frame to the last.
We've been training avatar models for years and going deep on the specific problem of human identity in video: the micro-expressions, the natural movement, the quality threshold that separates a good talking head from footage that could genuinely pass as real. Avatar V is the result of that work compounding over time.
Why it’s the best model
The AI video market has a quality problem that most people describe wrong. They say the output looks AI, but what they actually mean is it doesn't look like the person it's supposed to be.
Identity drift is the real problem.
An avatar that starts as you and slowly stops being you. A face that holds in static shots but breaks under motion. A model that generates one great look but can't give you another without becoming someone else in the process. These aren't edge cases. They're the norm.
Avatar V solves identity consistency at the model level, not as a post-processing patch applied after the fact. We trained it specifically on the hard cases: multi-angle footage, long-form content, varied looks generated from a single input recording. The result is an avatar that stays true to who you are across every variable we could throw at it.
Plus, companies like Synthesia still requires studio time to get anywhere close to this output quality. HeyGen does not. Rated number one for most realistic avatars on G2, Avatar V makes that claim stronger than it's ever been.
How it works
Record a 15-second clip
That's the input. Fifteen seconds, no professional camera setup, no studio lighting, no crew required. You need a phone and a few seconds of your time.
From that reference clip, Avatar V builds a complete model of your identity, not just what you look like in one frame, but how you move, how your face settles naturally, and what makes you recognizably you across different contexts. Everything it generates afterward comes from that foundation, which is what makes the output so consistent.
That gap between what goes in and what comes out is exactly where Avatar V does its work.
Multi-angle consistency
Real video isn't a single locked-off shot. It moves, it cuts, and the camera finds you from different positions and angles, and if the avatar can't hold up across that motion, the entire thing falls apart immediately.
Avatar V holds. Your avatar maintains consistency across different shots and angles without drift, without inconsistency, and without the uncanny valley breaking through at the worst possible moment. The face that appears at the top of your video is the same face that appears at the bottom, from any angle the output requires.
This is genuinely difficult to do well. Most models treat each frame as an isolated generation problem. Avatar V treats your identity as a constant and builds outward from there.
Multi-look generation
Every video you've ever recorded came with baggage you didn't choose. The outfit you happened to be wearing that day. The background behind you. The lighting in the room. If you wanted to look different, the answer was always the same: go record again.
Avatar V changes that entirely.
With Avatar IV, what you recorded was what you got. The performance and the appearance were locked together. Avatar V is the first model to separate them.
You record yourself once, naturally. Avatar V captures your real movements, your real expressions, and the specific way you carry yourself when you're actually talking. That performance becomes the foundation. Then you choose how you appear: a different outfit, a different setting, a different version of yourself entirely. Your motion stays real. Everything else is yours to decide.
This matters for real work. You might want one look for a sales video, another for a company-wide announcement, and another for a product walkthrough. With Avatar V, you don't film three separate times to get three distinct results. You record once and choose from there.
Long-form stability
Short clips are easy. Long-form is where most avatar models quietly fall apart.
Avatar V maintains your identity across your longest videos, delivering the same face, the same voice, and the same presence from the first second to the last without degradation or drift. No moment where the avatar stops looking like you and starts looking like a close approximation of someone adjacent to you.
This is the capability that makes Avatar V genuinely useful for the content that matters most: full training modules, product walkthroughs, onboarding videos, and the kinds of recordings that used to require a camera crew and a full studio day to produce.
Pair it with Seedance 2.0
Avatar V handles the message. Seedance 2.0 earns the watch.
Once you have your Avatar V recording, it becomes the foundation for a scroll-stopping video when paired with Seedance 2.0. Avatar V delivers your message with the stable, long-form presence that professional video requires. Seedance generates the cinematic hooks and motion-first scenes that pull people in before you say a word. They cover opposite ends of the same video: the opening that demands attention and the body that holds it.
Most people think about the hook and the message as separate production problems. With Avatar V and Seedance 2.0, they both start from the same 15-second clip. You record once to create cinematic videos starring you.
What comes next
Video is the highest-trust medium for human communication, and when it works, it works better than anything else. When it looks fake, trust collapses immediately and there's no recovering it.
Avatar V was built on a single belief: the output has to be good enough that you'd be willing to put your name on it. Not good for AI. Just good.
We think we're there.
Try Avatar V today
Original source All of your release notes in one feed
Join Releasebot and get updates from Runway AI and hundreds of other software products.
- May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
What’s new at HeyGen: March 2026
HeyGen releases March updates that make AI video creation more on-brand, interactive, and builder-friendly, with Brand Systems, interactive video, new Video Agent styles, 4K enhancement, stronger enterprise controls, and expanded API, fal, and MCP access.
Summary
HeyGen’s March updates focus on making AI video creation feel truly yours by improving branding speed, adding interactivity, enhancing quality, and expanding tools for builders.
Every tool eventually hits the same wall. It can do the thing, but making the thing feel like yours takes extra work. Your brand colors, your style, your level of polish. March was about closing that gap. We shipped seven updates that make HeyGen videos look better, feel more on-brand, and reach more developers out of the box.
Brand Systems: One URL, every asset on brand
Paste your company's website URL and HeyGen extracts your logo, typography, and color palette automatically. That's your Brand System. It works across Templates, AI Studio, and Video Agent.
In Templates, you apply your brand with a single click. In AI Studio, elements default to your brand fonts and colors. In Video Agent, you can prompt it to use your Brand System and it generates motion graphics that actually match your identity. Set it up once and stop manually adjusting colors on every project.
This is the kind of thing that sounds simple but changes your workflow. No more eyedropping hex codes from your website or hunting for the right font file. Your brand just shows up.
Interactive video experiences
Static video is a one-way street. Interactive Video turns it into a conversation. You can now add in-video quizzes for real-time knowledge checks, branching paths for choose-your-own-adventure scenarios, and CTA buttons that link externally or jump between chapters.
This is a big deal for training and education teams. Build a compliance module where learners make decisions and see consequences. Create onboarding flows that adapt based on role. The branching logic means one video can serve multiple paths without duplicating content.
It also exports to SCORM, so it plugs directly into your existing LMS. Available on Business and Enterprise plans.
Styles for Video Agent
Video Agent can now generate videos in over 100 curated visual styles. Before you generate, pick a style that controls typography, font pairings, color systems, motion pacing, transitions, animation timing, and layout composition.
The same script with the same avatar produces a completely different video depending on which style you choose. A product demo can feel polished and corporate or bold and editorial without changing a single word of the prompt. This makes Video Agent dramatically more flexible for teams producing content across different brands, campaigns, or audiences.
4K video enhancement
You can now upscale any video to 4K, powered by Topaz Starlight Precise 2.5. There are two engines: Standard is faster and uses fewer credits, while Precise delivers sharper results for when quality matters most.
Beyond resolution, you also get frame interpolation that takes footage from 24fps up to 120fps. Find it under the Apps tab as "Upscale Video." Available on all plans with credits.
If you've been sitting on older content that looks dated at lower resolutions, this is the fastest way to bring it up to standard.
Enterprise admin controls
Enterprise teams now have granular control over who can do what. Admins can manage public invite links, sub-workspace access, and default sharing permissions. Video distribution controls let you restrict downloads, public publishing, and social sharing at the org level.
Feature-level settings go further: restrict custom avatar creation, AI-generated avatars, public avatar availability, and Brand Kit creation. The Members tab now supports search, active/pending filtering, and exportable status lists. Sub-workspace management got a refreshed UI with cleaner billing setup and pagination.
All changes apply going forward without affecting existing content. Enterprise plan only.
For builders: Pay-as-you-go API, fal, and MCP
Three updates for developers this month. First, API access no longer requires a subscription. You can top up starting at $5 with straightforward USD-per-unit pricing across all API features.
Optional auto-reload keeps things running without manual intervention. If you're already subscribed, your existing discounts stay intact.
Second, HeyGen's core capabilities are now available on fal's, replicate, runware developer platform. That includes Video Agent, Image-to-Video, Translate (both Precision and Speed modes), and Digital Twin with Avatar 3 and Avatar 4. If you're already building on fal, you can add AI video generation without switching platforms or managing a separate integration.
Lastly, HeyGen MCP is available on Claude, Manus, and OpenAI — you can generate videos directly on these platforms.
Looking ahead
That's seven major updates in one month, and we're just getting started. Seedance 2.0 and Gamma went live last week. April's going to be fun.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen November 2025 product release
HeyGen releases a major update with Android video creation in the U.S., smarter video translation for speed or precision, upgraded Avatar IV realism and control, and a refreshed AI Studio with faster editing and new avatar styling and backgrounds.
Summary
This month, we're raising the bar on avatar realism, creative control, and global reach. With HeyGen now on Android, a smarter video translation engine, major upgrades to our latest avatar model (Avatar IV), and a refreshed AI Studio experience, you can produce polished, professional content even faster, whether you're scaling training, creating courses, or building your brand through video.
What’s new
HeyGen is now on Android
HeyGen is officially available on Android in the U.S. Create, edit, and publish videos directly from your phone, regardless of device.
For course creators, coaches, and anyone who needs to move fast, the Android app puts full video creation in your pocket. Capture ideas on the go, make quick edits between meetings, or publish content without ever opening your laptop.
U.S. Android user? Get started here.
Translate videos with speed and precision
We've redesigned how video translation works with two distinct engines built for different use cases.
Speed Mode provides fast, reliable translation in 175+ languages, accents, and dialects. It's optimized for quick turnaround, making it perfect for daily social posts, quick updates, or any content where you need to move fast. Speed Mode handles sentence-level translation with light alignment and solid lip-sync.
Precision Mode is our best-in-class translation engine, built for when accuracy matters most. It's video-aware: better occlusion handling, better multi-speaker support, context-integrated translation that's character-sensitive and timing-aware. The result is truly lifelike lip-sync and natural voice output, even for movement-heavy content. Ideal for courses, tutorials, product demos, and client deliverables.
If you're scaling content globally, you now have clear options: Speed when you need volume, Precision when you need polish.
Learn more about translating with Heygen.
Create more natural, controllable avatars with Avatar IV upgrades
Our SOTA avatar engine just got more powerful.
More natural movements
Control when gestures happen and in what sequence: prompt for a wave at the beginning, then natural movement as your avatar speaks. Your avatars finally move the way a real person would: purposefully, not on a loop.
Better control
With better expression controls, your avatars now display subtler, more realistic expressions and body language, the kind of micro-movements that make the difference between "AI video" and "professional video."
Intelligent render selection
We've simplified how rendering works. Avatar IV automatically uses the best rendering approach for your video. No more fretting over which mode to pick. When you add custom motion prompts, the system routes to our higher-fidelity engine under the hood. When you don't, you get our most stable, consistent output by default. Less guesswork, better results.
And AI Studio updates for all
We've redesigned the AI Studio to eliminate friction and give you more creative control without leaving the editor.
Style your avatar without leaving the studioGenerate looks directly inside Studio, including outfits, styles, or visual treatments for your avatar without switching to the Avatar tab. Need a blazer for a pitch or something casual for a course? Switch looks in seconds without breaking your flow.
Your avatar, anywherePlace your avatar in any environment instantly with custom Avatar Backgrounds. Swap backgrounds for solid colors, stock photos, AI-generated scenes, or your own custom uploads. Every background works seamlessly with all avatar framings: Full, Circle, or Close-up.
A faster, cleaner editing experienceOur fully redesigned Studio layout feels faster, cleaner, and easier to work in, especially for creators who spend hours editing.
Enjoy smoother interactions across long editing sessions, faster response times when adjusting parameters, and a cleaner interface that consolidates essential controls.
We've also overhauled the context menu, replacing More Options with a unified Properties panel. Edit any element, such as text, image, and shape, from one consistent menu, and access interactivity controls without digging through submenus.
What’s leaving
At HeyGen, we live for customer feedback. Our users shape who we are, what we build, and sometimes, what we remove. To enhance the user experience, please note the following changes we’ve made to the platform:
Voice Tab relocation
The standalone Voices tab has been relocated for your workspace. Nothing has been removed, your Voice tools now live in the workflows where they’re used most:
- AI Studio: Sidebar → Voice → Create / Integrate
- Proofread Studio: Voice dropdown → Track Voices
- Avatars: Voice setup for Avatar creation
Flux product placement removal
Flux product placement (Assets) are now removed as an editing option. Don’t worry:
- Your existing Flux Looks with product placement will remain untouched
- You can download and reuse anything you've already created but
- New Looks made with Flux won't include the product placement feature
While Flux product placement was valuable for specific use cases, HeyGen will rely on Nanobanana exclusively for Look edits using reference images. This means product placement will now be entirely handled through the Edit existing look option, ensuring superior results.
Media Tab removal
This feature was removed as part of our simplification efforts. Users did not find it helpful, and thus it was deprecated.
Questions? If this change impacts your workflow, our support team can help you explore options or talk through alternatives.
Looking ahead
The holidays are just around the corner, but for HeyGen customers, Christmas always comes early. Introducing (even more) avatars from HeyGen!
We release new avatars and looks every week! Be sure to regularly check the Avatars tab for even more ways to tell your story.
Sign up for free here
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen October 2025 product release
HeyGen releases an October 2025 update with LiveAvatar for real-time interactive avatars, new Veo 3.1 and OpenAI Sora 2 integrations for cinematic video creation, and expanded learning tools including quizzes, a redesigned interactivity UI, and multilingual SCORM export support.
Bring real conversations to life with LiveAvatar
HeyGen’s October 2025 release redefines interactive storytelling with lifelike LiveAvatars, cinematic Veo 3.1 and Sora 2 integrations, and powerful new learning tools.
This month, we’re taking storytelling and interactivity to the next level with new integrations, intelligent learning features, and lifelike avatars. From cinematic B-roll powered by Sora 2 and Veo 3.1 to interactive quizzes and LiveAvatar, these updates enable more dynamic, personal, and engaging video communication.
We’re excited to introduce LiveAvatar by HeyGen, offering hyper-realistic, real-time interactive avatars that enable face-to-face human conversation experiences on demand, and at scale. You can create your own avatar with just two minutes of footage or choose from diverse presets to instantly bring realism and personality to customer support, coaching, or education experiences. Already used by visionaries like Reid AI, Coursera, HP, Bosch, and Proto Hologram, LiveAvatar combines authenticity, responsiveness, and enterprise reliability to bring a new level of human touch to AI-powered communication.
Add integrations for next-level video communication
We’re bringing cinematic power directly into your creative workflow. New Veo 3.1 and OpenAI Sora 2 integrations give you complete control over avatars, visuals, and storytelling in HeyGen.
Veo 3.1
The Veo 3.1 integration gives creators full control over every voice, character, and scene. Generate once in Veo and keep your avatar consistent across shots, preserving your real voice, true look, and seamless motion. Perfect for dynamic action, scroll-stopping hooks, and multi-speaker avatar videos, you can use your existing avatar or start from a photo for future use. Upload up to three images per scene to add people, products, or environments mid-story.
OpenAI Sora 2
OpenAI Sora 2 is now fully integrated into HeyGen. Bring cinematic-quality video generation directly into your creative workflow. With Sora 2, you can instantly generate B-roll, scenes, and dynamic visuals from a simple prompt without switching tools, exporting files, or juggling software. This integration unlocks new levels of creativity and speed for creators, educators, and businesses alike, making it easier than ever to turn ideas into polished, professional video content.
Make interactive learning even easier
HeyGen’s Learning & Development Solution is now more powerful and interactive, enabling teams to create personalized, trackable learning experiences that keep learners engaged.
Add quizzes to videos
With built-in quiz features, L&D teams can reinforce key concepts, measure learner comprehension, and keep training sessions interactive from start to finish. By embedding quizzes directly into videos, you transform passive lessons into engaging learning experiences, turning viewers into active participants who retain more through real-time understanding checks.
Improved interactivity UI
HeyGen’s new interactivity layout makes it easier than ever to add and manage engagement features within your videos. The redesigned UI gives creators a clear, visual way to locate and insert branching paths, quizzes, and embedded links directly in the timeline.
Multilingual player support for SCORM export
The multilingual player now supports SCORM export, enabling the delivery of localized training at scale. With this update, L&D teams can share all translated videos through a single link that integrates directly into their LMS, allowing learners to access and complete training in the language of their choice within a fully trackable, SCORM-compliant experience.
Looking ahead
This month’s releases strengthen how teams create, communicate, and teach with video. With LiveAvatar, Veo 3.1, and Sora 2 integrations, as well as new interactive and multilingual training tools, HeyGen empowers every team to create content that’s more engaging, adaptable, and globally accessible.
Sign up for free here
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen September 2025 product release
HeyGen launches Video Agent and expands AI Studio with new AI-powered creative tools, multilingual playback, LMS integration, and caption customization, making it easier to create, localize, and share professional videos at scale.
Summary
This month’s updates include the public launch of Video Agent, new creative tools in AI Studio, multilingual playback, and LMS integration, making it faster and easier than ever to create, scale, and share professional videos.
This month, we’re introducing some of our biggest updates yet, including the public launch of Video Agent, powerful new creative tools in AI Studio, multilingual video playback, and LMS integration. These features are designed to help you create, scale, and share professional videos faster and more seamlessly than ever.
Generate a professional video from a simple prompt
We’re excited to announce the public release of Video Agent, the world’s first creative operating system for video. With just a single prompt, Video Agent handles the entire production workflow—scripting, visuals, voiceovers, avatars, editing, and delivery. It automatically builds a narrative, selects the right assets, syncs voiceovers with AI avatars, applies editing effects, and produces a final, ready-to-use video. Plus, with support for localization in over 175 languages and dialects, it’s built to scale content creation for global audiences.
The real value of Video Agent is in its speed and simplicity. What once took days or weeks can now be done in minutes, dramatically reducing costs and complexity for creators, marketers, and educators. By making video creation more consistent and brand-aligned, it lowers the barriers for anyone without professional production skills.
New tools for global training at scale
To support enterprises and L&D teams delivering training worldwide, we’re introducing new features that make HeyGen videos easier to integrate and share across platforms and languages.
LMS integration
L&D teams can now seamlessly scale training with support for learning management system (LMS) integration. Import HeyGen videos and courses directly into your LMS with just a few clicks. Simply open a project, select a generated video, click share, and choose LMS to get a ready-to-use link. This integration ensures that AI-powered training content fits into existing workflows, providing employees and learners with direct access to engaging, localized videos where they already learn.
Multi-lingual video player
We’re excited to introduce the multi-lingual player, a powerful new way to share translated content at scale. Instead of juggling multiple links, all your translated videos now live in a single, shareable link with a simple dropdown that lets viewers switch between languages for both audio and captions. Fully compatible with avatar and standard translated videos—including batch translations—the multi-lingual player ensures your audience can engage with your content in the language they know best, making training, marketing, and communications more accessible worldwide.
Make every video stand out with instant B-roll
To give creators even more flexibility and control, we’ve added powerful new tools for visual generation and caption customization inside AI Studio.
New AI tab in the editor
We added a new AI tab in AI Studio, designed to make image and video generation more seamless than ever. With support for Nano Banana and Flux Kontext, the update puts visual creation front and center, enabling high-quality B-roll generation directly in HeyGen. With support for image-to-image editing in this release cycle, it’s easier to enhance or adapt visuals, drop them into your canvas, and instantly level up your video projects without leaving AI Studio.
Caption customization
We’ve expanded creative control with 15 new caption styles and full font customization, making it easier to match your video’s tone, whether professional, playful, or cinematic. You can now upload and use your brand’s exact fonts for a consistent look across all content with no extra steps required.
Looking ahead
Together, these September updates mark a major leap forward in making HeyGen faster, smarter, and more global. From the launch of Video Agent to new AI-powered creative tools, caption customization, multilingual playback, and LMS integration, we’re building a platform that empowers creators, marketers, and L&D teams to scale video like never before.
Not a user yet?
Sign up for free here.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen August 2025 product release
HeyGen releases August 2025 updates with more realistic full-body Digital Twins, improved voice design, faster or higher-quality Avatar IV creation, and expanded enterprise and API tools for more personalized, scalable video creation.
HeyGen’s August 2025 release introduces more realistic full-body avatars, enhanced voice design, flexible creation modes, and expanded enterprise and API capabilities for scalable, personalized video content.
Users now have greater flexibility in designing and personalizing avatars and voices. Lifelike digital twins, customizable voice prompts, and new options for speed or quality enable fine-tuning for truly unique content.
Full control of how your avatars look and sound
Users now have greater flexibility in designing and personalizing their avatars and voices. From lifelike full-body Digital Twins to customizable voice prompts and new options for speed or quality in avatar creation, these updates provide users with more ways to fine-tune their content and make it truly their own.
Digital Twin is now powered by Avatar IV
HeyGen’s new Digital Twin, powered by Avatar IV, takes personalization and realism to an entirely new level. Built with the world’s leading avatar model, the technology enables users to create a version of themselves that doesn’t just talk, but also moves, gestures, and emotes in ways that feel unmistakably real. From every blink to every tonal shift, the digital twin mirrors its human counterpart with striking authenticity.
What sets this release apart is how immersive and complete the experience is. Digital Twin now represents your full body presence, not just your upper frame, and offers the flexibility of clean or dynamic backgrounds. With Avatar IV, the result is more than realistic footage; it’s recognizable and deeply personal, enabling avatars that don’t just sound like you but feel like you. By combining hyper-realism with adaptability, HeyGen’s Digital Twin redefines what it means to maintain a human presence in digital communication, offering creators the most personal experience the platform has ever built.
Improved voice design
HeyGen is enhancing the voice design experience by updating the modal to display the auto-populated prompt and give users the ability to customize it. Voice design now supports more descriptive prompts beyond age, gender, and ethnicity. This update provides both transparency and flexibility. Users will see the generated prompt, preview voices, and have the option to edit the prompt and regenerate new voices tailored to their needs.
Avatar IV speed and quality modes
HeyGen has introduced the option to choose between higher quality or faster creation when creating an avatar in Avatar IV. With the new update, users can now select a “Max Quality” mode for more detailed results or a “Faster” mode for quicker turnaround, depending on their needs. This flexibility is available across Avatar IV, AI Studio, and quick avatar video creation.
Expanded enterprise and API capabilities
HeyGen is evolving with new capabilities designed to meet the needs of enterprises. From expanded customization and workflow integrations to stronger admin controls, these updates make it easier for organizations to create content at scale, streamline collaboration, and maintain clear oversight across teams.
Template API support
AI Studio is now compatible with template and video creation through our API, giving teams greater flexibility and control. It introduces expanded customization options for variables, including avatars, text, and elements such as frames, icons, images, and videos. Even the script of a scene can be set as a variable and customized via the API, allowing for more dynamic and tailored video generation.
Support for Google Slides
Support for Google Slides has been added to editable templates, expanding beyond the existing PowerPoint integration. Users can now import their Google Slides directly into HeyGen, making it simple to transform presentations into engaging videos. This is especially useful for training content, where teams can quickly repurpose slide decks into video lessons without extra formatting or setup, streamlining the entire content creation process.
Export usage history by product
Admins now have the capability to export usage history by product directly from settings. Previously, only the full audit log could be downloaded, but this update makes it easier to access and analyze product-specific usage data. This is especially important for admins who need clear visibility into how different teams are using HeyGen, enabling better reporting, compliance tracking, and resource management.
Looking ahead
Together, these August updates are a major step forward in making HeyGen more powerful, customizable, and enterprise-ready. By combining hyper-realistic avatars, smarter voice tools, flexible quality modes, and stronger admin and API capabilities, HeyGen empowers both individuals and organizations to create with more precision and efficiency.
Not a user yet?
Sign up for free here.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
Announcing the Avatar IV API: Lifelike image-to-video, now in your product
HeyGen adds Avatar IV API, bringing its advanced image-to-video model to developers for lifelike talking videos from a photo and script, with clean lip-sync, expressive facial motion, natural gestures, and support for angled, profile, lifelike, and stylized characters.
Summary
Avatar IV API lets you generate lifelike talking videos from any photo, with clean lip-sync, expressive facial motion, and natural gestures.
Earlier this summer, we launched Avatar IV, our most advanced image-to-video model ever, in the HeyGen web app. Today, I’m excited to share that Avatar IV is now available via API, allowing you to embed it directly into your product experiences.
What’s new: Avatar IV, programmatic
With the Avatar IV API, a photo and a script are all you need to generate a realistic talking video, featuring all the HeyGen magic: accurate lip-sync, expressive facial movement, and even authentic hand gestures. Now you can trigger that exact flow with our API inside your app or workflow.
Under the hood, Avatar IV supports angled or profile photos and works across lifelike or stylized characters (humans, anime, and even pets). It’s built to handle real-world inputs while maintaining natural timing and motion.
Why partners are excited
- Frictionless creation from a photo. Let users start with an existing image (or a generated one) and produce a studio-quality avatar performance from just a script. No cameras. No shoots. Seconds, not days.
- More expressive, more engaging. Avatar IV syncs voice with emotion and powers gesture-aware motion, making every message land with clarity and charisma.
- Flexible by design. Works with front-facing, angled, or profile images; supports lifelike and stylized outputs for on-brand experiences across education, CX, marketing, internal comms, and more.
- Built for developers and product teams. Spin up jobs with a simple POST to our video generation endpoints; use photo-avatar endpoints when you want to programmatically add motion and sound effects or manage avatar assets at scale.
Here’s an example Avatar IV video to show the model’s realism and gesture quality.
Common partner use cases
- Learning and development (L&D), and Education: Turn slide thumbnails or instructor headshots into narrated modules and localize at scale.
- Sales and CX platforms: Auto-generate personalized explainers from a CRM record and just a photo of yourself, to help build personal relationships with your audience with video.
- Creative & marketing tools: Provide users with an “instant host” for product demos, social content, or ads, without needing to book talent.
Plans and availability
Avatar IV API is self-serve on our Pro and Scale tiers, with Enterprise available for custom rates and high-volume usage. You can find the Avatar IV API documentation here.
A note on quality and control
We purpose-built Avatar IV to feel and look natural in real product UX, with clean lip-sync, nuanced facial dynamics, and gesture timing that follows the script. It also pairs beautifully with our voice tools for delivery control, so content sounds as intentional as it looks.
Get started with Avatar IV today.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen July 2025 product release
HeyGen introduces July 2025 enterprise video updates with native screen recording, larger editable PPT and PDF imports, side-by-side voice testing, generative B-roll, premium motion elements, and new collaboration controls to help teams create polished videos faster.
HeyGen’s July 2025 release introduces powerful new features like screen recording, generative B-roll, and expressive voice tools to help enterprise teams create high-quality video content at scale.
This month, we’re releasing a suite of enhancements that give enterprise teams more power, flexibility, and creative control to produce video at scale. From integrated screen recording and expanded PPT/PDF editing to premium motion elements, expressive voice models, and generative B-roll, these updates streamline complex workflows while elevating video quality.
Whether your teams are building training programs, customer communications, or global marketing campaigns, HeyGen’s July updates help you work faster, collaborate more effectively, and deliver content that connects.
Expanding capabilities to meet enterprise content demands
Creating training, education, and marketing content at scale requires tools that simplify complex workflows. Our latest releases bring powerful enhancements to screen recording, voice testing, and presentation editing.
Screen recorder in AI Studio
Responding to strong demand from enterprises, we integrated our native screen recorder directly into the AI Studio text editor. You can now capture your screen—with optional mic audio—and instantly create a new scene with synced blocks, all without leaving AI Studio.
This feature is particularly valuable for training, enablement, and product education workflows, enabling teams to produce tutorial-style content faster and with fewer steps.
Editable PPT/PDF improvements
We expanded support for editable PowerPoint and PDF imports by raising the slide limit from 15 to 50 slides. You can now work with larger, more complex presentations while maintaining full editability in AI Studio. This improvement is particularly beneficial for training, corporate comms, and product marketing teams managing high-volume slide decks.
Side-by-side video testing
Side-by-side TTS (text-to-speech) testing is now live in AI Studio, enabling you and your team to directly compare voice engines within the same language and select the best fit for global audiences. Upcoming enhancements will also automatically correct suboptimal engine selections, ensuring even higher voice quality.
Enhance storytelling with seamless visuals and expressive audio
Our latest updates empower your teams to create videos that look and sound more polished than ever. We’ve introduced tools that elevate both the visual and voice experience, making it easier to create B-roll, deliver seamless transitions, dynamic motion elements, and expressive narration for more engaging storytelling at scale.
Generate B-roll with Veo 3 and Seedance 1.0
We’ve expanded HeyGen’s creative toolkit with support for Veo 3 and Seedance 1.0, enabling your team to generate high-quality B-roll directly within the platform. This eliminates the need to source supplemental footage externally, streamlining production while maintaining a consistent visual style. Advanced generative video models in HeyGen let you enrich storytelling, add variety to training and marketing content, and produce more engaging videos at scale.
Magic Match transitions
Our Magic Match transition feature automatically animates matching elements between scenes, eliminating jarring cuts and ensuring a fluid, professional-grade flow from start to finish. By intelligently connecting visuals, it delivers seamless scene-to-scene continuity that enhances storytelling and gives your content a polished, dynamic feel.
Premium motion elements
You can already enhance your videos by adding customizable elements such as text, speed, and zoom effects, making it easier to create engaging and professional content. We’re adding 12 new premium motion elements, including intros, speaker cards, quotes, headlines, and typewriter effects, giving creators everything they need to produce high-quality videos from start to finish without needing to use different tools.
Integration with Eleven v3
We’ve made significant progress in delivering more natural, expressive, and precise voice experiences. We’ve added integration for Eleven v3, ElevenLabs’ most expressive text-to-speech model, giving you greater control over expressiveness. We also added language-specific defaults to ensure the voice panel aligns automatically with the user’s chosen language, streamlining workflows for global enterprise teams.
Streamlined access control for seamless collaboration
Managing users and collaboration at scale requires simple, reliable tools. Our latest updates introduce new permissioning functionality and seat activation improvements, giving you more control over access while streamlining the process of adding and managing team members.
New permissioning functionality
We now directly enable the ability to invite someone to your account within the new share project or video experience, making collaboration more seamless and immediate. For enterprises, this capability is especially valuable because projects often involve cross-functional teams, external partners, or global stakeholders who need quick and secure access.
Seat activation improvements
We’ve refreshed the enterprise invite modal to make it easier for admins to review suggested users and issue mass invites with confidence. For large organizations, onboarding and managing teams efficiently is critical. This upgrade streamlines the setup process, reduces administrative overhead, and ensures the right people are added quickly and securely.
Looking ahead
These July enhancements demonstrate our ongoing commitment to empowering you teams with intuitive, scalable, and high-impact video solutions. From seamless screen recording and expressive voice controls to premium visual elements and streamlined collaboration, HeyGen continues to evolve to meet the complex needs of global organizations.
Experience how these updates can transform your video production workflows, elevate content quality, and give your teams the tools to scale with confidence and control.
Enjoy learning about new product releases? Please take a moment to fill out the enterprise education survey so we can better understand your preferred learning formats.
Not a user yet? Sign up for free here.
Original source - May 4, 2026
- Date parsed from source:May 4, 2026
- First seen by Releasebot:May 5, 2026
HeyGen June 2025 release: Empowering your enterprise with enhanced control and creative agility
HeyGen releases June 2025 enterprise video updates with AI-powered scripting, Quick Commands, Scene Split, enhanced security, and better team collaboration. The new Share Page adds comments, engagement metrics, and translations, while split-out consent and admin notifications streamline user management.
Summary
HeyGen’s June 2025 release brings smarter, faster, and more secure video creation to enterprises with AI-powered scripting, Quick Commands, Scene Split, enhanced security, and improved team collaboration.
June was an exciting month at HeyGen, highlighted by our inaugural keynote, where we showcased the future of AI-driven video creation. Building upon this milestone, our latest product enhancements empower Enterprise teams to create impactful, high-quality videos faster, smarter, and with greater collaboration and security than ever before.
Whether your objective is to elevate internal training, streamline corporate communications, or captivate audiences with marketing content, these innovations are designed to dramatically boost productivity and ensure exceptional results every time. Here’s how these improvements translate directly into benefits for your organization:
Accelerate content creation with intelligent workflows
Our latest enhancements empower your content teams to work smarter, not harder. We've introduced tools that simplify the video production process, allowing for quicker iterations and more dynamic storytelling.
Quick commands: Instant editing at your fingertips
Imagine effortlessly enhancing your videos without interrupting your workflow. With Quick Commands, simply type '/' within your script editor to instantly unlock a suite of powerful editing and enhancement tools. From GPT script writer and Voice Director to advanced visual adjustments, your team can seamlessly execute edits, significantly reducing production cycles and unlocking features you didn’t even know you had.
This translates to significant time savings for your content creators. With Quick Commands, your teams can quickly refine scripts, add effects, and make on-the-fly adjustments, accelerating content turnaround times and boosting overall productivity. It also aids in the discovery of powerful (yet sometimes hidden) features, ensuring your teams leverage the full potential of HeyGen.
GPT script writer: AI-powered scripting and refinement
Great videos start with great scripts. Our GPT-powered script assistant provides enterprise teams with intelligent, on-demand scripting support directly in your editor. Generate polished, brand-consistent scripts quickly, refine your messaging effortlessly, and maintain the highest standards of content quality across all your communications.
For large organizations, content consistency and rapid script generation are crucial. The GPT Script Writer can help standardize messaging, generate initial drafts for various campaigns, and provide intelligent suggestions for refining existing scripts. This empowers marketing, training, and communications teams to produce high-quality, on-brand video content at an unprecedented pace, reducing reliance on external copywriting resources and ensuring consistent brand voice across all video assets.
Scene split: Granular control over pacing
Structure and pacing can make or break viewer engagement. Scene Split gives you unprecedented control to precisely segment any scene, enhancing narrative flow and ensuring your message is compelling and clear. Perfect for training videos, key announcements, or engaging storytelling, this feature enables you to deliver content exactly as intended. This is particularly valuable for complex training modules, detailed product demonstrations, or impactful corporate communications where precise timing can significantly enhance clarity and retention.
Enhanced security and streamlined user management
Managing users and ensuring compliance is paramount for enterprise organizations. Our June releases bring significant improvements to engagement tracking, user consent, and administrative oversight, giving you peace of mind and more efficient team management.
New share page: Gain deeper insights and collaboration
Understanding how your content performs is critical for your team’s success. Our revamped Share Page not only consolidates video comments for easy review but also provides essential engagement metrics like views, shares, watch time, and completion rates. Plus, viewers can instantly translate your videos into multiple languages, extending your reach and ensuring your message resonates globally.
Split out consent: Delegated avatar usage for scalability
Scale your video production securely and effortlessly across teams and trusted external partners. Our Split-out Consent feature allows enterprise customers to delegate avatar video creation rights independently from avatar management. Retain absolute control over compliance and usage, ensuring secure, efficient collaboration across your entire organization.
This is a game-changer for large-scale avatar deployments and team collaboration. For organizations with numerous spokespeople, brand ambassadors, or internal subject matter experts, this feature allows designated video producers or marketing teams to create content using pre-approved avatars without needing direct access to the original avatar creator's account.
Admin control notifications: Keeping you informed
Efficient team management requires clear oversight. We've introduced robust notification enhancements to simplify the administration of your growing teams on HeyGen:
- Admins receive notifications when new free users sign up for HeyGen within your domain.
- Free users see in-app notifications when a team plan for their domain is set up.
- Free users see their request is pending until an admin accepts it within notifications.
These consolidated notifications provide critical insights for administrators. Knowing when new free users from your domain sign up allows you to identify potential team members who could benefit from a full enterprise license, facilitating seamless onboarding and team expansion. Furthermore, the clear communication to free users about pending requests ensures transparency and reduces friction in joining your established team plan. This empowers administrators to maintain better oversight, manage licenses more effectively, and ensure that all relevant users are integrated into your HeyGen ecosystem.
Ready to unlock HeyGen's latest innovations?
These powerful enhancements reflect our unwavering commitment to providing enterprise customers with intuitive, secure, and scalable video solutions tailored specifically to complex organizational demands. From advanced editing capabilities and AI-driven scripting to robust administrative oversight, HeyGen continually evolves to support your organization's ambitious goals.
Discover firsthand how these June 2025 product updates can revolutionize your enterprise video production workflows, dramatically improve content quality, and scale confidently and securely.
Not a user yet?
Sign up for free here.
Original source