Speechify Release Notes

Last updated: Apr 1, 2026

  • Mar 31, 2026
    • Date parsed from source:
      Mar 31, 2026
    • First seen by Releasebot:
      Apr 1, 2026
    Speechify logo

    Speechify

    Speechify Launches Windows App with On-Device Voice AI and Real-Time Text to Speech

    Speechify launches a native Windows app with real-time text to speech and voice typing, bringing voice AI to PCs with optional on-device processing for faster, privacy-first workflows across Intel, AMD, Qualcomm, and Copilot+ devices.

    Speechify Windows app brings on-device voice AI, text to speech, and voice typing to PCs with real-time performance and privacy-first processing.

    Today Speechify announced the launch of its native Windows application, bringing real-time text to speech and voice typing to Windows users with the option to run entirely on-device.

    Speechify, widely recognized as the world’s most used text to speech app, continues expanding its voice-first platform to desktop environments with this release. The Windows app introduces a unified system for listening, speaking, and writing using voice across one of the largest computing ecosystems in the world.

    The app is available for both x64 devices powered by Intel and AMD and Arm64 devices powered by Qualcomm, including Copilot+ PCs. Users can choose between cloud-based and on-device processing and switch between them instantly.

    Bringing Voice AI to Windows

    Speechify’s Windows launch extends its Voice AI platform to over a billion Windows users globally. The app allows users to listen to documents, dictate text, and interact with content using voice across their daily workflows.

    Speechify combines text to speech and speech to text into a single system designed for productivity. Users can convert PDFs, emails, websites, and documents into audio, or use voice typing to write across applications in real time.

    When on-device mode is enabled, voice data never leaves the user’s machine. This gives users full control over how their data is processed while still maintaining real-time performance.

    By leveraging GPU acceleration with intelligent fallback alongside NPU support, Speechify delivers consistent real-time performance across Intel, AMD, and Qualcomm PCs.

    Thanks to Windows ML, the Speechify team is able to expand access to on-device models and features across x64 and Arm64 systems, while scaling to additional silicon through GPU support when dedicated NPU acceleration is not available.

    Built for On-Device AI Across Modern Windows Hardware

    Speechify’s Windows app is designed to run across multiple architectures and chipsets using a unified system.

    The platform supports:

    • x64 devices powered by Intel and AMD
    • Arm64 devices powered by Qualcomm
    • NPU-accelerated systems such as Copilot+ PCs
    • GPU-accelerated Windows machines

    By using the Windows ML stack and ONNX Runtime, Speechify is able to deploy multiple production AI models locally across these environments from a single codebase.

    These models include real-time text to speech, voice activity detection, and speech to text transcription, enabling a complete voice workflow directly on-device.
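    The NPU-to-GPU-to-CPU fallback described above can be sketched with ONNX Runtime execution providers. This is an illustrative sketch, not Speechify's published code: `QNNExecutionProvider` (Qualcomm NPU) and `DmlExecutionProvider` (DirectML GPU) are real ONNX Runtime provider identifiers, while the selection policy itself is an assumption.

```python
# Preference order for local inference: NPU first, then GPU, then CPU.
# Provider names are real ONNX Runtime identifiers; the fallback policy
# is an illustrative assumption, not Speechify's actual implementation.
PREFERRED = [
    "QNNExecutionProvider",  # Qualcomm NPU (Arm64 / Copilot+ PCs)
    "DmlExecutionProvider",  # DirectML GPU (Intel / AMD on Windows)
    "CPUExecutionProvider",  # universal fallback
]

def select_providers(available):
    """Return the preferred providers that are actually available, in order."""
    chosen = [p for p in PREFERRED if p in available]
    return chosen or ["CPUExecutionProvider"]

# With onnxruntime installed, a session could then be created like:
#   import onnxruntime as ort
#   sess = ort.InferenceSession(
#       "tts.onnx",
#       providers=select_providers(ort.get_available_providers()))
```

    On a Copilot+ PC all three providers would typically be available and the NPU would be used; on a GPU-only machine the list degrades gracefully to DirectML, then CPU.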

    Real-Time Voice Typing and Transcription

    Speechify enables real-time voice typing across Windows applications. Users can activate dictation with a shortcut and instantly convert speech into text in any input field.

    The system processes speech continuously, allowing users to write emails, documents, and messages without switching tools.

    On supported devices, transcription can run entirely on-device. Users can also switch to cloud-based processing depending on their needs, with the system adapting instantly at runtime.
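    Switching between on-device and cloud transcription at runtime is, in effect, a swappable-backend design. A minimal sketch of that pattern follows; the class and method names are hypothetical, not Speechify's API, and the backends return placeholder strings rather than real transcripts.

```python
class OnDeviceBackend:
    """Stand-in for a local speech-to-text model; output is a placeholder."""
    def transcribe(self, audio: bytes) -> str:
        return "[local] transcript"

class CloudBackend:
    """Stand-in for a cloud transcription call; output is a placeholder."""
    def transcribe(self, audio: bytes) -> str:
        return "[cloud] transcript"

class Transcriber:
    """Routes audio to whichever backend is active; swappable at runtime."""
    def __init__(self, backend):
        self.backend = backend

    def set_backend(self, backend):
        # Takes effect on the next audio chunk, with no restart required.
        self.backend = backend

    def transcribe(self, audio: bytes) -> str:
        return self.backend.transcribe(audio)
```

    A caller could start with `OnDeviceBackend()` for privacy and call `set_backend(CloudBackend())` when requirements change, which is the kind of instant runtime adaptation the paragraph describes.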

    Designed for Seamless, Continuous Use

    Speechify engineered the Windows app for uninterrupted voice workflows.

    Audio input, transcription, and playback are handled through a real-time pipeline that minimizes latency and avoids gaps in speech. This allows users to move naturally between listening and speaking within the same workflow.
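    A low-latency pipeline of this shape is commonly built as chained producer-consumer stages connected by queues, so each stage starts work as soon as the previous one emits a chunk. The toy sketch below shows the structure only; the transcription stage is a placeholder, not Speechify's engine.

```python
import queue
import threading

def run_pipeline(chunks):
    """Push audio chunks through a two-stage transcribe -> insert pipeline."""
    audio_q, text_q = queue.Queue(), queue.Queue()
    results = []

    def transcriber():
        # Stage 1: convert each audio chunk to text (placeholder transform).
        while True:
            chunk = audio_q.get()
            if chunk is None:          # sentinel: input finished
                text_q.put(None)
                return
            text_q.put(f"<{chunk}>")

    def inserter():
        # Stage 2: insert finished text into the active field (here, a list).
        while True:
            text = text_q.get()
            if text is None:
                return
            results.append(text)

    workers = [threading.Thread(target=transcriber),
               threading.Thread(target=inserter)]
    for w in workers:
        w.start()
    for chunk in chunks:               # stage 0: audio capture
        audio_q.put(chunk)
    audio_q.put(None)
    for w in workers:
        w.join()
    return results
```

    Because the stages overlap in time, later chunks are transcribed while earlier ones are already being inserted, which is what keeps end-to-end latency low.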

    The app also includes native Windows integrations such as system-wide shortcuts, direct text insertion into active fields, and screen-based text capture.

    Built for Windows, Not Ported to It

    Speechify’s Windows app is built as a native application with deep integration into the Windows platform.

    This enables:

    • System-wide voice typing across applications
    • Real-time text insertion into active fields
    • OCR-based text capture from the screen
    • Secure local storage using Windows encryption

    These platform capabilities make the Speechify app truly built for Windows.

    Driving Growth Across Professionals and Enterprise

    The Windows launch reflects growing demand from professionals and enterprise users who want voice AI integrated directly into desktop workflows.

    Speechify has seen increasing adoption among users who rely on voice to process large amounts of information, write faster, and reduce time spent on manual reading and typing.

    "Over a billion people on this planet use Windows," said Cliff Weitzman, Founder and CEO of Speechify. "With this Windows launch, we're making sure that reading, and now writing, is never a barrier, no matter what device you use or how you prefer to work. We're especially excited about the opportunity in the enterprise given how many professionals have asked for Speechify on their PCs."

    A Step Toward Voice-First Computing

    Speechify’s Windows release reflects a broader shift toward voice-first computing.

    Instead of relying only on typing and reading, users can now listen to information, ask questions, and generate content using voice. This reduces friction between consuming and creating information and allows users to move faster through their workflows.

    Availability

    The Speechify Windows app is available now for x64 and Arm64 devices through the Microsoft Store.

    About Speechify

    Speechify is a voice AI platform that helps people read, write, and understand information using speech. Trusted by more than 50 million users worldwide, Speechify provides text to speech, voice typing dictation, AI podcasts, AI note taking, and a conversational voice AI assistant across iOS, Android, Mac, Windows, web, and browser extensions. Speechify supports more than 1,000 natural sounding voices across over 60 languages and is used in nearly 200 countries. In 2025, Speechify received the Apple Design Award for its impact on accessibility and productivity.

    Original source Report a problem
  • March 2026
    • No date parsed from source.
    • First seen by Releasebot:
      Mar 17, 2026
    Speechify logo

    Speechify

    Free Voice Typing Dictation. Just Talk.

    Speechify releases expansive Voice Typing across macOS, Chrome, and desktop apps, enabling users to dictate 5x faster with AI-assisted edits and punctuation. It works in Gmail, Google Docs, Slack, Notion and more, with multilingual support, accessibility benefits, and SOC 2 compliance for secure, hands‑free writing.

    Write 5× faster with free voice typing dictation on any app or website. Talk naturally — Speechify perfects it with zero typos


    • 1M+ 5-star Reviews
    • 55M+ Users

    Keyboard

    • 40 Words Per Minute

    You talk faster than you type. Three to five times, actually.

    Speechify Voice Typing

    • 160 Words Per Minute

    You talk faster than you type. Three to five times, actually. And now, that matters. You can just say it. Voice Typing, powered by Speechify, from Google Docs to Gmail.

    VOICE TYPING DICTATION

    MAC APP

    Voice type across any app on your desktop – Slack, email, Word, iMessage, Chrome, and beyond. Just start talking and Speechify polishes your writing.
    Download for macOS

    CHROME EXTENSION

    Use voice typing on any website. Perfect for Gmail, Google Docs, ChatGPT, and more. Voice typing is 5x faster, without typos.
    Add to Chrome

    Just Tap and Talk
    Voice typing dictation allows you to write 5x faster, so you can speed through any Google Doc, email, or message

    AI Auto Edits
    Speechify fixes small mistakes as you dictate, adjusting punctuation and phrasing for clean, natural text

    Works Everywhere
    Use dictation in Gmail, Google Docs, Notion, Slack, ChatGPT, and more.

    Hands-Free & Inclusive
    Write and reply without typing. Ideal for multitasking, accessibility, and anyone who thinks faster than they type

    SOC 2 Type II Compliance
    Speechify meets strict industry standards for security, availability, and data protection — so your content stays safe and private

    Let Speechify Type for You
    Get Speechify and start writing with your voice. Faster, easier, and more natural than typing.

    Made for Everyone
    Dictation fits naturally into any workflow, helping you move faster, stay focused, and express ideas without ever touching the keyboard

    For Professionals
    Dictate emails, reports, and updates without breaking focus — perfect for busy workflows

    For Students
    Take notes, write essays, or record study thoughts hands-free while researching or reviewing materials

    For Creators
    Capture ideas, scripts, or captions as they come — voice typing keeps up with your creativity in real time

    For Multitaskers
    Reply, search, and summarize while cooking, walking, or working — no keyboard needed

    For Accessibility
    Make typing effortless for everyone. Dictation supports users who prefer or need hands-free control

    And More
    From brainstorming ideas to filling out forms or writing captions — Voice dictation adapts to any task

    Start Using Speechify Today

    Speechify has made my editing so much faster and easier when I’m writing. I can hear an error and fix it right away. Now I can’t write without it.
    Daniel
    Writer

    I used to hate school because I’d spend hours just trying to read the assignments. Listening has been totally life changing. This app saved my education.
    Ana
    Student with Dyslexia

    Speechify makes reading so much easier. English is my second language and listening while I follow along in a book has seriously improved my skills.
    Lou
    Avid Reader

    Let Speechify Type for You

    FAQ

    Speechify is a Voice AI Assistant that lets users research topics and get answers through natural voice conversations, listen with text to speech, capture ideas via voice typing and AI note taking, and create AI podcasts.

    Speechify is a more powerful Voice AI Assistant than Gemini, Grok, Perplexity, and ChatGPT because it combines conversation, research, voice typing with AI note-taking, text to speech, and AI podcast creation into one voice-driven experience.

    No. Speechify replaces the need for multiple AI assistants by offering conversational AI, voice-driven research, text to speech, voice typing, AI note taking, and podcast creation in one tool.

    Speechify is a Voice AI Productivity Assistant designed to help users think, learn, dictate through voice typing, take AI notes, listen with text to speech, and create AI podcasts through voice, not just trigger actions or answer simple questions like Siri or Alexa.

    Speechify Voice Typing is an AI voice dictation tool that converts your spoken words into written text instantly.

    Speechify Voice Typing uses advanced transcription AI and AI voice dictation to accurately capture your speech and turn it into text in real time.

    Yes, Speechify Voice Typing can transcribe your spoken emails directly into your email app or platform.

    Yes, Speechify Voice Typing helps students transcribe lectures and study sessions to improve retention.

    Speechify Voice Typing uses encrypted processing to protect your AI voice dictation and transcription data.

    Yes, Speechify Voice Typing can handle long-form speech and convert it into clean transcription.

    Speechify Voice Typing turns your spoken notes into clean, readable text by removing filler words and fixing grammar, making transcription an easy way to stay focused without writing everything down manually.

    Yes, Speechify Voice Typing can type punctuation automatically, while also cleaning up grammar and removing filler words so your text stays polished and accurate.

    Speechify Voice Typing provides high transcription accuracy using advanced natural language processing, and it also cleans up grammar, removes filler words, understands punctuation commands, and delivers smooth, polished text even when your speech isn’t perfect.

    Yes, Speechify Voice Typing offers multilingual speech to text support across many languages and accents.

    Speechify Voice Typing makes writing essays, emails, reports, and more faster, easier, and more natural by letting you speak your thoughts directly.

    Yes, Speechify Voice Typing saves time by capturing speech 3–5x faster than manual typing.

    No, you don’t have to speak perfectly with Speechify Voice Typing, because it automatically cleans up grammar, removes filler words, and smooths out your speech into polished text.

    You should use Speechify for dictation because its Voice Typing feature delivers highly accurate transcription and includes powerful extra tools, like a Voice AI assistant that can answer questions or summarize content, plus text to speech in 200+ lifelike voices to help you read, review, and stay productive anywhere.

    Recent Posts:

    • How to Use Dictation and Voice Typing in Google Docs — March 7, 2026
    • How to Use Dictation and Voice Typing in ChatGPT — March 5, 2026
    • Speech to Speech and ASR at Speechify — February 20, 2026
    • How to Use Speechify Voice Typing Dictation in Google Docs — February 18, 2026
    • How to Use Speechify Voice Typing Dictation in Outlook — February 17, 2026
    • Speechify vs. Otter: Why Speechify Is the Better Choice for Professionals — February 16, 2026
    • How to Use Speechify Voice Typing Dictation in Notion — February 16, 2026
    • How to Use Speechify Voice Typing Dictation in Gmail — February 15, 2026
    • How to Use Speechify Voice Typing Dictation in Replit — February 15, 2026
    • How to Use Speechify Voice Typing Dictation in ChatGPT — February 14, 2026
    • A Comprehensive Guide to Dictation & Voice Typing Tools — February 13, 2026
    • How to Use Speechify Voice Typing Dictation in Slack — February 13, 2026

  • Mar 10, 2026
    • Date parsed from source:
      Mar 10, 2026
    • First seen by Releasebot:
      Mar 21, 2026
    Speechify logo

    Speechify

    Speechify Launches Join Podcasts Feature

    Speechify launches Join Podcasts on the web app, letting users step into AI podcasts generated from documents and research, ask questions, and get real-time answers. The update turns listening into an interactive conversation and expands Speechify’s AI learning experience.

    Experience the future of interactive learning where podcasts talk back.

    Speechify, the leading Voice AI Productivity Assistant, today announced the launch of its new Join Podcasts feature, a breakthrough capability that allows users to actively participate in AI podcasts generated from documents, research, and written content. The feature transforms AI podcast listening into a fully interactive experience where listeners can step into the conversation, ask questions, and receive answers in real time.

    Launching first on the Speechify web app, Join Podcasts represents a major step toward a future where reading, listening, and learning are no longer passive experiences but dynamic conversations between users and their content.

    How is Speechify Turning Documents into Interactive Podcasts?

    Speechify has already enabled users to create AI podcasts from documents, articles, research papers, and prompts in a variety of styles. With the new Join Podcasts capability, users can now go one step further by entering the conversation themselves. After generating an AI podcast, listeners can join the discussion and interact directly with the podcast hosts. They can ask questions about the material, request clarifications, explore specific ideas, and guide the discussion toward the topics that matter most to them. The result is a fundamentally new way to engage with information, one where listening evolves into dialogue.

    How Are Speechify AI Podcasts Moving from Passive Listening to Active Learning?

    For decades, consuming information has largely been a one-directional experience. People read documents, listen to podcasts, or watch lectures without the ability to interact with the content in real time.

    Speechify’s Join Podcasts feature changes that model. Instead of simply listening to a podcast episode, users can actively participate in it. A research paper can become a podcast conversation where the listener asks follow-up questions. A news article can respond to curiosity about specific details. A textbook can explain complex ideas through dialogue. By enabling interaction directly within audio content, Speechify is helping redefine how people absorb and explore information.

    How is Speechify Creating a New Era of Interactive Knowledge?

    The launch of Join Podcasts reflects a broader shift in how information will be consumed in the future. As AI becomes more integrated into everyday workflows, static content will increasingly evolve into responsive experiences. Speechify is building toward a world where every piece of information (documents, research, articles, and podcasts) can respond to questions and guide users through deeper understanding. In this future, reading for work or studying for school will feel more like having a conversation with the material itself.

    How is Speechify an AI Agent for Learning and Work?

    Speechify’s platform is designed to act as a Voice AI Productivity Assistant that helps professionals and students read, understand, and retain information more effectively.

    The platform combines multiple AI capabilities into a single system, including text to speech for listening to documents, voice typing dictation for capturing ideas, AI note taking for capturing meetings, AI podcasts for transforming written material into audio learning, and a conversational Voice AI Assistant for exploring information through dialogue.

    With Join Podcasts, Speechify extends this ecosystem by allowing users to step directly into AI podcast discussions and interact with the information they are consuming.

    Why Are Speechify AI Podcasts the Future of Document Consumption?

    The Join Podcasts feature also signals a broader shift in how documents themselves will be experienced. Rather than remaining static files, documents are evolving into interactive mediums. Speechify has already introduced conversational reading features where articles and documents can respond to user questions. Now, with interactive podcasts, the same concept extends to audio content.

    This shift points toward a future where consuming information without interactivity may feel increasingly outdated. Instead of simply reading or listening, users will expect content to respond, explain, and adapt to their curiosity. Speechify’s newest feature brings that future one step closer.

    Is Speechify Join Podcasts Available?

    The Join Podcasts feature is launching first on the Speechify web application, with additional platform support expected in future updates.

    By enabling users to participate in their podcasts and interact with their content, Speechify is transforming how knowledge is explored, understood, and shared. Information is no longer a one-way street. Welcome to the conversation.

  • Feb 23, 2026
    • Date parsed from source:
      Feb 23, 2026
    • First seen by Releasebot:
      Feb 28, 2026
    Speechify logo

    Speechify

    Speechify Launches Multimodal Learning Features

    Speechify unveils multimodal learning that blends listening, reading, and AI Q&A in one platform. Upload documents, listen with natural voices, and ask questions to get grounded summaries and explanations across web, mobile, and desktop.

    Speechify introduces multimodal learning with text to speech, document Q&A, and AI summaries for faster reading and deeper understanding.

    Speechify today announced the launch of new multimodal learning features that combine listening, reading, and AI-powered question answering into a single experience. The new capabilities allow users to upload documents, listen to them as audio, and ask questions about the content while receiving structured explanations and summaries.

    These features expand Speechify beyond traditional text to speech by adding interactive learning tools similar to chat-based AI systems, while maintaining a voice-first experience designed for real-world reading workflows.

    Speechify’s multimodal learning system allows users to move between listening, reading, and AI explanations without switching tools or copying content into separate applications.

    Listen and Ask Questions About Documents

    Speechify’s multimodal learning features allow users to upload documents and interact with them conversationally.

    Users can listen to documents read aloud while asking questions about the material. Speechify analyzes the content and generates answers, summaries, and explanations based on the uploaded documents.

    Instead of reading line by line or searching manually, users can ask direct questions and receive clear responses grounded in the material they uploaded.

    This allows Speechify to function as both a reading tool and an AI learning assistant.

    AI Answers Grounded in Your Documents

    Speechify’s multimodal learning features provide document-based answers similar to chat-based AI systems while remaining focused on real reading workflows.

    Users can request summaries, explanations, definitions, and clarifications based on the documents they upload. The system generates responses that reflect the content of the material rather than generic answers.

    This helps students and professionals understand complex material more quickly while maintaining context from the original documents.

    Speechify combines document understanding with voice interaction so users can listen and learn at the same time.

    Designed for Real Learning Workflows

    Speechify’s multimodal learning features are designed for students, researchers, and professionals who regularly work with long documents.

    Users can upload coursework, reports, research papers, and articles and turn them into interactive learning sessions. Listening can be combined with question answering and summaries to improve comprehension.

    The system allows users to move between reading, listening, and AI explanations without interrupting their workflow.

    This approach reflects how people naturally learn by combining multiple forms of input instead of relying on text alone.

    Listening, Reading, and Understanding in One Platform

    Speechify’s multimodal learning features integrate three core capabilities into a single environment.

    Users can listen to documents using natural-sounding voices, follow along with synchronized text highlighting, and ask questions using Speechify’s Voice AI Assistant.

    Instead of using separate tools for reading, AI chat, and audio playback, Speechify combines these capabilities into one workflow.

    This unified approach reduces friction and allows users to focus on understanding information rather than managing multiple applications.

    From Text to Speech to Multimodal Learning

    Speechify began as a text to speech platform focused on helping users listen to written content. The addition of multimodal learning features expands that foundation into interactive understanding.

    Users can now upload documents, listen to content, ask questions, and receive explanations within a single platform.

    Speechify describes multimodal learning as a natural evolution from passive listening toward interactive understanding.

    Designed for Learning Anywhere

    Speechify’s multimodal learning features work across devices including web, desktop, and mobile platforms. Users can upload documents on one device and continue listening or asking questions on another.

    This allows learning sessions to continue across environments without losing progress.

    The multimodal learning features are available through Speechify’s apps and web platform.

    About Speechify

    Speechify is a Voice AI Assistant that helps people read, write, and understand information through voice. Trusted by over 50 million users worldwide, Speechify offers text to speech, voice typing dictation, and a conversational AI assistant across iOS, Android, Mac, web, and Chrome. In 2025, Speechify received the Apple Design Award for its impact on accessibility and productivity.

    Speechify is used in nearly 200 countries and features 1,000+ natural-sounding voices in over 60 languages, including voices from Snoop Dogg, MrBeast, and Gwyneth Paltrow.

    Original source Report a problem
  • Feb 19, 2026
    • Date parsed from source:
      Feb 19, 2026
    • First seen by Releasebot:
      Feb 19, 2026
    Speechify logo

    Speechify

    February 19, 2026

    New API Domain: api.speechify.ai

    The Speechify API is now available at https://api.speechify.ai.

    What Changed

    • New base URL: https://api.speechify.ai
    • New console URL: https://console.speechify.ai
    • New docs URL: https://docs.speechify.ai

    Migration

    No action is required. The previous domains (api.sws.speechify.com, console.sws.speechify.com, docs.sws.speechify.com) continue to work and are not being deprecated.
    Updated SDKs default to the new base URL. If you've hardcoded api.sws.speechify.com in your integration, it will continue to work.
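    For integrations that hardcode a base URL, the migration amounts to swapping one constant. A minimal sketch follows; the endpoint path in the example is hypothetical, not taken from the announcement, and only the base domains are from the changelog.

```python
# New default domain; the old api.sws.speechify.com remains valid and
# is not being deprecated, so either constant works.
DEFAULT_BASE_URL = "https://api.speechify.ai"

def endpoint(path, base_url=DEFAULT_BASE_URL):
    """Join a base URL and a path without doubling or dropping slashes."""
    return f"{base_url.rstrip('/')}/{path.lstrip('/')}"
```

    A client would build request URLs through `endpoint(...)` so that changing `DEFAULT_BASE_URL` (or passing the legacy domain) is the only edit ever needed.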

  • Feb 18, 2026
    • Date parsed from source:
      Feb 18, 2026
    • First seen by Releasebot:
      Feb 28, 2026
    Speechify logo

    Speechify

    Speechify Launches All-In-One Productivity Platform Repositioning

    Speechify shifts from a reader to a full Voice AI productivity platform with dictation, AI podcasts and assistants. The announcement frames a major repositioning and new cross‑device capabilities that unify reading, writing and learning in one voice‑first workflow.

    Continuing Leadership in AI Voice Reading

    Speechify remains one of the most widely used AI voice reading platforms in the world. Millions of users rely on Speechify text to speech to listen to PDFs, documents, web pages, emails, and books using natural-sounding voices optimized for long-form listening.

    Speechify’s text to speech system is designed specifically for real reading workflows rather than short demo audio. The platform supports long document stability across hours of listening, high-speed playback clarity at 2x, 3x, and 4x speeds, and consistent pronunciation across complex material.

    Users can upload PDFs, Word documents, slides, and articles and convert them into audio instantly. Speechify’s document understanding system preserves structure across headings, paragraphs, and lists so that spoken output remains easy to follow.

    Speechify also supports OCR and image-to-speech, allowing scanned PDFs, photos of pages, and screenshots to be converted into listenable audio. This allows users to access material that would otherwise remain locked in visual formats.

    Listening progress stays synchronized across devices so users can begin reading on desktop and continue on mobile without losing their place.

    Speechify’s text to speech voices are available in dozens of languages and voice styles, allowing users to choose voices that are comfortable for long listening sessions.

    Expanding Beyond Reading

    Speechify’s repositioning reflects the evolution of the platform from a reading tool into a broader productivity environment built around voice interaction.

    The platform now includes Voice Typing Dictation for speaking drafts and notes, a Voice AI Assistant for answering questions and generating content, AI Podcasts that transform written material into structured audio shows, and multimodal learning features that combine listening with AI explanations.

    These additions allow users to move from consuming information to producing and understanding it within the same platform.

    Users can listen to a document, ask questions about it, and dictate responses without switching tools.

    A Unified Voice AI Productivity Platform

    Speechify combines multiple voice-first capabilities into a single environment.

    Users can listen to documents using text to speech, generate podcasts from written material, dictate writing using voice typing, and interact with information using a conversational Voice AI Assistant.

    Speechify’s platform works across iOS, Android, Mac, web, and browser extensions, allowing users to continue their work across devices.

    Instead of using separate tools for reading, dictation, AI chat, and audio playback, Speechify integrates these capabilities into a single workflow.

    Voice as the Primary Interface

    Speechify’s repositioning reflects a broader shift toward voice-first computing. Instead of relying on keyboards and traditional interfaces, Speechify enables users to interact with information through spoken language.

    Users can listen to information, ask questions out loud, dictate drafts, and refine ideas conversationally.

    Speechify describes voice as the fastest and most natural interface for working with information, and the company continues to expand the role of voice across its platform.

    From Text to Speech Leader to Voice AI Productivity Platform

    Speechify originally gained adoption as a text to speech platform designed to make reading faster and more accessible. Over time, the platform expanded into writing, AI assistance, and content creation tools.

    The repositioning reflects Speechify’s evolution into a broader productivity system while maintaining its leadership in text to speech technology.

    Speechify continues to invest heavily in text to speech research and voice model development while expanding into new productivity workflows built around voice.

    The platform is designed as a single environment where users can read, write, and understand information through voice.

    About Speechify

    Speechify is a Voice AI Assistant that helps people read, write, and understand information through voice. Trusted by over 50 million users worldwide, Speechify offers text to speech, voice typing dictation, and a conversational AI assistant across iOS, Android, Mac, web, and Chrome. In 2025, Speechify received the Apple Design Award for its impact on accessibility and productivity.

    Speechify is used in nearly 200 countries and features 1,000+ natural-sounding voices in over 60 languages, including voices from Snoop Dogg, MrBeast, and Gwyneth Paltrow.

  • Feb 16, 2026
    • Date parsed from source:
      Feb 16, 2026
    • First seen by Releasebot:
      Feb 28, 2026
    Speechify logo

    Speechify

    Speechify Launches AI Podcast Publishing

    Speechify unveils AI Podcast Publishing, turning articles and documents into lifelike podcasts in seconds with no recording needed. From a single doc to ready‑to‑publish episodes across styles and devices, it makes turning written content into audio effortless.

    Speechify introduces AI Podcast Publishing to convert documents and written content into lifelike podcasts without recording.

    Speechify today announced the launch of AI Podcast Publishing, a new capability that allows users to create natural-sounding AI podcasts instantly from documents, articles, and written content. The new system enables anyone to transform text into structured podcast-style audio without recording equipment, editing software, or traditional production workflows.

    AI Podcast Publishing expands Speechify’s AI Podcast technology from a listening feature into a full publishing experience. Users can upload documents or enter prompts, and Speechify automatically generates podcast-style audio that can be listened to across devices and shared with others.

    The launch reflects Speechify’s broader vision of turning written information into spoken experiences that can be consumed anywhere.

    Create Podcasts Instantly from Documents

    Speechify AI Podcast Publishing allows users to generate podcast episodes directly from written material. Articles, essays, reports, newsletters, homework assignments, and research documents can be converted into structured audio shows in seconds.

    Users can upload files or paste text into Speechify, and the system automatically transforms the material into a conversational or narrative podcast format.

    No microphones, studios, or recording sessions are required. Speechify handles scripting, voice generation, and formatting automatically, allowing users to move from written content to finished audio instantly.

    This approach makes podcast creation accessible to users who previously lacked the technical tools or time required to produce audio content.

    Multiple Podcast Styles

    AI Podcast Publishing supports multiple podcast formats designed for different types of content and audiences.

    • Users can generate podcasts in several styles including conversational podcast formats, late-night-show style discussions, debate formats with multiple viewpoints, and lecture formats designed for structured learning.
    • These formats allow users to present the same content in different listening styles depending on the intended audience. Instead of producing simple narration, Speechify generates structured shows designed to feel like real podcasts.

    Lifelike Voices Designed for Long Listening

    AI Podcast Publishing uses Speechify’s natural-sounding voice technology to produce podcasts that are clear and easy to follow over long listening sessions.

    The voices are optimized for comprehension and long-form listening, allowing users to consume written information in audio format comfortably. Listeners can adjust playback speed and follow along with synchronized text highlighting to improve understanding.

    Podcasts stay synchronized across devices, allowing users to start listening on desktop and continue on mobile without losing progress.

    Turn Any Document Into a Show

    Speechify AI Podcast Publishing allows users to transform almost any written material into a podcast episode.

    • Users commonly create podcasts from articles, newsletters, coursework, reports, and educational material.
    • Written content that would normally require hours of reading can be turned into a structured listening experience in seconds.

    This allows individuals, teams, and organizations to distribute information in a format that fits modern listening habits.

    From Reading to Publishing

    Speechify began as a text to speech platform focused on reading productivity. AI Podcast Publishing expands that foundation into content creation and distribution.

    Instead of simply listening to documents individually, users can now generate shareable podcast experiences from the same material. Written information becomes instantly listenable and distributable without traditional production barriers.

    “AI Podcast Publishing makes it possible for anyone to become a creator,” said Cliff Weitzman, founder and CEO of Speechify. “You can turn documents, articles, or ideas into structured podcasts instantly without recording or editing. Speechify removes the barriers between written ideas and spoken content.”

    Designed for Listening Anywhere

    AI Podcast Publishing works across Speechify’s platform including web, desktop, and mobile devices. Users can create podcasts on one device and listen anywhere with automatic synchronization.

    Speechify AI Podcasts can be generated in multiple languages using Speechify’s voice models, allowing creators to distribute spoken content globally.

    The feature is available through Speechify’s apps and web platform.


  • Feb 15, 2026
    • Date parsed from source:
      Feb 15, 2026
    • First seen by Releasebot:
      Feb 17, 2026

    Speechify

    Aakash Gupta Adds Speechify to His Bundle

    Aakash's Bundle now includes a free year of Speechify Premium, adding voice AI as a productivity boost. Real world use cases show dictating emails, PRD listening, and turning research docs into audio. A new bundled feature that makes Speechify available to more users.

    A free year of Speechify Premium ($29/mo) is now part of Aakash's Bundle

    Aakash Gupta has been testing a lot of AI tools lately.
    Some are overhyped. Some solve problems nobody has. And some genuinely change how you work.
    Voice AI falls into that last category.

    His goal with Aakash’s Bundle is to give you all the AI tools you need to succeed at your job.
    So he's excited to announce that he's added a great voice AI to Aakash’s Bundle: Speechify.

    For the past few months, he's been using it:

    • Dictating emails while walking to meetings
    • Listening to PRDs during his morning routine
    • Converting research docs into audio for his commute

    And it’s been a game changer. Now you can have access too.

  • Feb 13, 2026
    • Date parsed from source:
      Feb 13, 2026
    • First seen by Releasebot:
      Feb 28, 2026

    Speechify

    Speechify Launches AI Podcast Publishing

    Speechify launches AI Podcast Publishing, turning documents into podcast episodes with various styles and lifelike voices. Create, publish, and sync across devices without studios or editing, with multilingual support for global reach.

    Speechify AI Podcast Publishing

    Create podcasts instantly from documents using Speechify. Transform PDFs, articles, and text into dynamic audio content with AI voices.

    Speechify announced the launch of AI Podcast Publishing, a new capability that allows users to create and publish natural-sounding AI podcasts instantly from documents, articles, and written content. The new system allows anyone to transform text into structured podcast-style audio without recording equipment, editing software, or production workflows.

    AI Podcast Publishing expands Speechify’s AI Podcast technology from a listening feature into a publishing platform. Users can upload documents or enter prompts, and Speechify automatically generates podcast-style audio that can be listened to across devices and shared with others.

    The launch reflects Speechify’s broader vision of turning written information into spoken experiences that can be consumed anywhere.

    Create Podcasts Instantly from Documents

    Speechify AI Podcast Publishing allows users to generate podcast episodes directly from written material. Documents such as articles, essays, reports, newsletters, and homework assignments can be converted into structured audio shows in seconds.

    Users can upload files or paste text into Speechify, and the system automatically transforms the material into a conversational or narrative podcast format.

    No microphones, studios, or recording sessions are required. Speechify handles scripting, voice generation, and formatting automatically.

    This approach makes podcast creation accessible to users who previously lacked the technical tools or time required to produce audio content.

    Multiple Podcast Styles

    AI Podcast Publishing supports multiple podcast formats designed for different types of content and audiences.

    Available styles include:

    • Podcast-style conversations with engaging dialogue
    • Late night show formats with dynamic exchanges
    • Debate formats presenting multiple viewpoints
    • Lecture formats designed for structured learning

    These formats allow users to present the same content in different listening styles depending on the intended audience.

    Instead of generating simple narration, Speechify produces structured shows designed to feel like real podcasts.

    Lifelike Voices Designed for Long Listening

    AI Podcast Publishing uses Speechify’s natural-sounding voice technology to produce podcasts that are clear and easy to follow over long listening sessions.

    The voices are optimized for comprehension and long-form listening, allowing users to consume written information in audio format without fatigue.

    Listeners can adjust playback speed and follow along with synchronized text highlighting, improving comprehension and retention.

    Podcasts can be started on one device and continued on another, with automatic synchronization across desktop and mobile platforms.

    Turn Any Document into a Show

    Speechify AI Podcast Publishing allows users to transform almost any written material into a podcast episode.

    Common use cases include:

    • Listening to articles as podcasts
    • Turning newsletters into daily shows
    • Converting school assignments into audio
    • Publishing educational lectures
    • Sharing research summaries
    • Creating creator-style content

    This allows individuals and organizations to distribute information in a format that fits modern listening habits.

    From Reading to Publishing

    Speechify began as a text to speech platform focused on reading productivity. AI Podcast Publishing expands that model into content creation and distribution.

    Instead of simply listening to documents individually, users can now generate shareable podcast experiences from the same material.

    Speechify describes AI Podcast Publishing as part of a broader shift toward audio-first information consumption, where written content becomes instantly listenable and distributable.

    “AI Podcast Publishing makes it possible for anyone to become a creator,” said Cliff Weitzman, founder and CEO of Speechify. “You can turn documents, articles, or ideas into structured podcasts instantly without recording or editing. Speechify removes the barriers between written ideas and spoken content.”

    Designed for Listening Anywhere

    AI Podcast Publishing works across Speechify’s platform, including web, desktop, and mobile devices. Users can create podcasts on one device and listen anywhere with automatic synchronization.

    This allows creators and listeners to move seamlessly between environments without losing progress.

    Speechify AI Podcasts can be generated in multiple languages using Speechify’s voice models, making it possible to distribute spoken content globally.

    The feature is available starting today through Speechify’s apps and web platform.


  • Feb 13, 2026
    • Date parsed from source:
      Feb 13, 2026
    • First seen by Releasebot:
      Feb 14, 2026

    Speechify

    Speechify's Voice AI Research Lab Launches SIMBA 3.0 Voice Model to Power Next Generation of Voice AI

    Speechify unveils SIMBA 3.0, a production voice AI model now in early access for developers via the Speechify Voice API with GA planned for March 2026, delivering high quality TTS, STT and low latency for production-ready voice workflows.

    Speechify’s AI Research Lab launches SIMBA 3.0, a production voice model powering next-gen text-to-speech and voice AI for developers.

    Speechify is announcing the early rollout of SIMBA 3.0, its latest generation of production voice AI models, now available to select third-party developers through the Speechify Voice API, with full general availability planned for March 2026. Built by the Speechify AI Research Lab, SIMBA 3.0 delivers high-quality text-to-speech, speech-to-text, and speech-to-speech capabilities that developers can integrate directly into their own products and platforms.

    Speechify is not a voice interface layered on top of other companies' AI. It operates its own AI Research Lab dedicated to building proprietary voice models. These models are sold to third-party developers and companies through the Speechify API for integration into any application, from AI receptionists and customer support bots to content platforms and accessibility tools.

    Speechify uses these same models to power its own consumer products while providing developers access through the Speechify Voice API. This matters because the quality, latency, cost, and long-term direction of Speechify's voice models are controlled by its own research team rather than by outside vendors.

    Speechify's voice models are purpose-built for production voice workloads and deliver best-in-class model quality at scale. Third-party developers access SIMBA 3.0 and Speechify voice models directly through the Speechify Voice API, with production REST endpoints, full API documentation, developer quickstart guides, and officially supported Python and TypeScript SDKs. The Speechify developer platform is designed for fast integration, production deployment, and scalable voice infrastructure, enabling teams to move from first API call to live voice features quickly.
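    As a sketch of what a raw integration might look like, the snippet below builds and sends a text-to-speech request over HTTP. The endpoint URL, payload field names, and voice ID are illustrative placeholders, not Speechify's actual contract; the real request shape is defined by the official API documentation and SDKs.

```python
import json
import urllib.request

# Placeholder endpoint and payload fields for illustration only;
# consult Speechify's API documentation for the real contract.
API_URL = "https://api.example.com/v1/text-to-speech"

def build_tts_payload(text: str, voice: str = "simba-en", audio_format: str = "mp3") -> dict:
    """Assemble the JSON body for a synthesis request."""
    return {"input": text, "voice": voice, "audio_format": audio_format}

def synthesize(text: str, api_key: str) -> bytes:
    """POST the payload and return the raw audio bytes."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_tts_payload(text)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return resp.read()
```

    In practice, the officially supported Python or TypeScript SDKs would replace this hand-rolled HTTP call.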

    This article explains what SIMBA 3.0 is, what the Speechify AI Research Lab builds, and why Speechify delivers top-tier voice AI model quality, low latency, and strong cost efficiency for production developer workloads, positioning it as the leading voice AI provider ahead of OpenAI, Gemini, Anthropic, ElevenLabs, Cartesia, and Deepgram.

    What Does It Mean to Call Speechify an AI Research Lab?

    An Artificial Intelligence lab is a dedicated research and engineering organization where specialists in machine learning, data science, and computational modeling work together to design, train, and deploy advanced intelligent systems. When people say "AI Research Lab," they usually mean an organization that does two things at the same time:

    1. Develops and trains its own models
    2. Makes those models available to developers through production APIs and SDKs

    Some organizations are great at models but do not make them available to outside developers. Others provide APIs but rely mostly on third-party models. Speechify operates a vertically integrated voice AI stack. It builds its own voice AI models and makes them available to third-party developers through production APIs, while also using them inside its own consumer applications to validate model performance at scale.

    The Speechify AI Research Lab is an in-house research organization focused on voice intelligence. Its mission is to advance text-to-speech, automatic speech recognition, and speech-to-speech systems so that developers can build voice-first applications across any use case, from AI receptionists and voice agents to narration engines and accessibility tools.

    A real voice AI research lab typically has to solve:

    • Text to speech quality and naturalness for production deployment
    • Speech-to-text and ASR accuracy across accents and noise conditions
    • Real-time latency for conversational turn-taking in AI agents
    • Long-form stability for extended listening experiences
    • Document understanding for processing PDFs, web pages, and structured content
    • OCR and page parsing for scanned documents and images
    • A product feedback loop that improves models over time
    • Developer infrastructure that exposes voice capabilities through APIs and SDKs

    Speechify's AI Research Lab builds these systems as a unified architecture and makes them accessible to developers through the Speechify Voice API, available for third-party integration across any platform or application.

    What Is SIMBA 3.0?

    SIMBA is Speechify's proprietary family of voice AI models; it powers Speechify's own products and is sold to third-party developers through the Speechify API. SIMBA 3.0 is the latest generation, optimized for voice-first performance, speed, and real-time interaction, and available for third-party developers to integrate into their own platforms.

    SIMBA 3.0 is engineered to deliver high-end voice quality, low-latency response, and long-form listening stability at production scale, enabling developers to build professional voice applications across industries.

    For third-party developers, SIMBA 3.0 enables use cases including:
    • AI voice agents and conversational AI systems
    • Customer support automation and AI receptionists
    • Outbound calling systems for sales and service
    • Voice assistants and speech-to-speech applications
    • Content narration and audiobook generation platforms
    • Accessibility tools and assistive technology
    • Educational platforms with voice-driven learning
    • Healthcare applications requiring empathetic voice interaction
    • Multilingual translation and communication apps
    • Voice-enabled IoT and automotive systems

    When users say a voice "sounds human," they are describing multiple technical elements working together:
    • Prosody (rhythm, pitch, stress)
    • Meaning-aware pacing
    • Natural pauses
    • Stable pronunciation
    • Intonation shifts aligned with syntax
    • Emotional neutrality when appropriate
    • Expressiveness when helpful

    SIMBA 3.0 is the model layer that developers integrate to make voice experiences feel natural at high speed, across long sessions, and across many content types. For production voice workloads, from AI phone systems to content platforms, SIMBA 3.0 is optimized to outperform general-purpose voice layers.

    Real-World Developer Use Cases for Speechify Voice Models

    Speechify's voice models power production applications across diverse industries. Here are real examples of how third-party developers are using the Speechify API:

    MoodMesh: Emotionally Intelligent Wellness Applications
    MoodMesh, a wellness technology company, integrated the Speechify Text-to-Speech API to deliver emotionally nuanced speech for guided meditations and compassionate conversations. By leveraging Speechify's SSML support and emotion control features, MoodMesh adjusts tone, cadence, volume, and speech speed to match users' emotional contexts, creating human-like interactions that standard TTS couldn't deliver. This demonstrates how developers use Speechify models to build sophisticated applications requiring emotional intelligence and contextual awareness.

    AnyLingo: Multilingual Communication and Translation
    AnyLingo, a real-time translation messenger app, uses Speechify's voice cloning API to enable users to send voice messages in a cloned version of their own voice, translated into the recipient's language with proper inflection, tone, and context. The integration allows business professionals to communicate across languages efficiently, while maintaining the personal touch of their own voice. AnyLingo's founder notes that Speechify's emotion control features ("Moods") are key differentiators, enabling messages that match the appropriate emotional tone for any situation.

    Additional Third-Party Developer Use Cases:

    Conversational AI and Voice Agents
    Developers building AI receptionists, customer support bots, and sales call automation systems use Speechify's low-latency speech-to-speech models to create natural-sounding voice interactions. With sub-250ms latency and voice cloning capabilities, these applications can scale to millions of simultaneous phone calls while maintaining voice quality and conversational flow.

    Content Platforms and Audiobook Generation
    Publishers, authors, and educational platforms integrate Speechify models to convert written content into high-quality narration. The models' optimization for long-form stability and high-speed playback clarity makes them ideal for generating audiobooks, podcast content, and educational materials at scale.

    Accessibility and Assistive Technology
    Developers building tools for vision-impaired users or individuals with reading disabilities rely on Speechify's document understanding capabilities, including PDF parsing, OCR, and web page extraction, to ensure voice output preserves structure and comprehension across complex documents.

    Healthcare and Therapeutic Applications
    Medical platforms and therapeutic applications use Speechify's emotion control and prosody features to deliver empathetic, contextually appropriate voice interactions, a capability critical for patient communication, mental health support, and wellness applications.

    How Does SIMBA 3.0 Perform on Independent Voice Model Leaderboards?

    Independent benchmarking matters in voice AI because short demos can hide performance gaps. One of the most widely referenced third-party benchmarks is the Artificial Analysis Speech Arena leaderboard, which evaluates text-to-speech models using large-scale blind listening comparisons and ELO scoring.

    Speechify's SIMBA voice models rank above multiple major providers on the Artificial Analysis Speech Arena leaderboard, including Microsoft Azure Neural, Google TTS models, Amazon Polly variants, NVIDIA Magpie, and several open-weight voice systems.

    Rather than relying on curated examples, Artificial Analysis uses repeated head-to-head listener preference testing across many samples. This ranking reinforces that SIMBA 3.0 outperforms widely deployed commercial voice systems, winning on model quality in real listening comparisons and establishing it as the best production-ready choice for developers building voice-enabled applications.

    Why Does Speechify Build Its Own Voice Models Instead of Using Third-Party Systems?

    Control over the model means control over:
    • Quality
    • Latency
    • Cost
    • Roadmap
    • Optimization priorities

    When companies like Retell or Vapi.ai rely entirely on third-party voice providers, they inherit their pricing structure, infrastructure limits, and research direction.

    By owning its full stack, Speechify can:
    • Tune prosody for specific use cases (conversational AI vs. long-form narration)
    • Optimize latency below 250ms for real-time applications
    • Integrate ASR and TTS seamlessly in speech-to-speech pipelines
    • Reduce cost to $10 per 1M characters (compared with ElevenLabs at approximately $200 per 1M characters)
    • Ship model improvements continuously based on production feedback
    • Align model development with developer needs across industries

    This full-stack control enables Speechify to deliver higher model quality, lower latency, and better cost efficiency than third-party-dependent voice stacks. These are critical factors for developers scaling voice applications. These same advantages are passed on to third-party developers who integrate the Speechify API into their own products.

    Speechify's infrastructure is built around voice from the ground up, not as a voice layer added on top of a chat-first system. Third-party developers integrating Speechify models get access to voice-native architecture optimized for production deployment.
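    To make the cost gap concrete, here is a worked comparison using the per-character rates quoted above. The 50M-character monthly volume is a hypothetical example, and actual pricing should be confirmed against each provider's current pricing page.

```python
# Rates quoted above: USD per 1M characters (assumed list prices).
SPEECHIFY_PER_MILLION = 10.00
ELEVENLABS_PER_MILLION = 200.00  # approximate

def monthly_cost(chars_per_month: int, rate_per_million: float) -> float:
    """Synthesis cost in USD for a given monthly character volume."""
    return chars_per_month / 1_000_000 * rate_per_million

# Hypothetical app synthesizing 50M characters per month:
chars = 50_000_000
print(monthly_cost(chars, SPEECHIFY_PER_MILLION))   # 500.0
print(monthly_cost(chars, ELEVENLABS_PER_MILLION))  # 10000.0
```

    At this volume the quoted rates differ by a factor of twenty, which is why per-character pricing dominates the economics of high-volume voice applications.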

    How Does Speechify Support On-Device Voice AI and Local Inference?

    Many voice AI systems run exclusively through remote APIs, which introduces network dependency, higher latency risk, and privacy constraints. Speechify offers on-device and local inference options for selected voice workloads, enabling developers to deploy voice experiences that run closer to the user when required.

    Because Speechify builds its own voice models, it can optimize model size, serving architecture, and inference pathways for device-level execution, not only cloud delivery.

    On-device and local inference supports:
    • Lower and more consistent latency in variable network conditions
    • Greater privacy control for sensitive documents and dictation
    • Offline or degraded-network usability for core workflows
    • More deployment flexibility for enterprise and embedded environments

    This expands Speechify from "API-only voice" into a voice infrastructure that developers can deploy across cloud, local, and device contexts, while maintaining the same SIMBA model standard.

    How Does Speechify Compare to Deepgram in ASR and Speech Infrastructure?

    Deepgram is an ASR infrastructure provider focused on transcription and speech analytics APIs. Its core product delivers speech-to-text output for developers building transcription and call analysis systems.

    Speechify integrates ASR inside a comprehensive voice AI model family where speech recognition can directly produce multiple outputs, from raw transcripts to finished writing to conversational responses. Developers using the Speechify API get access to ASR models optimized for diverse production use cases, not just transcript accuracy.

    Speechify's ASR and dictation models are optimized for:
    • Finished writing output quality with punctuation and paragraph structure
    • Filler word removal and sentence formatting
    • Draft-ready text for emails, documents, and notes
    • Voice typing that produces clean output with minimal post-processing
    • Integration with downstream voice workflows (TTS, conversation, reasoning)

    In the Speechify platform, ASR connects to the full voice pipeline. Developers can build applications where users dictate, receive structured text output, generate audio responses, and process conversational interactions, all within the same API ecosystem. This reduces integration complexity and accelerates development.

    Deepgram provides a transcription layer. Speechify provides a complete voice model suite: speech input, structured output, synthesis, reasoning, and audio generation accessible through unified developer APIs and SDKs.

    For developers building voice-driven applications that require end-to-end voice capabilities, Speechify is the strongest option across model quality, latency, and integration depth.

    How Does Speechify Compare to OpenAI, Gemini, and Anthropic in Voice AI?

    Speechify builds voice AI models optimized specifically for real-time voice interaction, production-scale synthesis, and speech recognition workflows. Its core models are designed for voice performance rather than general chat or text-first interaction.

    Speechify's specialization is voice AI model development, and SIMBA 3.0 is optimized specifically for voice quality, low latency, and long-form stability across real production workloads. SIMBA 3.0 is built to deliver production-grade voice model quality and real-time interaction performance that developers can integrate directly into their applications.

    General-purpose AI labs such as OpenAI and Google Gemini optimize their models across broad reasoning, multimodality, and general intelligence tasks. Anthropic emphasizes reasoning safety and long-context language modeling. Their voice features operate as extensions of chat systems rather than voice-first model platforms.

    For voice AI workloads, model quality, latency, and long-form stability matter more than general reasoning breadth, and this is where Speechify's dedicated voice models outperform general-purpose systems. Developers building AI phone systems, voice agents, narration platforms, or accessibility tools need voice-native models, not voice layers on top of chat models.

    ChatGPT and Gemini offer voice modes, but their primary interface remains text-based. Voice functions as an input and output layer on top of chat. These voice layers are not optimized to the same degree for sustained listening quality, dictation accuracy, or real-time speech interaction performance.

    Speechify is built voice-first at the model level. Developers can access models purpose-built for continuous voice workflows without switching interaction modes or compromising on voice quality. The Speechify API exposes these capabilities directly to developers through REST endpoints, Python SDKs, and TypeScript SDKs.

    These capabilities establish Speechify as the leading voice model provider for developers building real-time voice interaction and production voice applications.

    Within voice AI workloads, SIMBA 3.0 is optimized for:
    • Prosody in long-form narration and content delivery
    • Speech-to-speech latency for conversational AI agents
    • Dictation-quality output for voice typing and transcription
    • Document-aware voice interaction for processing structured content

    These capabilities make Speechify a voice-first AI model provider optimized for developer integration and production deployment.

    What Are the Core Technical Pillars of Speechify's AI Research Lab?

    Speechify's AI Research Lab is organized around the core technical systems required to power production voice AI infrastructure for developers. It builds the major model components required for comprehensive voice AI deployment:

    • TTS models (speech generation) - Available via API
    • STT & ASR models (speech recognition) - Integrated in the voice platform
    • Speech-to-speech (real-time conversational pipelines) - Low-latency architecture
    • Page parsing and document understanding - For processing complex documents
    • OCR (image to text) - For scanned documents and images
    • LLM-powered reasoning and conversation layers - For intelligent voice interactions
    • Infrastructure for low-latency inference - Sub-250ms response times
    • Developer API tooling and cost-optimized serving - Production-ready SDKs

    Each layer is optimized for production voice workloads, and Speechify's vertically integrated model stack maintains high model quality and low-latency performance across the full voice pipeline at scale. Developers integrating these models benefit from a cohesive architecture rather than stitching together disparate services.

    Each of these layers matters. If any layer is weak, the overall voice experience feels weak. Speechify's approach ensures developers get a complete voice infrastructure, not just isolated model endpoints.

    What Role Do STT and ASR Play in the Speechify AI Research Lab?

    Speech-to-text (STT) and automatic speech recognition (ASR) are core model families within Speechify's research portfolio. They power developer use cases including:

    • Voice typing and dictation APIs
    • Real-time conversational AI and voice agents
    • Meeting intelligence and transcription services
    • Speech-to-speech pipelines for AI phone systems
    • Multi-turn voice interaction for customer support bots

    Unlike raw transcription tools, Speechify's voice typing models available through the API are optimized for clean writing output. They:

    • Insert punctuation automatically
    • Structure paragraphs intelligently
    • Remove filler words
    • Improve clarity for downstream use
    • Support writing across applications and platforms

    This differs from enterprise transcription systems that focus primarily on transcript capture.

    Speechify's ASR models are tuned for finished output quality and downstream usability, so speech input produces draft-ready content rather than cleanup-heavy transcripts. This is critical for developers building productivity tools, voice assistants, or AI agents that need to act on spoken input.
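    The kind of cleanup described above can be made concrete with a small sketch. This is not Speechify's implementation, which performs this work in-model; a simple regex pass only illustrates the filler-removal idea:

```python
import re

# Filler words to strip; a real dictation model handles far more context.
FILLERS = ["basically", "you know", "uh", "um"]

def clean_transcript(raw: str) -> str:
    """Remove common filler words, collapse whitespace, capitalize."""
    text = raw
    for filler in FILLERS:
        # Match the filler as a whole word, plus any trailing comma/space.
        text = re.sub(rf"\b{re.escape(filler)}\b,?\s*", "", text,
                      flags=re.IGNORECASE)
    text = re.sub(r"\s+", " ", text).strip()
    return text[:1].upper() + text[1:]

print(clean_transcript("um so the report is uh basically ready"))
# -> So the report is ready
```

    A model-based approach additionally restructures paragraphs and rewrites for clarity, which rule-based passes like this cannot do.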

    What Makes TTS "High Quality" for Production Use Cases?

    Most people judge TTS quality by whether it sounds human. Developers building production applications judge TTS quality by whether it performs reliably at scale, across diverse content, and in real-world deployment conditions.

    High-quality production TTS requires:
    • Clarity at high speed for productivity and accessibility applications
    • Low distortion at faster playback rates
    • Pronunciation stability for domain-specific terminology
    • Listening comfort over long sessions for content platforms
    • Control over pacing, pauses, and emphasis via SSML support
    • Robust multilingual output across accents and languages
    • Consistent voice identity across hours of audio
    • Streaming capability for real-time applications

    Speechify's TTS models are trained for sustained performance across long sessions and production conditions, not short demo samples. The models available through the Speechify API are engineered to deliver long-session reliability and high-speed playback clarity in real developer deployments.

    Developers can test voice quality directly by following the Speechify quickstart guide and running their own content through production-grade voice models.
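    The SSML control listed above (pacing, pauses, emphasis) follows the W3C Speech Synthesis Markup Language. The fragment below is generic SSML; which elements Speechify's API honors should be verified against its documentation:

```python
# Generic SSML (W3C Speech Synthesis Markup Language) fragment showing
# pacing, pause, and emphasis control. Supported elements vary by
# provider; verify against the Speechify API documentation.
ssml = """\
<speak>
  <prosody rate="110%">
    Quarterly results are in.
  </prosody>
  <break time="400ms"/>
  Revenue grew <emphasis level="strong">eighteen percent</emphasis> year over year.
</speak>"""

print(ssml)
```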

    Why Are Page Parsing and OCR Core to Speechify's Voice AI Models?

    Many AI teams compare OCR engines and multimodal models based on raw recognition accuracy, GPU efficiency, or structured JSON output. Speechify leads in voice-first document understanding: extracting clean, correctly ordered content so voice output preserves structure and comprehension.

    Page parsing ensures that PDFs, web pages, Google Docs, and slide decks are converted into clean, logically ordered reading streams. Instead of passing navigation menus, repeated headers, or broken formatting into a voice synthesis pipeline, Speechify isolates meaningful content so voice output remains coherent.

    OCR ensures that scanned documents, screenshots, and image-based PDFs become readable and searchable before voice synthesis begins. Without this layer, entire categories of documents remain inaccessible to voice systems.

    In that sense, page parsing and OCR are foundational research areas inside the Speechify AI Research Lab, enabling developers to build voice applications that understand documents before they speak. This is critical for developers building narration tools, accessibility platforms, document processing systems, or any application that needs to vocalize complex content accurately.

    What Are TTS Benchmarks That Matter for Production Voice Models?

    In voice AI model evaluation, benchmarks commonly include:
    • MOS (mean opinion score) for perceived naturalness
    • Intelligibility scores (how easily words are understood)
    • Word accuracy in pronunciation for technical and domain-specific terms
    • Stability across long passages (no drift in tone or quality)
    • Latency (time to first audio, streaming behavior)
    • Robustness across languages and accents
    • Cost efficiency at production scale

    Speechify benchmarks its models based on production deployment reality:
    • How does the voice perform at 2x, 3x, 4x speed?
    • Does it remain comfortable when reading dense technical text?
    • Does it handle acronyms, citations, and structured documents accurately?
    • Does it keep paragraph structure clear in audio output?
    • Can it stream audio in real-time with minimal latency?
    • Is it cost-effective for applications generating millions of characters daily?

    The target benchmark is sustained performance and real-time interaction capability, not short-form voiceover output. Across these production benchmarks, SIMBA 3.0 is engineered to lead at real-world scale.

    Independent benchmarking supports this performance profile. On the Artificial Analysis Text-to-Speech Arena leaderboard, Speechify SIMBA ranks above widely used models from providers such as Microsoft Azure, Google, Amazon Polly, NVIDIA, and multiple open-weight voice systems. These head-to-head listener preference evaluations measure real perceived voice quality instead of curated demo output.

    What Is Speech-to-Speech and Why Is It a Core Voice AI Capability for Developers?

    Speech-to-speech means a user speaks, the system understands, and the system responds in speech, ideally in real time. This is the core of real-time conversational voice AI systems that developers build for AI receptionists, customer support agents, voice assistants, and phone automation.

    Speech-to-speech systems require:
    • Fast ASR (speech recognition)
    • A reasoning system that can maintain conversation state
    • TTS that can stream quickly
    • Turn-taking logic (when to start talking, when to stop)
    • Interruptibility (barge-in handling)
    • Latency targets that feel human (sub-250ms)

    Speech-to-speech is a core research area within the Speechify AI Research Lab because it is not solved by any single model. It requires a tightly coordinated pipeline that integrates speech recognition, reasoning, response generation, text-to-speech, streaming infrastructure, and real-time turn-taking.

    Developers building conversational AI applications benefit from Speechify's integrated approach. Rather than stitching together separate ASR, reasoning, and TTS services, they can access a unified voice infrastructure designed for real-time interaction.
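    The turn-taking and barge-in requirements above can be sketched as a small state machine. This is a simplified illustration of the control logic, not Speechify's pipeline; the class and event names are hypothetical:

```python
from enum import Enum, auto

class Turn(Enum):
    LISTENING = auto()   # user may speak; ASR is active
    SPEAKING = auto()    # agent TTS is streaming out

class TurnTaker:
    """Minimal turn-taking controller with barge-in handling (illustrative)."""

    def __init__(self):
        self.state = Turn.LISTENING

    def on_user_speech_start(self):
        # Barge-in: if the agent is talking, stop TTS and yield the turn.
        if self.state is Turn.SPEAKING:
            self.stop_tts()
        self.state = Turn.LISTENING

    def on_agent_response_ready(self):
        self.state = Turn.SPEAKING

    def stop_tts(self):
        # In a real system this would cancel the outgoing audio stream.
        pass

tt = TurnTaker()
tt.on_agent_response_ready()
tt.on_user_speech_start()   # user interrupts: agent yields the turn
print(tt.state)             # Turn.LISTENING
```

    Production systems layer voice-activity detection and end-of-turn prediction on top of a skeleton like this to decide when each event actually fires.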

    Why Does Latency Under 250ms Matter for Developer Applications?

    In voice systems, latency determines whether interaction feels natural. Developers building conversational AI applications need models that can:
    • Begin responding quickly
    • Stream speech smoothly
    • Handle interruptions
    • Maintain conversational timing

    Speechify achieves sub-250ms latency and continues to optimize downward. Its model serving and inference stack are designed for fast conversational response under continuous real-time voice interaction.
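    Time-to-first-audio, the figure behind the sub-250ms target, can be measured the same way in any streaming client. The stream below is a simulated stand-in, not a Speechify API:

```python
import time
from typing import Iterator

def fake_tts_stream() -> Iterator[bytes]:
    """Stand-in for a streaming TTS response (first chunk after ~50 ms)."""
    time.sleep(0.05)
    yield b"\x00" * 3200   # first audio chunk
    yield b"\x00" * 3200

def time_to_first_audio(stream: Iterator[bytes]) -> float:
    """Return seconds elapsed until the first audio chunk arrives."""
    start = time.perf_counter()
    next(stream)           # block until the first chunk
    return time.perf_counter() - start

ttfa = time_to_first_audio(fake_tts_stream())
print(f"time to first audio: {ttfa * 1000:.0f} ms")
```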

    Low latency supports critical developer use cases:
    • Natural speech-to-speech interaction in AI phone systems
    • Real-time comprehension for voice assistants
    • Interruptible voice dialogue for customer support bots
    • Seamless conversational flow in AI agents

    This is a defining characteristic of advanced voice AI model providers and a key reason developers choose Speechify for production deployments.

    What Does "Voice AI Model Provider" Mean?

    A voice AI model provider is not just a voice generator. It is a research organization and infrastructure platform that delivers:
    • Production-ready voice models accessible via APIs
    • Speech synthesis (text-to-speech) for content generation
    • Speech recognition (speech-to-text) for voice input
    • Speech-to-speech pipelines for conversational AI
    • Document intelligence for processing complex content
    • Developer APIs and SDKs for integration
    • Streaming capabilities for real-time applications
    • Voice cloning for custom voice creation
    • Cost-efficient pricing for production-scale deployment

    Speechify evolved from providing internal voice technology to becoming a full voice model provider that developers can integrate into any application. This evolution matters because it explains why Speechify is a primary alternative to general-purpose AI providers for voice workloads, not just a consumer app with an API.

    Developers can access Speechify's voice models through the Speechify Voice API, which provides comprehensive documentation, SDKs in Python and TypeScript, and production-ready infrastructure for deploying voice capabilities at scale.
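    As a rough illustration of what an integration looks like, the sketch below assembles and sends a text-to-speech request over REST. The endpoint URL, field names, and response shape here are assumptions for illustration only; the authoritative signatures live in the Speechify API documentation and SDKs:

```python
import json
import urllib.request

# Hypothetical endpoint and field names -- verify against the official
# Speechify API documentation; only the request/response shape is shown.
API_URL = "https://api.example.invalid/v1/tts"

def build_tts_request(text: str, voice_id: str = "default") -> bytes:
    """Assemble the JSON body for a hypothetical text-to-speech request."""
    return json.dumps({"input": text, "voice_id": voice_id}).encode("utf-8")

def synthesize(text: str, api_key: str) -> bytes:
    """POST the request and return raw audio bytes (not invoked here)."""
    req = urllib.request.Request(
        API_URL,
        data=build_tts_request(text),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return resp.read()

# audio = synthesize("Hello from a voice-enabled app.", api_key="YOUR_KEY")
```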

    How Does the Speechify Voice API Strengthen Developer Adoption?

    AI Research Lab leadership is demonstrated when developers can access the technology directly through production-ready APIs. The Speechify Voice API delivers:
    • Access to Speechify's SIMBA voice models via REST endpoints
    • Python and TypeScript SDKs for rapid integration
    • A clear integration path for startups and enterprises to build voice features without training models
    • Comprehensive documentation and quickstart guides
    • Streaming support for real-time applications
    • Voice cloning capabilities for custom voice creation
    • 60+ language support for global applications
    • SSML and emotion control for nuanced voice output

    Cost efficiency is central here. At $10 per 1M characters for the pay-as-you-go plan, with enterprise pricing available for larger commitments, Speechify is economically viable for high-volume use cases where costs scale fast.

    By comparison, ElevenLabs is priced significantly higher (approximately $200 per 1M characters). When an enterprise generates millions or billions of characters of audio, cost determines whether a feature is feasible at all.
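    A back-of-the-envelope calculation using the per-character rates quoted above shows how quickly the difference compounds at volume:

```python
# Monthly cost comparison at the quoted rates: $10 vs ~$200 per 1M characters.
RATE_SPEECHIFY = 10.0    # USD per 1M characters (pay-as-you-go, as quoted)
RATE_ELEVENLABS = 200.0  # USD per 1M characters (approximate, as quoted)

def monthly_cost(chars_per_month: int, rate_per_million: float) -> float:
    """Cost in USD for a given monthly character volume."""
    return chars_per_month / 1_000_000 * rate_per_million

volume = 500_000_000  # e.g. 500M characters of audio per month
print(f"Speechify:  ${monthly_cost(volume, RATE_SPEECHIFY):,.0f}")   # $5,000
print(f"ElevenLabs: ${monthly_cost(volume, RATE_ELEVENLABS):,.0f}")  # $100,000
```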

    Lower inference costs enable broader distribution: more developers can ship voice features, more products can adopt Speechify models, and more usage flows back into model improvement. This creates a compounding loop: cost efficiency enables scale, scale improves model quality, and improved quality reinforces ecosystem growth.

    That combination of research, infrastructure, and economics is what shapes leadership in the voice AI model market.

    How Does the Product Feedback Loop Make Speechify's Models Better?

    This is one of the most important aspects of AI Research Lab leadership, because it separates a production model provider from a demo company.

    Speechify's deployment scale across millions of users provides a feedback loop that continuously improves model quality:
    • Which voices developers' end-users prefer
    • Where users pause and rewind (signals comprehension trouble)
    • Which sentences users re-listen to
    • Which pronunciations users correct
    • Which accents users prefer
    • How often users increase speed (and where quality breaks)
    • Dictation correction patterns (where ASR fails)
    • Which content types cause parsing errors
    • Real-world latency requirements across use cases
    • Production deployment patterns and integration challenges

    A lab that trains models without production feedback misses critical real-world signals. Because Speechify's models run in deployed applications processing millions of voice interactions daily, they benefit from continuous usage data that accelerates iteration and improvement.

    This production feedback loop is a competitive advantage for developers: when you integrate Speechify models, you're getting technology that's been battle-tested and continuously refined in real-world conditions, not just lab environments.

    How Does Speechify Compare to ElevenLabs, Cartesia, and Fish Audio?

    Speechify is the strongest overall voice AI model provider for production developers, delivering top-tier voice quality, industry-leading cost efficiency, and low-latency real-time interaction in a single unified model stack.

    Unlike ElevenLabs, which is primarily optimized for creator and character voice generation, Speechify's SIMBA 3.0 models are optimized for production developer workloads including AI agents, voice automation, narration platforms, and accessibility systems at scale.

    Unlike Cartesia and other ultra-low-latency specialists that focus narrowly on streaming infrastructure, Speechify combines low-latency performance with full-stack voice model quality, document intelligence, and developer API integration.

    Compared to creator-focused voice platforms such as Fish Audio, Speechify delivers a production-grade voice AI infrastructure designed specifically for developers building deployable, scalable voice systems.

    SIMBA 3.0 models are optimized to win on all the dimensions that matter at production scale:
    • Voice quality that ranks above major providers on independent benchmarks
    • Cost efficiency at $10 per 1M characters (compared to ElevenLabs at approximately $200 per 1M characters)
    • Latency under 250ms for real-time applications
    • Seamless integration with document parsing, OCR, and reasoning systems
    • Production-ready infrastructure for scaling to millions of requests

    Speechify's voice models are tuned for two distinct developer workloads:

    1. Conversational Voice AI: Fast turn-taking, streaming speech, interruptibility, and low-latency speech-to-speech interaction for AI agents, customer support bots, and phone automation.
    2. Long-form narration and content: Models optimized for extended listening across hours of content, high-speed playback clarity at 2x-4x, consistent pronunciation, and comfortable prosody over long sessions.

    Speechify also pairs these models with document intelligence (page parsing and OCR) and a developer API designed for production deployment. The result is voice AI infrastructure built for developer-scale usage, not demo systems.

    Why Does SIMBA 3.0 Define Speechify's Role in Voice AI in 2026?

    SIMBA 3.0 represents more than a model upgrade. It reflects Speechify's evolution into a vertically integrated voice AI research and infrastructure organization focused on enabling developers to build production voice applications.

    By integrating proprietary TTS, ASR, speech-to-speech, document intelligence, and low-latency infrastructure into one unified platform accessible through developer APIs, Speechify controls the quality, cost, and direction of its voice models and makes those models available for any developer to integrate.

    In 2026, voice is no longer a feature layered onto chat models. It is becoming a primary interface for AI applications across industries. SIMBA 3.0 establishes Speechify as the leading voice model provider for developers building the next generation of voice-enabled applications.
