Fish Audio Release Notes

Last updated: Dec 23, 2025

Get this feed: RSS Email API CSV MCP Slack n8n Zapier

Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

Fish Audio S1
Fish Audio Overview

Historic rebrand from Fish Speech to Fish Audio. #1 ranking on TTS-Arena2 with industry-leading performance.

S1 (4B params): 0.008 WER, 0.004 CER - Available on Fish Audio Playground

S1-mini (0.5B params): 0.011 WER, 0.005 CER - Open source on Hugging Face

48+ emotional expressions with RLHF integration and multilingual support for English, Chinese, Japanese, and more.

Read More about S1
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.5.1

Fixed critical PyTorch security settings and improved inference speed significantly. Added ONNX export support for better deployment options and enhanced text processing for Arabic and Hebrew languages. Includes bug fixes for Apple Silicon (MPS) compatibility and reorganized library structure for cleaner codebase.
Original source Report a problem
All of your release notes in one feed

Join Releasebot and get updates from Fish Audio and hundreds of other software products.

Create account
Get updates with: RSS Email API CSV MCP Slack n8n Zapier
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.5.0
Release Notes

Introduced v1.5 model architecture with improved dataset handling and bearer token authentication for APIs.

Added reference audio caching by hash for faster performance and better Apple Silicon support. Includes OpenAPI documentation refactoring and base64 reference data support in JSON format.

Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.4.2

Documentation-focused release with comprehensive updates for v1.4, macOS support, and multiple language translations.

Improved Docker support and API enhancements for JSON format handling. Added audio selection to WebUI and fixed various stability issues including cache handling and backend performance.
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.2.1

Replaced Whisper with SenseVoice for better ASR and added native Apple Silicon support.
Includes Portuguese (Brazil) localization, streaming audio functionality, and CPU-only inference improvements. Pinned PyTorch to 2.3.1 to fix inference speed issues and aligned API with official closed-source version.
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.2

Introduced auto-reranking system for better results along with bilingual support and model quantization. Replaced standard Whisper with Faster Whisper for improved speed and added Japanese documentation. Enhanced model stability and inference performance with optimized v1.2 architecture.
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.1.1
Breaking changes

Replaced zibai with uvicorn for API server, new text-splitter with byte-based length calculation, and license change to CC-BY-NC-SA 4.0.

Added

Apple Silicon (MPS) support

Windows one-click installation

automatic model downloading with resume capability

Improved WebUI with better file selection and download progress indicators

Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.1.0

Release notes

Added VITS decoder integration with full streaming support and queue management for real-time audio generation.

Introduced internationalization (i18n) with Spanish translation and improved Windows packaging. Optimized GPU memory usage and CPU-only inference performance while adding LoRA support to the Gradio UI.
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v1.0.0

Major milestone release introducing new VQ-GAN architecture with VITS decoder support, LoRA fine-tuning, and streaming inference capabilities.

Breaking changes include removal of the Rust-based data server, new tokenizer replacing phonemizer, and updated model architecture (VQ + DiT + Reflow). Achieved 4x memory reduction during loading and added WebUI for training and annotation.
Original source Report a problem
Nov 11, 2025
- Date parsed from source:
  Nov 11, 2025
- First seen by Releasebot:
  Dec 23, 2025
Fish Audio

v0.2.0

Release notes

First public release of Fish Speech featuring a complete text-to-speech pipeline with VQ-GAN audio codec and LLAMA-based language model. Includes multi-language support (Chinese, English, Japanese), Gradio WebUI for inference, HTTP API server, and Docker support. Added special optimizations for Chinese users including mirror downloads and localized documentation.
Original source Report a problem

Fish Audio Release Notes

Fish Audio S1

Fish Audio Overview

v1.5.1

v1.5.0

Release Notes

v1.4.2

v1.2.1

v1.2

v1.1.1

Breaking changes

Added

v1.1.0

Release notes

v1.0.0

v0.2.0

Release notes

Related vendors