- Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.5.1
Fixed critical PyTorch security settings and improved inference speed significantly. Added ONNX export support for better deployment options and enhanced text processing for Arabic and Hebrew languages. Includes bug fixes for Apple Silicon (MPS) compatibility and reorganized library structure for cleaner codebase.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.1.2
Minor release
- Minor release adding Chinese text normalization support and a streaming audio download button in the WebUI.
- Fixed LoRA merging issues and improved Firefly performance.
- Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.2.1
Replaced Whisper with SenseVoice for better ASR and added native Apple Silicon support.
Original source Report a problem
Includes Portuguese (Brazil) localization, streaming audio functionality, and CPU-only inference improvements. Pinned PyTorch to 2.3.1 to fix inference speed issues and aligned API with official closed-source version. - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.4.0
Major release with new VQGAN architecture for improved audio quality and faster inference. Updated WebUI with enhanced interface and better language switching. Added Japanese documentation translation and fixed inference warmup issues for better performance.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.4.1
Improvements
Infrastructure improvements focused on Docker optimization and multi-platform builds.
Updated PyTorch version and replaced audio backend from sox for better performance. Enhanced CI/CD pipeline with buildx support and fixed various Docker-related issues.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.4.2
Documentation-focused release with comprehensive updates for v1.4, macOS support, and multiple language translations.
Improved Docker support and API enhancements for JSON format handling. Added audio selection to WebUI and fixed various stability issues including cache handling and backend performance.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v0.2.0
Release notes
First public release of Fish Speech featuring a complete text-to-speech pipeline with VQ-GAN audio codec and LLAMA-based language model. Includes multi-language support (Chinese, English, Japanese), Gradio WebUI for inference, HTTP API server, and Docker support. Added special optimizations for Chinese users including mirror downloads and localized documentation.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.1.0
Release notes
Added VITS decoder integration with full streaming support and queue management for real-time audio generation.
Introduced internationalization (i18n) with Spanish translation and improved Windows packaging. Optimized GPU memory usage and CPU-only inference performance while adding LoRA support to the Gradio UI.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.0.0
Major milestone release introducing new VQ-GAN architecture with VITS decoder support, LoRA fine-tuning, and streaming inference capabilities.
Breaking changes include removal of the Rust-based data server, new tokenizer replacing phonemizer, and updated model architecture (VQ + DiT + Reflow). Achieved 4x memory reduction during loading and added WebUI for training and annotation.
Original source Report a problem - Nov 11, 2025
- Parsed from source:Nov 11, 2025
- Detected by Releasebot:Dec 23, 2025
v1.1.1
Breaking changes
Replaced zibai with uvicorn for API server, new text-splitter with byte-based length calculation, and license change to CC-BY-NC-SA 4.0.
Added
- Apple Silicon (MPS) support
- Windows one-click installation
- automatic model downloading with resume capability
- Improved WebUI with better file selection and download progress indicators