🧠

Model

Fish Speech 1.5

by fishaudio hf-model--fishaudio--fish-speech-1.5

Nexus Index

23.0 Top 2%

P / V / C / U Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context

Vital Performance

1.9K DL / 30D

0.0%

**Fish Speech V1.5** is a leading text-to-speech (TTS) model trained on more than 1 ...

Source →

Audited 23 FNI Score

Tiny - Params

- Context

Hot 1.9K Downloads

Dense DUAL_AR Architecture

Model Information Summary
Entity Passport
Registry ID	hf-model--fishaudio--fish-speech-1.5
Provider	huggingface

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__fishaudio__fish_speech_1.5,
  author = {fishaudio},
  title = {Fish Speech 1.5 Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/fishaudio/fish-speech-1.5}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

fishaudio. (2026). Fish Speech 1.5 [Model]. Free2AITools. https://huggingface.co/fishaudio/fish-speech-1.5

🔬Technical Deep Dive

Full Specifications [+]

Quick Commands

🤗 HF Download

huggingface-cli download fishaudio/fish-speech-1.5

⚖️ Nexus Index V16.5

Methodology Index Protocol

23.0

ESTIMATED IMPACT TIER

Popularity (P) 0

Freshness (F) 0

Completeness (C) 0

Utility (U) 0

💬 Index Insight

The Free2AITools Nexus Index for Fish Speech 1.5 aggregates Popularity (P:0), Freshness (F:0), and Completeness (C:0). The Utility score (U:0) represents deployment readiness and ecosystem adoption.

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Deployment Guide

Understand deployment options

Technical Deep Dive

tags:

text-to-speech
license: cc-by-nc-sa-4.0
language:
zh
en
de
ja
fr
es
ko
ar
nl
ru
it
pl
pt
pipeline_tag: text-to-speech
inference: false
extra_gated_prompt: >-
You agree to not use the model to generate contents that violate DMCA or local
laws.
extra_gated_fields:
Country: country
Specific date: date_picker
I agree to use this model for non-commercial use ONLY: checkbox

Fish Speech V1.5

Fish Speech V1.5 is a leading text-to-speech (TTS) model trained on more than 1 million hours of audio data in multiple languages.

Supported languages:

English (en) >300k hours
Chinese (zh) >300k hours
Japanese (ja) >100k hours
German (de) ~20k hours
French (fr) ~20k hours
Spanish (es) ~20k hours
Korean (ko) ~20k hours
Arabic (ar) ~20k hours
Russian (ru) ~20k hours
Dutch (nl) <10k hours
Italian (it) <10k hours
Polish (pl) <10k hours
Portuguese (pt) <10k hours

Please refer to Fish Speech Github for more info.
Demo available at Fish Audio.

Citation

If you found this repository useful, please consider citing this work:

@misc{fish-speech-v1.4,
      title={Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis}, 
      author={Shijia Liao and Yuxuan Wang and Tianyu Li and Yifan Cheng and Ruoyi Zhang and Rongzhi Zhou and Yijin Xing},
      year={2024},
      eprint={2411.01156},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2411.01156}, 
}

License

This model is permissively licensed under the CC-BY-NC-SA-4.0 license.

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
• Source: Unknown

Top Tier

Social Proof

HuggingFace Hub

656Likes

1.9KDownloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id: hf-model--fishaudio--fish-speech-1.5
source: huggingface
author: fishaudio
tags: dual_artext-to-speechzhendejafreskoarnlruitplptarxiv:2411.01156license:cc-by-nc-sa-4.0region:us

⚙️ Technical Specs

architecture: dual_ar
params billions: null
context length: null
pipeline tag: text-to-speech

📊 Engagement & Metrics

likes: 656
downloads: 1,896

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)

Welcome to Free2AI Tools!

Smart Search

FNI Score

You're All Set!