🧠
Model

Fish Speech 1.5

by fishaudio hf-model--fishaudio--fish-speech-1.5
Nexus Index
23.0 Top 2%
P / V / C / U Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
1.9K DL / 30D
0.0%

**Fish Speech V1.5** is a leading text-to-speech (TTS) model trained on more than 1 ...

Audited 23 FNI Score
Tiny - Params
- Context
Hot 1.9K Downloads
Dense DUAL_AR Architecture
Model Information Summary
Entity Passport
Registry ID hf-model--fishaudio--fish-speech-1.5
Provider huggingface
📜

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__fishaudio__fish_speech_1.5,
  author = {fishaudio},
  title = {Fish Speech 1.5 Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/fishaudio/fish-speech-1.5}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
fishaudio. (2026). Fish Speech 1.5 [Model]. Free2AITools. https://huggingface.co/fishaudio/fish-speech-1.5

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

Quick Commands

🤗 HF Download
huggingface-cli download fishaudio/fish-speech-1.5

âš–ī¸ Nexus Index V16.5

23.0
ESTIMATED IMPACT TIER
Popularity (P) 0
Freshness (F) 0
Completeness (C) 0
Utility (U) 0

đŸ’Ŧ Index Insight

The Free2AITools Nexus Index for Fish Speech 1.5 aggregates Popularity (P:0), Freshness (F:0), and Completeness (C:0). The Utility score (U:0) represents deployment readiness and ecosystem adoption.

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
---

🚀 What's Next?

Technical Deep Dive


tags:

  • text-to-speech
    license: cc-by-nc-sa-4.0
    language:
  • zh
  • en
  • de
  • ja
  • fr
  • es
  • ko
  • ar
  • nl
  • ru
  • it
  • pl
  • pt
    pipeline_tag: text-to-speech
    inference: false
    extra_gated_prompt: >-
    You agree to not use the model to generate contents that violate DMCA or local
    laws.
    extra_gated_fields:
    Country: country
    Specific date: date_picker
    I agree to use this model for non-commercial use ONLY: checkbox

Fish Speech V1.5

Fish Speech V1.5 is a leading text-to-speech (TTS) model trained on more than 1 million hours of audio data in multiple languages.

Supported languages:

  • English (en) >300k hours
  • Chinese (zh) >300k hours
  • Japanese (ja) >100k hours
  • German (de) ~20k hours
  • French (fr) ~20k hours
  • Spanish (es) ~20k hours
  • Korean (ko) ~20k hours
  • Arabic (ar) ~20k hours
  • Russian (ru) ~20k hours
  • Dutch (nl) <10k hours
  • Italian (it) <10k hours
  • Polish (pl) <10k hours
  • Portuguese (pt) <10k hours

Please refer to Fish Speech Github for more info.
Demo available at Fish Audio.

Citation

If you found this repository useful, please consider citing this work:

@misc{fish-speech-v1.4,
      title={Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis}, 
      author={Shijia Liao and Yuxuan Wang and Tianyu Li and Yifan Cheng and Ruoyi Zhang and Rongzhi Zhou and Yijin Xing},
      year={2024},
      eprint={2411.01156},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2411.01156}, 
}

License

This model is permissively licensed under the CC-BY-NC-SA-4.0 license.

📝 Limitations & Considerations

  • â€ĸ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • â€ĸ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • â€ĸ FNI scores are relative rankings and may change as new models are added.
  • â€ĸ Source: Unknown
Top Tier

Social Proof

HuggingFace Hub
656Likes
1.9KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Model Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-model--fishaudio--fish-speech-1.5
source
huggingface
author
fishaudio
tags
dual_artext-to-speechzhendejafreskoarnlruitplptarxiv:2411.01156license:cc-by-nc-sa-4.0region:us

âš™ī¸ Technical Specs

architecture
dual_ar
params billions
null
context length
null
pipeline tag
text-to-speech

📊 Engagement & Metrics

likes
656
downloads
1,896

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)