Model

chatterbox

by ResembleAI ID: hf-model--resembleai--chatterbox
FNI Rank 37
Percentile Top 2%
Activity
β†’ 0.0%

Chatterbox TTS

Audited 37 FNI Score
Tiny - Params
- Context
Hot 694.1K Downloads
Model Information Summary
Entity Passport
Registry ID hf-model--resembleai--chatterbox
Provider huggingface

πŸ•ΈοΈ Neural Mesh Hub

Interconnecting Research, Data & Ecosystem

πŸ“°

πŸ“° Timeline & Reports

πŸ•ΈοΈ

Intelligence Hive

Multi-source Relation Matrix

Live Index
πŸ“°

Intelligence Reports

πŸ“œ

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__resembleai__chatterbox,
  author = {ResembleAI},
  title = {chatterbox Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/ResembleAI/chatterbox}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
ResembleAI. (2026). chatterbox [Model]. Free2AITools. https://huggingface.co/ResembleAI/chatterbox

πŸ”¬Technical Deep Dive

Full Specifications [+]

⚑ Quick Commands

πŸ€— HF Download
huggingface-cli download resembleai/chatterbox

βš–οΈ Free2AI Nexus Index

Methodology β†’ πŸ“˜ What is FNI?
37.0
Top 2% Overall Impact
πŸ”₯ Popularity (P) 0
πŸš€ Velocity (V) 0
πŸ›‘οΈ Credibility (C) 0
πŸ”§ Utility (U) 0
Nexus Verified Data

πŸ’¬ Why this score?

This chatterbox has a P score of 0 (popularity from downloads/likes), V of 0 (growth velocity), C of 0 (credibility from citations), and U of 0 (utility/deploy support).

Data Verified πŸ• Last Updated: Not calculated
Free2AI Nexus Index | Fair Β· Transparent Β· Explainable | Full Methodology
---

πŸš€ What's Next?

README

6,715 chars β€’ Full Disclosure Protocol Active

ZEN MODE β€’ README

license: mit
language:

  • ar
  • da
  • de
  • el
  • en
  • es
  • fi
  • fr
  • he
  • hi
  • it
  • ja
  • ko
  • ms
  • nl
  • no
  • pl
  • pt
  • ru
  • sv
  • sw
  • tr
  • zh
    pipeline_tag: text-to-speech
    tags:
  • text-to-speech
  • speech
  • speech-generation
  • voice-cloning
  • multilingual-tts
    library_name: chatterbox

cb-big2

Chatterbox TTS

Made with ❀️ by resemble-logo-horizontal

09/04 πŸ”₯ Introducing Chatterbox Multilingual in 23 Languages!

We're excited to introduce Chatterbox and Chatterbox Multilingual, Resemble AI's production-grade open source TTS models. Chatterbox Multilingual supports Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi, Italian, Japanese, Korean, Malay, Dutch, Norwegian, Polish, Portuguese, Russian, Swedish, Swahili, Turkish, Chinese out of the box. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.

Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out. Try it now on our Hugging Face Gradio app.

If you like the model but need to scale or tune it for higher accuracy, check out our competitively priced TTS service (link). It delivers reliable performance with ultra-low latency of sub 200msβ€”ideal for production use in agents, applications, or interactive media.

Key Details

  • Multilingual, zero-shot TTS supporting 23 languages
  • SoTA zeroshot English TTS
  • 0.5B Llama backbone
  • Unique exaggeration/intensity control
  • Ultra-stable with alignment-informed inference
  • Trained on 0.5M hours of cleaned data
  • Watermarked outputs
  • Easy voice conversion script
  • Outperforms ElevenLabs

Tips

  • General Use (TTS and Voice Agents):

    • The default settings (exaggeration=0.5, cfg=0.5) work well for most prompts.
    • If the reference speaker has a fast speaking style, lowering cfg to around 0.3 can improve pacing.
  • Expressive or Dramatic Speech:

    • Try lower cfg values (e.g. ~0.3) and increase exaggeration to around 0.7 or higher.
    • Higher exaggeration tends to speed up speech; reducing cfg helps compensate with slower, more deliberate pacing.

Note: Ensure that the reference clip matches the specified language tag. Otherwise, language transfer outputs may inherit the accent of the reference clip’s language.
To mitigate this, set the CFG weight to 0.

Installation

pip install chatterbox-tts

Usage

import torchaudio as ta
from chatterbox.tts import ChatterboxTTS

model = ChatterboxTTS.from_pretrained(device="cuda")

text = "Ezreal and Jinx teamed up with Ahri, Yasuo, and Teemo to take down the enemy's Nexus in an epic late-game pentakill." wav = model.generate(text) ta.save("test-1.wav", wav, model.sr)

If you want to synthesize with a different voice, specify the audio prompt

AUDIO_PROMPT_PATH="YOUR_FILE.wav" wav = model.generate(text, audio_prompt_path=AUDIO_PROMPT_PATH) ta.save("test-2.wav", wav, model.sr)

Multilingual Quickstart

import torchaudio as ta
from chatterbox.mtl_tts import ChatterboxMultilingualTTS

multilingual_model = ChatterboxMultilingualTTS.from_pretrained(device="cuda")

french_text = "Bonjour, comment ça va? Ceci est le modèle de synthèse vocale multilingue Chatterbox, il prend en charge 23 langues." wav_french = multilingual_model.generate(french_text, language_id="fr") ta.save("test-french.wav", wav_french, model.sr)

chinese_text = "δ½ ε₯½οΌŒδ»Šε€©ε€©ζ°”ηœŸδΈι”™οΌŒεΈŒζœ›δ½ ζœ‰δΈ€δΈͺζ„‰εΏ«ηš„ε‘¨ζœ«γ€‚" wav_chinese = multilingual_model.generate(chinese_text, language_id="zh") ta.save("test-chinese.wav", wav_chinese, model.sr)

See example_tts.py for more examples.

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Every audio file generated by Chatterbox includes Resemble AI's Perth (Perceptual Threshold) Watermarker - imperceptible neural watermarks that survive MP3 compression, audio editing, and common manipulations while maintaining nearly 100% detection accuracy.

Disclaimer

Don't use this model to do bad things. Prompts are sourced from freely available data on the internet.

πŸ“ Limitations & Considerations

  • β€’ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • β€’ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • β€’ FNI scores are relative rankings and may change as new models are added.
  • ⚠ License Unknown: Verify licensing terms before commercial use.
  • β€’ Source: Unknown
Top Tier

Social Proof

HuggingFace Hub
1.3KLikes
694.1KDownloads
πŸ”„ Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Model Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

πŸ†” Identity & Source

id
hf-model--resembleai--chatterbox
source
huggingface
author
ResembleAI
tags
chatterboxtext-to-speechspeechspeech-generationvoice-cloningmultilingual-ttsardadeelenesfifrhehiitjakomsnlnoplptrusvswtrzhlicense:mitregion:us

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag
null

πŸ“Š Engagement & Metrics

likes
1,318
downloads
694,062

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)