🧠

Model

chatterbox

Name: chatterbox
Author: ResembleAI

by ResembleAI ID: hf-model--resembleai--chatterbox

FNI Rank 37

Percentile Top 2%

Activity

→ 0.0%

Chatterbox TTS

View Source Code →

Audited 37 FNI Score

Tiny - Params

- Context

Hot 694.1K Downloads

Model Information Summary
Entity Passport
Registry ID	hf-model--resembleai--chatterbox
Provider	huggingface

📰

📰 Timeline & Reports

📰FEATURES

2026 02 14report

Ecosystem Node

→

🕸️

Intelligence Hive

Multi-source Relation Matrix

Live Index

📰

Intelligence Reports

📈

Momentum Index

🏷️

Contextual Anchors

chatterbox text-to-speech speech speech-generation voice-cloning multilingual-tts ar da de el en es

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__resembleai__chatterbox,
  author = {ResembleAI},
  title = {chatterbox Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/ResembleAI/chatterbox}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

ResembleAI. (2026). chatterbox [Model]. Free2AITools. https://huggingface.co/ResembleAI/chatterbox

🔬Technical Deep Dive

Full Specifications [+]

⚡ Quick Commands

🤗 HF Download

huggingface-cli download resembleai/chatterbox

⚖️ Free2AI Nexus Index

Methodology → 📘 What is FNI?

37.0

Top 2% Overall Impact

🔥 Popularity (P) 0

🚀 Velocity (V) 0

🛡️ Credibility (C) 0

🔧 Utility (U) 0

Nexus Verified Data

💬 Why this score?

This chatterbox has a P score of 0 (popularity from downloads/likes), V of 0 (growth velocity), C of 0 (credibility from citations), and U of 0 (utility/deploy support).

🔗 Source Links (Click to verify)

📊 P: HuggingFace Stats 📈 V: 7-Day Delta 📄 C: Papers With Code 🔧 U: Deploy Score

Data Verified 🕐 Last Updated: Not calculated

Free2AI Nexus Index | Fair · Transparent · Explainable | Full Methodology

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

README

license: mit
language:

ar
da
de
el
en
es
fi
fr
he
hi
it
ja
ko
ms
nl
no
pl
pt
ru
sv
sw
tr
zh
pipeline_tag: text-to-speech
tags:
text-to-speech
speech
speech-generation
voice-cloning
multilingual-tts
library_name: chatterbox

Chatterbox TTS

Made with ❤️ by resemble-logo-horizontal

09/04 🔥 Introducing Chatterbox Multilingual in 23 Languages!

We're excited to introduce Chatterbox and Chatterbox Multilingual, Resemble AI's production-grade open source TTS models. Chatterbox Multilingual supports Arabic, Danish, German, Greek, English, Spanish, Finnish, French, Hebrew, Hindi, Italian, Japanese, Korean, Malay, Dutch, Norwegian, Polish, Portuguese, Russian, Swedish, Swahili, Turkish, Chinese out of the box. Licensed under MIT, Chatterbox has been benchmarked against leading closed-source systems like ElevenLabs, and is consistently preferred in side-by-side evaluations.

Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. It's also the first open source TTS model to support emotion exaggeration control, a powerful feature that makes your voices stand out. Try it now on our Hugging Face Gradio app.

If you like the model but need to scale or tune it for higher accuracy, check out our competitively priced TTS service (link). It delivers reliable performance with ultra-low latency of sub 200ms—ideal for production use in agents, applications, or interactive media.

Key Details

Multilingual, zero-shot TTS supporting 23 languages
SoTA zeroshot English TTS
0.5B Llama backbone
Unique exaggeration/intensity control
Ultra-stable with alignment-informed inference
Trained on 0.5M hours of cleaned data
Watermarked outputs
Easy voice conversion script
Outperforms ElevenLabs

Tips

General Use (TTS and Voice Agents):
- The default settings (exaggeration=0.5, cfg=0.5) work well for most prompts.
- If the reference speaker has a fast speaking style, lowering cfg to around 0.3 can improve pacing.
Expressive or Dramatic Speech:
- Try lower cfg values (e.g. ~0.3) and increase exaggeration to around 0.7 or higher.
- Higher exaggeration tends to speed up speech; reducing cfg helps compensate with slower, more deliberate pacing.

Note: Ensure that the reference clip matches the specified language tag. Otherwise, language transfer outputs may inherit the accent of the reference clip’s language.
To mitigate this, set the CFG weight to 0.

Installation

pip install chatterbox-tts

Usage

import torchaudio as ta
from chatterbox.tts import ChatterboxTTS

model = ChatterboxTTS.from_pretrained(device="cuda")

text = "Ezreal and Jinx teamed up with Ahri, Yasuo, and Teemo to take down the enemy's Nexus in an epic late-game pentakill."
wav = model.generate(text)
ta.save("test-1.wav", wav, model.sr)

# If you want to synthesize with a different voice, specify the audio prompt
AUDIO_PROMPT_PATH="YOUR_FILE.wav"
wav = model.generate(text, audio_prompt_path=AUDIO_PROMPT_PATH)
ta.save("test-2.wav", wav, model.sr)

Multilingual Quickstart

import torchaudio as ta
from chatterbox.mtl_tts import ChatterboxMultilingualTTS

multilingual_model = ChatterboxMultilingualTTS.from_pretrained(device="cuda")

french_text = "Bonjour, comment ça va? Ceci est le modèle de synthèse vocale multilingue Chatterbox, il prend en charge 23 langues."
wav_french = multilingual_model.generate(french_text, language_id="fr")
ta.save("test-french.wav", wav_french, model.sr)

chinese_text = "你好，今天天气真不错，希望你有一个愉快的周末。"
wav_chinese = multilingual_model.generate(chinese_text, language_id="zh")
ta.save("test-chinese.wav", wav_chinese, model.sr)

See example_tts.py for more examples.

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Every audio file generated by Chatterbox includes Resemble AI's Perth (Perceptual Threshold) Watermarker - imperceptible neural watermarks that survive MP3 compression, audio editing, and common manipulations while maintaining nearly 100% detection accuracy.

Disclaimer

Don't use this model to do bad things. Prompts are sourced from freely available data on the internet.

6,715 chars • Full Disclosure Protocol Active

ZEN MODE • README

license: mit
language:

ar
da
de
el
en
es
fi
fr
he
hi
it
ja
ko
ms
nl
no
pl
pt
ru
sv
sw
tr
zh
pipeline_tag: text-to-speech
tags:
text-to-speech
speech
speech-generation
voice-cloning
multilingual-tts
library_name: chatterbox

Chatterbox TTS

Made with ❤️ by resemble-logo-horizontal

09/04 🔥 Introducing Chatterbox Multilingual in 23 Languages!

Key Details

Multilingual, zero-shot TTS supporting 23 languages
SoTA zeroshot English TTS
0.5B Llama backbone
Unique exaggeration/intensity control
Ultra-stable with alignment-informed inference
Trained on 0.5M hours of cleaned data
Watermarked outputs
Easy voice conversion script
Outperforms ElevenLabs

Tips

General Use (TTS and Voice Agents):
- The default settings (exaggeration=0.5, cfg=0.5) work well for most prompts.
- If the reference speaker has a fast speaking style, lowering cfg to around 0.3 can improve pacing.
Expressive or Dramatic Speech:
- Try lower cfg values (e.g. ~0.3) and increase exaggeration to around 0.7 or higher.
- Higher exaggeration tends to speed up speech; reducing cfg helps compensate with slower, more deliberate pacing.

Installation

pip install chatterbox-tts

Usage

import torchaudio as ta
from chatterbox.tts import ChatterboxTTS

model = ChatterboxTTS.from_pretrained(device="cuda")
text = "Ezreal and Jinx teamed up with Ahri, Yasuo, and Teemo to take down the enemy's Nexus in an epic late-game pentakill."
wav = model.generate(text)
ta.save("test-1.wav", wav, model.sr)
If you want to synthesize with a different voice, specify the audio promptAUDIO_PROMPT_PATH="YOUR_FILE.wav"
wav = model.generate(text, audio_prompt_path=AUDIO_PROMPT_PATH)
ta.save("test-2.wav", wav, model.sr)

Multilingual Quickstart

import torchaudio as ta
from chatterbox.mtl_tts import ChatterboxMultilingualTTS

multilingual_model = ChatterboxMultilingualTTS.from_pretrained(device="cuda")
french_text = "Bonjour, comment ça va? Ceci est le modèle de synthèse vocale multilingue Chatterbox, il prend en charge 23 langues."
wav_french = multilingual_model.generate(french_text, language_id="fr")
ta.save("test-french.wav", wav_french, model.sr)
chinese_text = "你好，今天天气真不错，希望你有一个愉快的周末。"
wav_chinese = multilingual_model.generate(chinese_text, language_id="zh")
ta.save("test-chinese.wav", wav_chinese, model.sr)

See example_tts.py for more examples.

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Disclaimer

Don't use this model to do bad things. Prompts are sourced from freely available data on the internet.

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.
• Source: Unknown

Top Tier

Social Proof

HuggingFace Hub

1.3KLikes

694.1KDownloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id: hf-model--resembleai--chatterbox
source: huggingface
author: ResembleAI
tags: chatterboxtext-to-speechspeechspeech-generationvoice-cloningmultilingual-ttsardadeelenesfifrhehiitjakomsnlnoplptrusvswtrzhlicense:mitregion:us

⚙️ Technical Specs

architecture: null
params billions: null
context length: null
pipeline tag: null

📊 Engagement & Metrics

likes: 1,318
downloads: 694,062

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)

chatterbox

🕸️ Neural Mesh Hub

📰 Timeline & Reports

Intelligence Hive

Intelligence Reports

Momentum Index

Contextual Anchors

Cite this model

🔬Technical Deep Dive

⚡ Quick Commands

⚖️ Free2AI Nexus Index

💬 Why this score?

🔗 Source Links (Click to verify)

🚀 What's Next?

Find Training Datasets

Compare Benchmarks

Deployment Guide

README

Chatterbox TTS

Key Details

Tips

Installation

Usage

Multilingual Quickstart

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Disclaimer

📝 Limitations & Considerations

Social Proof

🛡️ Model Transparency Report

🆔 Identity & Source

⚙️ Technical Specs

📊 Engagement & Metrics

Welcome to Free2AI Tools!

Smart Search

FNI Score

You're All Set!

🕸️ Neural Mesh Hub

📰 Timeline & Reports

Intelligence Hive

Intelligence Reports

Momentum Index

Contextual Anchors

Cite this model

🔬Technical Deep Dive

⚡ Quick Commands

⚖️ Free2AI Nexus Index

💬 Why this score?

🔗 Source Links (Click to verify)

🚀 What's Next?

Find Training Datasets

Compare Benchmarks

Deployment Guide

README

Chatterbox TTS

Key Details

Tips

Installation

Usage

Multilingual Quickstart

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Disclaimer

Chatterbox TTS

Key Details

Tips

Installation

Usage

If you want to synthesize with a different voice, specify the audio prompt

Multilingual Quickstart

Acknowledgements

Built-in PerTh Watermarking for Responsible AI

Disclaimer

📝 Limitations & Considerations

Social Proof

🛡️ Model Transparency Report

🆔 Identity & Source

⚙️ Technical Specs

📊 Engagement & Metrics