Supertonic 2

Name: Supertonic 2
Author: Supertone

Nexus Index

40.9 Top 100%

S: Semantic 50

A: Authority 0

P: Popularity 42

R: Recency 80

Q: Quality 50

Downloads (last 30 days): 7.5K

Model Information Summary
Entity Passport
Registry ID	hf-model--supertone--supertonic-2
License	OpenRAIL
Provider	huggingface

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__supertone__supertonic_2,
  author = {Supertone},
  title = {Supertonic 2 Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/supertone/supertonic-2}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

Supertone. (2026). Supertonic 2 [Model]. Free2AITools. https://huggingface.co/supertone/supertonic-2

🔬Technical Deep Dive

Quick Commands

🤗 HF Download

huggingface-cli download supertone/supertonic-2
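
The same download can be scripted. A minimal sketch, assuming the `huggingface_hub` Python package is installed; `download_model` is an illustrative helper name, not part of Supertonic.

```python
from pathlib import Path
from typing import Optional

REPO_ID = "supertone/supertonic-2"  # repository named in the command above


def download_model(repo_id: str = REPO_ID, local_dir: Optional[str] = None) -> Path:
    """Download every file in the repository and return the local folder."""
    # Imported lazily so this module can be imported without the dependency installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=repo_id, local_dir=local_dir))


# Example (requires network access):
#   model_dir = download_model()
#   for path in sorted(model_dir.rglob("*")):
#       if path.is_file():
#           print(path.relative_to(model_dir))
```

With `local_dir` left as `None`, the files land in the default Hugging Face cache and the returned path points at that snapshot.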

---

Technical Deep Dive

Supertonic 2 — Lightning Fast, On-Device TTS, Multilingual TTS

Supertonic is a lightning-fast, on-device text-to-speech system designed for extreme performance with minimal computational overhead. Powered by ONNX Runtime, it runs entirely on your device—no cloud, no API calls, no privacy concerns.
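
Because inference runs through ONNX Runtime, a downloaded copy of the model can be inspected locally before writing any synthesis code. The sketch below assumes the repository ships one or more `.onnx` files and that the `onnxruntime` package is installed. The input names and shapes are not documented here, so the code only lists them and does not attempt synthesis. `describe_onnx_inputs` is an illustrative helper name.

```python
from pathlib import Path
from typing import Dict, List


def describe_onnx_inputs(model_dir: str) -> Dict[str, List[str]]:
    """Map each .onnx file under `model_dir` to a description of its inputs."""
    root = Path(model_dir)
    paths = sorted(root.rglob("*.onnx"))
    if not paths:
        return {}

    # Imported lazily: only needed once there is a model to open.
    import onnxruntime as ort

    summary: Dict[str, List[str]] = {}
    for path in paths:
        # CPUExecutionProvider keeps everything on the local CPU.
        session = ort.InferenceSession(str(path), providers=["CPUExecutionProvider"])
        summary[str(path.relative_to(root))] = [
            f"{arg.name}: {arg.type} {arg.shape}" for arg in session.get_inputs()
        ]
    return summary


# Example, with a hypothetical local folder:
#   for name, inputs in describe_onnx_inputs("./supertonic-2").items():
#       print(name, inputs)
```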

What's New in Supertonic 2

Supertonic 2 extends multilingual capabilities while maintaining the same inference speed and efficiency as the original.

🌍 Multilingual Support

Language	Code
English	`en`
Korean	`ko`
Spanish	`es`
Portuguese	`pt`
French	`fr`

⚡ Same Speed, More Languages

• No speed degradation: Supertonic 2 delivers the same ultra-fast inference speed as the original, up to 167× faster than real time.
• Efficient architecture: only 66M parameters, optimized for on-device deployment.
• Cross-language consistency: all supported languages share the same model architecture and inference pipeline.

Performance

We evaluated Supertonic's performance (with 2 inference steps) using two key metrics across input texts of varying lengths: Short (59 chars), Mid (152 chars), and Long (266 chars).

Metrics:

• Characters per Second: Measures throughput by dividing the number of input characters by the time required to generate audio. Higher is better.
• Real-time Factor (RTF): Measures the time taken to synthesize audio relative to its duration. Lower is better (e.g., an RTF of 0.1 means it takes 0.1 seconds to generate one second of audio).
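
Both metrics follow from one timed synthesis call. A minimal sketch, assuming a hypothetical `synthesize(text)` callable that returns the duration of the generated audio in seconds; `compute_metrics`, `measure`, and `fake_synthesize` are illustrative names, not part of Supertonic.

```python
import time
from typing import Callable, NamedTuple


class Metrics(NamedTuple):
    chars_per_second: float  # throughput; higher is better
    rtf: float               # real-time factor; lower is better


def compute_metrics(num_chars: int, elapsed_s: float, audio_s: float) -> Metrics:
    """Apply the two definitions above to raw measurements."""
    if elapsed_s <= 0 or audio_s <= 0:
        raise ValueError("elapsed_s and audio_s must be positive")
    return Metrics(num_chars / elapsed_s, elapsed_s / audio_s)


def measure(synthesize: Callable[[str], float], text: str) -> Metrics:
    """Time one call to `synthesize`, which must return audio duration in seconds."""
    start = time.perf_counter()
    audio_s = synthesize(text)
    elapsed_s = time.perf_counter() - start
    return compute_metrics(len(text), elapsed_s, audio_s)


def fake_synthesize(text: str) -> float:
    """Hypothetical stand-in for a TTS engine: sleeps briefly, then reports 0.07 s of audio per character."""
    time.sleep(0.01)
    return 0.07 * len(text)


if __name__ == "__main__":
    print(measure(fake_synthesize, "Hello from an on-device TTS benchmark."))
```

A real benchmark would average several runs per input length and discard a warm-up call; the single call here only illustrates the arithmetic.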

Characters per Second

System	Short (59 chars)	Mid (152 chars)	Long (266 chars)
Supertonic (M4 Pro - CPU)	912	1048	1263
Supertonic (M4 Pro - WebGPU)	996	1801	2509
Supertonic (RTX4090)	2615	6548	12164
`API` ElevenLabs Flash v2.5	144	209	287
`API` OpenAI TTS-1	37	55	82
`API` Gemini 2.5 Flash TTS	12	18	24
`API` Supertone Sona speech 1	38	64	92
`Open` Kokoro	104	107	117
`Open` NeuTTS Air	37	42	47

Notes:
• API = Cloud-based API services (measured from Seoul)
• Open = Open-source models
• Supertonic (M4 Pro - CPU) and (M4 Pro - WebGPU): Tested with ONNX
• Supertonic (RTX4090): Tested with PyTorch model
• Kokoro: Tested on M4 Pro CPU with ONNX
• NeuTTS Air: Tested on M4 Pro CPU with Q8-GGUF

Real-time Factor

System	Short (59 chars)	Mid (152 chars)	Long (266 chars)
Supertonic (M4 Pro - CPU)	0.015	0.013	0.012
Supertonic (M4 Pro - WebGPU)	0.014	0.007	0.006
Supertonic (RTX4090)	0.005	0.002	0.001
`API` ElevenLabs Flash v2.5	0.133	0.077	0.057
`API` OpenAI TTS-1	0.471	0.302	0.201
`API` Gemini 2.5 Flash TTS	1.060	0.673	0.541
`API` Supertone Sona speech 1	0.372	0.206	0.163
`Open` Kokoro	0.144	0.124	0.126
`Open` NeuTTS Air	0.390	0.338	0.343
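
An RTF converts to a "times faster than real time" figure by taking its inverse. The sketch below applies that to the Long (266 chars) column of the table above. The published values are rounded to three decimals, so the results are approximate. The M4 Pro WebGPU entry gives about 167×, which appears to be the source of the "up to 167×" figure quoted earlier.

```python
# Real-time factors for the Long (266 chars) column, copied from the table above.
RTF_LONG = {
    "Supertonic (M4 Pro - CPU)": 0.012,
    "Supertonic (M4 Pro - WebGPU)": 0.006,
    "Supertonic (RTX4090)": 0.001,
    "ElevenLabs Flash v2.5": 0.057,
    "OpenAI TTS-1": 0.201,
    "Gemini 2.5 Flash TTS": 0.541,
    "Supertone Sona speech 1": 0.163,
    "Kokoro": 0.126,
    "NeuTTS Air": 0.343,
}


def speedup_over_real_time(rtf: float) -> float:
    """Seconds of audio produced per second of compute (the inverse of RTF)."""
    if rtf <= 0:
        raise ValueError("rtf must be positive")
    return 1.0 / rtf


if __name__ == "__main__":
    # Fastest systems first.
    for system, rtf in sorted(RTF_LONG.items(), key=lambda kv: kv[1]):
        print(f"{system:30s} RTF={rtf:.3f}  ~{speedup_over_real_time(rtf):.0f}x real time")
```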

Additional Performance Data (5-step inference)

Characters per Second (5-step)

System	Short (59 chars)	Mid (152 chars)	Long (266 chars)
Supertonic (M4 Pro - CPU)	596	691	850
Supertonic (M4 Pro - WebGPU)	570	1118	1546
Supertonic (RTX4090)	1286	3757	6242

Real-time Factor (5-step)

System	Short (59 chars)	Mid (152 chars)	Long (266 chars)
Supertonic (M4 Pro - CPU)	0.023	0.019	0.018
Supertonic (M4 Pro - WebGPU)	0.024	0.012	0.010
Supertonic (RTX4090)	0.011	0.004	0.002
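
The two metrics can be cross-checked against each other: synthesis time is characters divided by characters per second, and dividing that by the RTF gives the implied audio duration. The sketch below does this for the Long input on the M4 Pro CPU in both the 2-step and 5-step tables. It is a consistency check on rounded figures, not a measurement.

```python
def implied_audio_seconds(num_chars: int, chars_per_second: float, rtf: float) -> float:
    """Audio duration implied by the two metrics.

    synthesis_time = num_chars / chars_per_second
    rtf            = synthesis_time / audio_duration
    """
    synthesis_time = num_chars / chars_per_second
    return synthesis_time / rtf


# (chars/sec, RTF) for the Long (266 chars) input on the M4 Pro CPU, from the tables above.
LONG_CHARS = 266
TWO_STEP = (1263, 0.012)
FIVE_STEP = (850, 0.018)

if __name__ == "__main__":
    for label, (cps, rtf) in (("2-step", TWO_STEP), ("5-step", FIVE_STEP)):
        audio_s = implied_audio_seconds(LONG_CHARS, cps, rtf)
        print(f"{label}: ~{audio_s:.1f} s of audio")
    # Throughput ratio between the 2-step and 5-step settings.
    print(f"2-step throughput is ~{TWO_STEP[0] / FIVE_STEP[0]:.2f}x the 5-step throughput")
```

Both settings imply between 17 and 18 seconds of audio for the same 266-character input, so the two tables agree within the rounding of the published RTF values.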

License

This project’s sample code is released under the MIT License; see the LICENSE file for details.

The accompanying model is released under the OpenRAIL-M License; see the LICENSE file for details.

This model was trained using PyTorch, which is licensed under the BSD 3-Clause License but is not redistributed with this project; see the LICENSE file for details.

⚠️ Incomplete Data

Some information about this model is not available. Use with caution and verify details against the original source before relying on this data.

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
• License: the model is listed under OpenRAIL-M (see License above); verify the licensing terms before commercial use.

🛡️ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-model--supertone--supertonic-2
slug: supertone--supertonic-2
source: huggingface
author: Supertone
license: OpenRAIL
tags: supertonic, onnx, text-to-speech, speech-synthesis, tts, en, ko, es, pt, fr, license:openrail, region:us

⚙️ Technical Specs

architecture: null
params billions: null
context length: null
pipeline tag: text-to-speech

📊 Engagement & Metrics

downloads: 7,522
stars: 0
forks: 0

Data indexed from public sources. Updated daily.
