📊
Dataset

Uncovai Tts

by UncovAI hf-dataset--uncovai--uncovai_tts
Nexus Index
21.0 Top 3%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

**UncovAI TTS** consists of **12,000 synthetic audio files** generated from the DailyDialog dataset. It is designed for AI audio detection, forensics, and TTS research. The dataset is organized into three folders, with **4,000 audio files per model**: - : Generated using nari-labs/Dia2-2B - : Generated using maya-research/maya1 - : Generated using myshell-ai/MeloTTS-English - **AI Audio D...

Data Integrity 21 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--uncovai--uncovai_tts
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__uncovai__uncovai_tts,
  author = {UncovAI},
  title = {Uncovai Tts Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/UncovAI/UncovAI_TTS}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
UncovAI. (2026). Uncovai Tts [Dataset]. Free2AITools. https://huggingface.co/datasets/UncovAI/UncovAI_TTS

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

âš–ī¸ Nexus Index V2.0

21.0
ESTIMATED IMPACT TIER
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

đŸ’Ŧ Index Insight

FNI V2.0 for Uncovai Tts: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
âŦ‡ī¸
Downloads
16,656
â¤ī¸
Likes
5

đŸ‘ī¸ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

đŸ§Ŧ Field Logic

đŸ§Ŧ

Schema not yet indexed for this dataset.

Dataset Specification


license: cc-by-4.0
task_categories:

  • text-to-speech
    language:
  • en
    size_categories:
  • 10K<n<100K

UncovAI TTS Dataset

UncovAI TTS consists of 12,000 synthetic audio files generated from the DailyDialog dataset. It is designed for AI audio detection, forensics, and TTS research.

📂 Dataset Structure

The dataset is organized into three folders, with 4,000 audio files per model:

đŸŽ¯ Use Cases

  • AI Audio Detection: Training classifiers to detect synthetic voices.
  • Knowledge Distillation: Using high-quality synthetic data to train smaller student models.
  • TTS Evaluation: Benchmarking different architectures on conversational text.

📜 Citation

If you use this dataset, please cite this repository, the DailyDialog dataset and the original model creators:

📄 Paper

Title: Audio Deepfake Detection in the Age of Advanced Text-to-Speech models
Authors: Robin Singh, Aditya Yogesh Nair, Fabio Palumbo, Florian Barbaro, Anna Dyka, Lohith Rachakonda
Paper: https://arxiv.org/abs/2601.20510

BibTeX

@dataset{uncovai2026uncovaitts,
  author       = {UncovAI},
  title        = {{UncovAI\_TTS}: Synthetic and real multilingual text-to-speech dataset},
  publisher    = {Hugging Face Datasets},
  year         = {2026},
  doi          = {10.57967/hf/7548},
  url          = {https://huggingface.co/datasets/UncovAI/UncovAI_TTS}
}

This project was provided with computing AI and storage resources by GENCI at IDRIS thanks to the grant 2025-AD011016076 on the supercomputer Jean Zay's V100 partition .

Top Tier

Social Proof

HuggingFace Hub
5Likes
16.7KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--uncovai--uncovai_tts
source
huggingface
author
UncovAI
tags
task_categories:text-to-speechlanguage:enlicense:cc-by-4.0size_categories:1kformat:audiofoldermodality:audiolibrary:datasetslibrary:mlcroissantarxiv:2601.20510doi:10.57967/hf/7548region:us

âš™ī¸ Technical Specs

architecture
null
params billions
null
context length
null

📊 Engagement & Metrics

likes
5
downloads
16,656

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)