🧠
Model

Piper De De Thorsten Medium

by Trelis hf-model--trelis--piper-de-de-thorsten-medium
Nexus Index
37.2 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 3
R: Recency 94
Q: Quality 50
Tech Context
Vital Performance
28 DL / 30D
0.0%
Audited 37.2 FNI Score
Tiny - Params
- Context
28 Downloads
Restricted CC0 License
Model Information Summary
Entity Passport
Registry ID hf-model--trelis--piper-de-de-thorsten-medium
License CC0-1.0
Provider huggingface
📜

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__trelis__piper_de_de_thorsten_medium,
  author = {Trelis},
  title = {Piper De De Thorsten Medium Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/trelis/piper-de-de-thorsten-medium}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
Trelis. (2026). Piper De De Thorsten Medium [Model]. Free2AITools. https://huggingface.co/trelis/piper-de-de-thorsten-medium

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

Quick Commands

🤗 HF Download
huggingface-cli download trelis/piper-de-de-thorsten-medium

âš–ī¸ Nexus Index V2.0

37.2
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 3
Recency (R) 94
Quality (Q) 50

đŸ’Ŧ Index Insight

FNI V2.0 for Piper De De Thorsten Medium: Semantic (S:50), Authority (A:0), Popularity (P:3), Recency (R:94), Quality (Q:50).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
---

🚀 What's Next?

Technical Deep Dive

Piper TTS: de_DE-thorsten-medium

Medium-size German male voice by Thorsten Mueller.

Model Details

Field Value
Architecture VITS (end-to-end)
Format ONNX
Language German (Germany)
Gender Male
Model Size medium (~63 MB ONNX, ~15M params)
Sample Rate 22050 Hz
License CC0 (Public Domain)

Note: Piper uses the terms "medium", "high", etc. to refer to model size, not output quality. Medium models (63 MB, ~15M params) and high models (114 MB, ~28M params) both produce 22.05 kHz audio.

Usage

With piper-tts (GPL)

python
from piper import PiperVoice

voice = PiperVoice.load("model.onnx")
for chunk in voice.synthesize("Hello, this is a test."):
    # chunk.audio_float_array contains float32 audio
    pass

Standalone ONNX (MIT — no piper-tts dependency)

Requires espeak-ng installed (brew install espeak-ng / apt install espeak-ng).

python
import json, subprocess, numpy as np, onnxruntime as ort, soundfile as sf
from huggingface_hub import hf_hub_download

model_id = "Trelis/piper-de-de-thorsten-medium"
onnx_path = hf_hub_download(model_id, "model.onnx")
config_path = hf_hub_download(model_id, "model.onnx.json")

with open(config_path) as f:
    config = json.load(f)

session = ort.InferenceSession(onnx_path, providers=["CPUExecutionProvider"])
phoneme_id_map = config["phoneme_id_map"]
espeak_voice = config["espeak"]["voice"]

def phonemize(text, voice):
    out = subprocess.run(
        ["espeak-ng", "-v", voice, "-q", "--ipa=2", "-x", text],
        capture_output=True, text=True,
    ).stdout.strip()
    return [list(line.replace("_", " ")) for line in out.split("\n") if line.strip()]

def to_ids(phonemes, pmap):
    ids = [pmap["^"][0], pmap["_"][0]]
    for p in phonemes:
        if p in pmap:
            ids.extend(pmap[p])
            ids.append(pmap["_"][0])
    ids.append(pmap["$"][0])
    return ids

text = "Hello, this is a test."
audio_chunks = []
for sentence in phonemize(text, espeak_voice):
    ids = to_ids(sentence, phoneme_id_map)
    if len(ids) < 3:
        continue
    audio = session.run(None, {
        "input": np.array([ids], dtype=np.int64),
        "input_lengths": np.array([len(ids)], dtype=np.int64),
        "scales": np.array([
            config["inference"]["noise_scale"],
            config["inference"]["length_scale"],
            config["inference"]["noise_w"],
        ], dtype=np.float32),
    })[0]
    audio_chunks.append(audio.squeeze())

audio = np.concatenate(audio_chunks).astype(np.float32)
sf.write("output.wav", audio, config["audio"]["sample_rate"])

Fine-tuning

You can fine-tune this model on your own voice data using Trelis Studio. Piper models can be trained on custom datasets to create personalized voices.

Attribution

Trained on Thorsten Voice dataset by Thorsten Mueller. Fine-tuned from lessac medium.

Re-hosted from rhasspy/piper-voices. Original voice: de_DE-thorsten-medium

âš ī¸ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

  • â€ĸ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • â€ĸ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • â€ĸ FNI scores are relative rankings and may change as new models are added.
  • ⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub
28Downloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id
hf-model--trelis--piper-de-de-thorsten-medium
slug
trelis--piper-de-de-thorsten-medium
source
huggingface
author
Trelis
license
CC0-1.0
tags
piper, onnx, tts, vits, text-to-speech, de, license:cc0-1.0, region:us

âš™ī¸ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag
text-to-speech

📊 Engagement & Metrics

downloads
28
stars
0
forks
0

Data indexed from public sources. Updated daily.