🧠
Model

Mms 300m 1130 Forced Aligner

by MahmoudAshraf hf-model--mahmoudashraf--mms-300m-1130-forced-aligner
Nexus Index
49.9 Top 100%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
4.5M DL / 30D
0.0%
Audited 49.9 FNI Score
Tiny - Params
- Context
Hot 4.5M Downloads
Restricted CC License
Model Information Summary
Entity Passport
Registry ID hf-model--mahmoudashraf--mms-300m-1130-forced-aligner
License CC-BY-NC-4.0
Provider huggingface
📜

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__mahmoudashraf__mms_300m_1130_forced_aligner,
  author = {MahmoudAshraf},
  title = {Mms 300m 1130 Forced Aligner Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/mahmoudashraf/mms-300m-1130-forced-aligner}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
MahmoudAshraf. (2026). Mms 300m 1130 Forced Aligner [Model]. Free2AITools. https://huggingface.co/mahmoudashraf/mms-300m-1130-forced-aligner

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

Quick Commands

🤗 HF Download
huggingface-cli download mahmoudashraf/mms-300m-1130-forced-aligner
đŸ“Ļ Install Lib
pip install -U transformers

âš–ī¸ Nexus Index V2.0

49.9
ESTIMATED IMPACT TIER
Semantic (S) 0
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

đŸ’Ŧ Index Insight

FNI V2.0 for Mms 300m 1130 Forced Aligner: Semantic (S:0), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
---

🚀 What's Next?

Technical Deep Dive

Forced Alignment with Hugging Face CTC Models

This Python package provides an efficient way to perform forced alignment between text and audio using Hugging Face's pretrained models. it also features an improved implementation to use much less memory than TorchAudio forced alignment API.

The model checkpoint uploaded here is a conversion from torchaudio to HF Transformers for the MMS-300M checkpoint trained on forced alignment dataset

Installation

bash
pip install git+https://github.com/MahmoudAshraf97/ctc-forced-aligner.git

Usage

python
import torch
from ctc_forced_aligner import (
    load_audio,
    load_alignment_model,
    generate_emissions,
    preprocess_text,
    get_alignments,
    get_spans,
    postprocess_results,
)

audio_path = "your/audio/path"
text_path = "your/text/path"
language = "iso" # ISO-639-3 Language code
device = "cuda" if torch.cuda.is_available() else "cpu"
batch_size = 16


alignment_model, alignment_tokenizer = load_alignment_model(
    device,
    dtype=torch.float16 if device == "cuda" else torch.float32,
)

audio_waveform = load_audio(audio_path, alignment_model.dtype, alignment_model.device)


with open(text_path, "r") as f:
    lines = f.readlines()
text = "".join(line for line in lines).replace("\n", " ").strip()

emissions, stride = generate_emissions(
    alignment_model, audio_waveform, batch_size=batch_size
)

tokens_starred, text_starred = preprocess_text(
    text,
    romanize=True,
    language=language,
)

segments, scores, blank_token = get_alignments(
    emissions,
    tokens_starred,
    alignment_tokenizer,
)

spans = get_spans(tokens_starred, segments, blank_token)

word_timestamps = postprocess_results(text_starred, spans, stride, scores)

âš ī¸ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

  • â€ĸ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • â€ĸ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • â€ĸ FNI scores are relative rankings and may change as new models are added.
  • ⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub
4.5MDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id
hf-model--mahmoudashraf--mms-300m-1130-forced-aligner
slug
mahmoudashraf--mms-300m-1130-forced-aligner
source
huggingface
author
MahmoudAshraf
license
CC-BY-NC-4.0
tags
transformers, pytorch, safetensors, wav2vec2, automatic-speech-recognition, mms, audio, voice, speech, forced-alignment, ab, af, ak, am, ar, as, av, ay, az, ba, bm, be, bn, bi, bo, sh, br, bg, ca, cs, ce, cv, ku, cy, da, de, dv, dz, el, en, eo, et, eu, ee, fo, fa, fj, fi, fr, fy, ff, ga, gl, gn, gu, zh, ht, ha, he, hi, hu, hy, ig, ia, ms, is, it, jv, ja, kn, ka, kk, kr, km, ki, rw, ky, ko, kv, lo, la, lv, ln, lt, lb, lg, mh, ml, mr, mk, mg, mt, mn, mi, my, nl, no, ne, ny, oc, om, or, os, pa, pl,

âš™ī¸ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag
automatic-speech-recognition

📊 Engagement & Metrics

downloads
4,472,930
stars
0
forks
0

Data indexed from public sources. Updated daily.