📊
Dataset

Owls 4b 180k

by espnet hf-model--espnet--owls_4b_180k
Nexus Index
25.6 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 0
R: Recency 100
Q: Quality 23
Tech Context
Vital Performance
0 DL / 30D
0.0%
Data Integrity 25.6 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-model--espnet--owls_4b_180k
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_model__espnet__owls_4b_180k,
  author = {espnet},
  title = {Owls 4b 180k Dataset},
  year = {2026},
  howpublished = {\url{https://free2aitools.com/dataset/hf-model--espnet--owls_4b_180k}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
espnet. (2026). Owls 4b 180k [Dataset]. Free2AITools. https://free2aitools.com/dataset/hf-model--espnet--owls_4b_180k

🔬 Technical Deep Dive

⚖️ Nexus Index V2.0

💬 Index Insight

FNI V2.0 for Owls 4b 180k: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:100), Quality (Q:23).

Dataset Specification

OWLS: Open Whisper-style Large-scale neural model Suite

Paper: arXiv:2502.10373

OWLS is a suite of Whisper-style models, designed to help researchers understand the scaling properties of speech models. OWLS models range from 0.25B to 18B parameters, and are trained on up to 360K hours of data.

OWLS models are developed using ESPnet, and support multilingual Speech Recognition and Translation.

It is part of the OWSM project, which aims to develop fully open speech foundation models using publicly available data and open-source toolkits.

The model in this repo has 4.66B parameters in total and is trained on 180k hours of public speech data. Specifically, it supports the following speech-to-text tasks:

  • Speech recognition
  • Any-to-any-language speech translation
  • Utterance-level alignment
  • Long-form transcription
  • Language identification

Use this model

You can use this model in your projects with the following code:

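python
# make sure the dependencies are installed:
#   pip install espnet espnet_model_zoo soundfile librosa
import librosa
import soundfile
from espnet2.bin.s2t_inference import Speech2Text

# download the pretrained checkpoint and build the inference wrapper
model = Speech2Text.from_pretrained(
  "espnet/owls_4B_180K"
)

# load the waveform, then resample it to the 16 kHz rate the model expects
speech, rate = soundfile.read("speech.wav")
speech = librosa.resample(speech, orig_sr=rate, target_sr=16000) # make sure 16k sampling rate
# keep the decoded text of the top hypothesis and discard the other fields
text, *_ = model(speech)[0]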
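
Before running the snippet above, it can help to check whether a recording already matches the 16 kHz rate it resamples to. The following is a minimal sketch using only the Python standard library. `inspect_wav` and `WavInfo` are hypothetical helper names, not part of ESPnet, and the check for a single channel is an assumption on my part rather than something this page states. The standard `wave` module only reads PCM WAV files, so other formats would need a different reader.

```python
import wave
from dataclasses import dataclass

TARGET_RATE = 16000  # sampling rate the usage snippet resamples to


@dataclass
class WavInfo:
    """Header facts about a WAV file (hypothetical helper type)."""

    sample_rate: int
    channels: int
    duration_s: float

    @property
    def needs_resample(self) -> bool:
        # True when the file is not already at the target rate
        return self.sample_rate != TARGET_RATE

    @property
    def needs_downmix(self) -> bool:
        # Assumption: inference is run on single-channel audio
        return self.channels != 1


def inspect_wav(path: str) -> WavInfo:
    """Read only the header of a PCM WAV file; no samples are decoded."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return WavInfo(rate, wav.getnchannels(), wav.getnframes() / rate)
```

A caller could skip the `librosa.resample` step when `inspect_wav("speech.wav").needs_resample` is false.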

OWLS models

| Model Name | Checkpoint | Training Artifacts |
|---|---|---|
| OWLS 0.25B 180K | https://huggingface.co/espnet/owls_025B_180K | TBA |
| OWLS 0.50B 180K | https://huggingface.co/espnet/owls_05B_180K | https://huggingface.co/espnet/owls_05B_180K_intermediates/tree/main |
| OWLS 1B 11K | TBA | TBA |
| OWLS 1B 22K | TBA | TBA |
| OWLS 1B 45K | TBA | TBA |
| OWLS 1B 90K | TBA | TBA |
| OWLS 1B 180K | https://huggingface.co/espnet/owls_1B_180K | TBA |
| OWLS 2B 180K | https://huggingface.co/espnet/owls_2B_180K | TBA |
| OWLS 4B 180K | https://huggingface.co/espnet/owls_4B_180K | https://huggingface.co/espnet/owls_4B_180K_intermediates |
| OWLS 9B 180K | https://huggingface.co/espnet/owls_9B_180K | https://huggingface.co/espnet/owls_9B_180K_intermediates |
| OWLS 18B 180K | https://huggingface.co/espnet/owls_18B_180K | TBA |
| OWLS 18B 360K | https://huggingface.co/espnet/owls_18B_360K | TBA |
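
For scripts that compare several sizes, the released checkpoints listed above can be kept in a small lookup. This is only a sketch: `CHECKPOINTS` and `checkpoint_url` are hypothetical names, the repository names are copied from the table, and rows listed as TBA are deliberately left out rather than guessed.

```python
from typing import Optional

HF_PREFIX = "https://huggingface.co/espnet/"

# (model size, training hours) -> repository name, copied from the table.
# Rows whose checkpoint is listed as TBA are omitted on purpose.
CHECKPOINTS = {
    ("0.25B", "180K"): "owls_025B_180K",
    ("0.50B", "180K"): "owls_05B_180K",
    ("1B", "180K"): "owls_1B_180K",
    ("2B", "180K"): "owls_2B_180K",
    ("4B", "180K"): "owls_4B_180K",
    ("9B", "180K"): "owls_9B_180K",
    ("18B", "180K"): "owls_18B_180K",
    ("18B", "360K"): "owls_18B_360K",
}


def checkpoint_url(size: str, hours: str) -> Optional[str]:
    """Return the checkpoint URL for a released model, or None if it is TBA."""
    repo = CHECKPOINTS.get((size, hours))
    return HF_PREFIX + repo if repo else None
```

Returning `None` for unreleased rows keeps a caller from requesting a repository that the table does not list.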

Citations

text
@article{chen2025owls,
  title={OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models},
  author={Chen, William and Tian, Jinchuan and Peng, Yifan and Yan, Brian and Yang, Chao-Han Huck and Watanabe, Shinji},
  journal={arXiv preprint arXiv:2502.10373},
  year={2025}
}
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

🛡️ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-model--espnet--owls_4b_180k
slug: espnet--owls_4b_180k
source: huggingface
author: espnet
license:
tags:

⚙️ Technical Specs

architecture: null
params billions: 4
context length: 184,320
pipeline tag:

📊 Engagement & Metrics

downloads: 0
stars: 0
forks: 0

Data indexed from public sources. Updated daily.