Dataset

FineWeb-Edu

by HuggingFaceFW hf-dataset--huggingfacefw--fineweb-edu
Nexus Index
29.0 Top 1%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

---
task_categories:
- text-generation
language:
- en
pretty_name: FineWeb-Edu
size_categories:
- n>1T
configs:
- config_name: default
  data_files:
  - split: train
    path: data/*/*
  features:
  - name: text
    dtype: string
  - name: id
    dtype: string
  - name: dump
    dtype: string
  - name: url
    dtype: string
  - name: date
    dtype: string
  - name: file_path
    dtype: string
  - name: language
    dtype: string
  - name: language_score
    dtype: float64
  - name: token_count
    dtype: int64
  - name: score
    dtype: float64
...
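The feature list in the card metadata above can be turned into a quick row-level check. The following is a minimal sketch, not part of the dataset's own tooling: the `FEATURES` mapping copies the field names and dtypes from the card (which is truncated after `score`, so the mapping may be incomplete), while `schema_problems` and the sample row values are hypothetical names and data introduced only for illustration.

```python
from typing import Any

# Field names and dtypes copied from the card's `features` block.
# The card text is cut off after `score`, so further fields may exist.
FEATURES: dict[str, type] = {
    "text": str,
    "id": str,
    "dump": str,
    "url": str,
    "date": str,
    "file_path": str,
    "language": str,
    "language_score": float,  # float64 in the card
    "token_count": int,       # int64 in the card
    "score": float,           # float64 in the card
}


def _matches(value: Any, expected: type) -> bool:
    """Return True if value is acceptable for the expected Python type."""
    if isinstance(value, bool):
        # bool is a subclass of int; no card field is boolean.
        return False
    if expected is float:
        # Accept ints where a float64 is expected.
        return isinstance(value, (int, float))
    return isinstance(value, expected)


def schema_problems(row: dict[str, Any]) -> list[str]:
    """List missing fields and type mismatches for one row (hypothetical helper)."""
    problems = []
    for name, expected in FEATURES.items():
        if name not in row:
            problems.append(f"missing field: {name}")
        elif not _matches(row[name], expected):
            problems.append(
                f"{name}: expected {expected.__name__}, got {type(row[name]).__name__}"
            )
    return problems


# Made-up sample row, for illustration only; not taken from the dataset.
sample_row = {
    "text": "Photosynthesis converts light into chemical energy.",
    "id": "example-id",
    "dump": "example-dump",
    "url": "https://example.org/page",
    "date": "2024-01-01",
    "file_path": "example/path",
    "language": "en",
    "language_score": 0.98,
    "token_count": 9,
    "score": 3.5,
}

print(schema_problems(sample_row))
```

Rows fetched by any means (for example, streamed with the Hugging Face `datasets` library) could be passed through `schema_problems` before further processing; an empty list means every listed field is present with a compatible type.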

Data Integrity 29 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--huggingfacefw--fineweb-edu
Provider huggingface

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__huggingfacefw__fineweb_edu,
  author = {HuggingFaceFW},
  title = {FineWeb-Edu Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
HuggingFaceFW. (2026). FineWeb-Edu [Dataset]. Free2AITools. https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu

🔬Technical Deep Dive


⚖️ Nexus Index V2.0

29.0
ESTIMATED IMPACT TIER
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0


Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
Downloads: 352,144
Likes: 938


Dataset Specification

Top Tier

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.


🛡️ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--huggingfacefw--fineweb-edu
source
huggingface
author
HuggingFaceFW
tags
task_categories:text-generation, language:en, license:odc-by, size_categories:1b, format:parquet, modality:tabular, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, arxiv:2406.17557, arxiv:2404.14219, arxiv:2401.10020, arxiv:2109.07445, doi:10.57967/hf/2497, region:us
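
The tags above follow a `key:value` pattern, so they can be grouped by key for filtering or display. This is a minimal sketch under two assumptions: the tag boundaries are as read from the listing (the page runs them together without separators), and `group_tags` is a hypothetical helper, not an API of the Hugging Face Hub.

```python
from collections import defaultdict

# Tags as read from the listing; boundaries are an assumption because the
# page shows them without separators.
TAGS = [
    "task_categories:text-generation",
    "language:en",
    "license:odc-by",
    "size_categories:1b",
    "format:parquet",
    "modality:tabular",
    "modality:text",
    "library:datasets",
    "library:dask",
    "library:polars",
    "library:mlcroissant",
    "arxiv:2406.17557",
    "arxiv:2404.14219",
    "arxiv:2401.10020",
    "arxiv:2109.07445",
    "doi:10.57967/hf/2497",
    "region:us",
]


def group_tags(tags: list[str]) -> dict[str, list[str]]:
    """Group 'key:value' tags by key, splitting on the first colon only."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for tag in tags:
        key, sep, value = tag.partition(":")
        if not sep:
            # A tag without a colon is filed under a catch-all key.
            key, value = "untyped", tag
        grouped[key].append(value)
    return dict(grouped)


grouped = group_tags(TAGS)
print(grouped["library"])
print(grouped["arxiv"])
```

Splitting on the first colon only keeps values such as the DOI intact even though they contain further punctuation.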



Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)