Dataset

Fineweb Edu Chinese V2.1

by opencsg (hf-dataset--opencsg--fineweb-edu-chinese-v2.1)
Nexus Index
36.1 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 56
R: Recency 64
Q: Quality 30
Tech Context
Vital Performance
0 DL / 30D
0.0%
Data Integrity 36.1 FNI Score
Size: -
Rows: -
Format: Parquet
Tokens: -
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--opencsg--fineweb-edu-chinese-v2.1
License Apache-2.0
Provider huggingface

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__opencsg__fineweb_edu_chinese_v2.1,
  author = {opencsg},
  title = {Fineweb Edu Chinese V2.1 Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/opencsg/fineweb-edu-chinese-v2.1}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
opencsg. (2026). Fineweb Edu Chinese V2.1 [Dataset]. Free2AITools. https://huggingface.co/datasets/opencsg/fineweb-edu-chinese-v2.1

🔬Technical Deep Dive


⚖️ Nexus Index V2.0


💬 Index Insight

FNI V2.0 for Fineweb Edu Chinese V2.1: Semantic (S:50), Authority (A:0), Popularity (P:56), Recency (R:64), Quality (Q:30).

Downloads
63,600


🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.


🛡️ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id
hf-dataset--opencsg--fineweb-edu-chinese-v2.1
slug
opencsg--fineweb-edu-chinese-v2.1
source
huggingface
author
opencsg
license
Apache-2.0
tags
task_categories:text-generation, language:zh, license:apache-2.0, size_categories:100m<n<1b, format:parquet, modality:text, library:datasets, library:dask, library:mlcroissant, library:polars, arxiv:2501.08197, region:us
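
The tags above follow a `key:value` convention, so they can be grouped by key for filtering or display. The following is a minimal sketch, not part of the listing itself: the helper name `parse_tags` and the `"untagged"` fallback key are hypothetical, and the tag string is copied verbatim from the metadata above.

```python
from collections import defaultdict

# Tag string copied verbatim from the listing's "tags" field.
TAGS = (
    "task_categories:text-generation, language:zh, license:apache-2.0, "
    "size_categories:100m<n<1b, format:parquet, modality:text, "
    "library:datasets, library:dask, library:mlcroissant, library:polars, "
    "arxiv:2501.08197, region:us"
)


def parse_tags(raw: str) -> dict[str, list[str]]:
    """Group comma-separated 'key:value' tags by key (hypothetical helper)."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for tag in raw.split(","):
        tag = tag.strip()
        if not tag:
            continue
        # Split on the first colon only, so values may contain colons.
        key, sep, value = tag.partition(":")
        if sep:
            grouped[key].append(value)
        else:
            # Assumption: tags without a colon are collected under "untagged".
            grouped["untagged"].append(key)
    return dict(grouped)


tags = parse_tags(TAGS)
print(tags["library"])
print(tags["format"])
```

Grouping this way makes repeated keys such as `library` available as a list, while single-valued keys such as `format` or `license` come back as one-element lists.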

📊 Engagement & Metrics

downloads
63,600
stars
67
forks
0

Data indexed from public sources. Updated daily.