📊
Dataset

Dolma3 Pool

by allenai hf-dataset--allenai--dolma3_pool
Nexus Index
23.0 Top 2%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

⚠️ **IMPORTANT NOTICE** ⚠️ This is the Dolma 3 *pool*, pre–quality upsampling and mixing. If you are interested in *the data used* to train Olmo 3 7B and Olmo 3 32B, visit **allenai/dolma3_mix-6T-1025**. -----

Data Integrity 23 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--allenai--dolma3_pool
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__allenai__dolma3_pool,
  author = {allenai},
  title = {Dolma3 Pool Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/allenai/dolma3_pool}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
allenai. (2026). Dolma3 Pool [Dataset]. Free2AITools. https://huggingface.co/datasets/allenai/dolma3_pool

🔬Technical Deep Dive

Full Specifications [+]

⚖️ Nexus Index V2.0

23.0
ESTIMATED IMPACT TIER
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

💬 Index Insight

FNI V2.0 for Dolma3 Pool: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
⬇️
Downloads
48,215
❤️
Likes
28

👁️ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Top Tier

Social Proof

HuggingFace Hub
28Likes
48.2KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--allenai--dolma3_pool
source
huggingface
author
allenai
tags
task_categories:text-generationlanguage:enlicense:odc-bysize_categories:10mformat:jsonmodality:textlibrary:datasetslibrary:dasklibrary:mlcroissantarxiv:2512.13961region:us

⚙️ Technical Specs

architecture
null
params billions
null
context length
null

📊 Engagement & Metrics

likes
28
downloads
48,215

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)