Dataset

Dolma3 Pool

by allenai
ID: hf-dataset--allenai--dolma3_pool
FNI Rank: 23
Percentile: Top 2%
Activity: → 0.0%

⚠️ **IMPORTANT NOTICE** ⚠️ This is the Dolma 3 *pool*, pre–quality upsampling and mixing. If you are interested in *the data used* to train Olmo 3 7B and Olmo 3 32B, visit **allenai/dolma3_mix-6T-1025**. -----

Data Integrity
FNI Score: 23
Size: -
Rows: -
Format: Parquet
Tokens: -
Dataset Information Summary
Entity Passport
Registry ID: hf-dataset--allenai--dolma3_pool
Provider: huggingface
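
The registry ID appears to encode the Hugging Face repository it points to. The sketch below splits it back into a Hub repo ID and URL. The layout `hf-dataset--<owner>--<name>` is an assumption inferred from this page alone (the ID and the URL in the citation), and `parse_registry_id` and `HubDatasetRef` are hypothetical names, not part of any published API.

```python
from typing import NamedTuple


class HubDatasetRef(NamedTuple):
    owner: str  # e.g. "allenai"
    name: str   # e.g. "dolma3_pool"

    @property
    def repo_id(self) -> str:
        return f"{self.owner}/{self.name}"

    @property
    def url(self) -> str:
        return f"https://huggingface.co/datasets/{self.repo_id}"


def parse_registry_id(registry_id: str) -> HubDatasetRef:
    """Split 'hf-dataset--<owner>--<name>' into owner and name.

    Assumed layout; only dataset IDs are handled.
    """
    parts = registry_id.split("--", 2)
    if len(parts) != 3 or parts[0] != "hf-dataset" or not all(parts[1:]):
        raise ValueError(f"unrecognized registry ID: {registry_id!r}")
    return HubDatasetRef(owner=parts[1], name=parts[2])


ref = parse_registry_id("hf-dataset--allenai--dolma3_pool")
print(ref.repo_id)  # allenai/dolma3_pool
print(ref.url)      # https://huggingface.co/datasets/allenai/dolma3_pool
```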

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__allenai__dolma3_pool,
  author = {allenai},
  title = {Dolma3 Pool Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/allenai/dolma3_pool}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
allenai. (2026). Dolma3 Pool [Dataset]. Free2AITools. https://huggingface.co/datasets/allenai/dolma3_pool

Technical Deep Dive


Free2AI Nexus Index

FNI Score: 23.0
Top 2% Overall Impact
Popularity (P): 0
Velocity (V): 0
Credibility (C): 0
Utility (U): 0
Nexus Verified Data

Why this score?

The Nexus Index for Dolma3 Pool aggregates Popularity (P:0), Velocity (V:0), and Credibility (C:0). The Utility score (U:0) represents deployment readiness, context efficiency, and structural reliability within the Nexus ecosystem.

Data Verified | Last Updated: Not calculated
Free2AI Nexus Index | Fair · Transparent · Explainable | Full Methodology
⬇️
Downloads
48,215
❀️
Likes
28

Data Preview

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.


🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

Top Tier

Social Proof

HuggingFace Hub
28 Likes
48.2K Downloads
Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.


Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

Identity & Source

id: hf-dataset--allenai--dolma3_pool
source: huggingface
author: allenai
tags: task_categories:text-generation, language:en, license:odc-by, size_categories:10m, format:json, modality:text, library:datasets, library:dask, library:mlcroissant, arxiv:2512.13961, region:us
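
Each tag follows a `key:value` pattern, so the tags can be grouped by key for filtering or display. The following is a minimal sketch: the tag strings are copied from this page, while `group_tags` and the `"other"` bucket for tags without a key are illustrative choices, not part of any registry API.

```python
from collections import defaultdict

# Tags as listed on this page, one "key:value" string per tag.
TAGS = [
    "task_categories:text-generation",
    "language:en",
    "license:odc-by",
    "size_categories:10m",
    "format:json",
    "modality:text",
    "library:datasets",
    "library:dask",
    "library:mlcroissant",
    "arxiv:2512.13961",
    "region:us",
]


def group_tags(tags: list[str]) -> dict[str, list[str]]:
    """Group 'key:value' tags by key, preserving the order of values."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for tag in tags:
        key, sep, value = tag.partition(":")
        if not sep:
            # A tag with no key goes into a catch-all bucket.
            key, value = "other", tag
        grouped[key].append(value)
    return dict(grouped)


meta = group_tags(TAGS)
print(meta["license"])  # ['odc-by']
print(meta["library"])  # ['datasets', 'dask', 'mlcroissant']
```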

Technical Specs

architecture: null
params billions: null
context length: null

Engagement & Metrics

likes: 28
downloads: 48,215

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)