📊
Dataset

Oolong Synth

by oolongbench hf-dataset--oolongbench--oolong-synth
Nexus Index
42.0 Top 0%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

--- dataset_info: features: - name: id dtype: int64 - name: context_len dtype: int64 - name: dataset dtype: string - name: context_window_text dtype: string - name: context_window_text_with_labels dtype: string - name: question dtype: string - name: task_group dtype: string - name: task dtype: string - name: answer dtype: string - name: answer_type dtype: string - name: input_subset dtype: string - name: num_labels dtype: int64 - name: context_window_id dtype: int64 splits: - name: validation...

Data Integrity 42 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--oolongbench--oolong-synth
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__oolongbench__oolong_synth,
  author = {oolongbench},
  title = {Oolong Synth Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/oolongbench/oolong-synth}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
oolongbench. (2026). Oolong Synth [Dataset]. Free2AITools. https://huggingface.co/datasets/oolongbench/oolong-synth

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

âš–ī¸ Nexus Index V2.0

42.0
ESTIMATED IMPACT TIER
Semantic (S) 50
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

đŸ’Ŧ Index Insight

FNI V2.0 for Oolong Synth: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
âŦ‡ī¸
Downloads
45,778
â¤ī¸
Likes
3

đŸ‘ī¸ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

đŸ§Ŧ Field Logic

đŸ§Ŧ

Schema not yet indexed for this dataset.

Dataset Specification


dataset_info:
features:

  • name: id
    dtype: int64
  • name: context_len
    dtype: int64
  • name: dataset
    dtype: string
  • name: context_window_text
    dtype: string
  • name: context_window_text_with_labels
    dtype: string
  • name: question
    dtype: string
  • name: task_group
    dtype: string
  • name: task
    dtype: string
  • name: answer
    dtype: string
  • name: answer_type
    dtype: string
  • name: input_subset
    dtype: string
  • name: num_labels
    dtype: int64
  • name: context_window_id
    dtype: int64
    splits:
  • name: validation
    num_bytes: 3189333135.268293
    num_examples: 1300
  • name: test
    num_bytes: 20979703314.710743
    num_examples: 5200
    download_size: 12400515844
    dataset_size: 24169036449.979034
    configs:
  • config_name: default
    data_files:
    • split: validation
      path: data/validation-*
    • split: test
      path: data/test-*

Oolong-synth is a dataset from the paper Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities. See the paper for more details on the dataset construction.

To run the standard evaluation setting you will need:

  • input: context_window_text + "\n" + question (these are separated because the context window text can be cached for reuse across multiple input queries)
  • output: answer
Top Tier

Social Proof

HuggingFace Hub
3Likes
45.8KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--oolongbench--oolong-synth
source
huggingface
author
oolongbench
tags
size_categories:10kformat:parquetmodality:tabularmodality:textlibrary:datasetslibrary:dasklibrary:polarslibrary:mlcroissantarxiv:2511.02817region:us

âš™ī¸ Technical Specs

architecture
null
params billions
null
context length
null

📊 Engagement & Metrics

likes
3
downloads
45,778

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)