Molmo2 Synmultiimageqa
Pillar scores are computed during the next indexing cycle.
--- dataset_info: - config_name: chart features: - name: id dtype: string - name: images sequence: image - name: code sequence: string - name: qa_pairs sequence: - name: question dtype: string - name: explanation dtype: string - name: answer dtype: string - name: qa_pairs_raw sequence: - name: question dtype: string - name: explanation dtype: string - name: answer dtype: string - name: metadata struct: - name: content_type dtype: string - name: persona dtype: string - name: overall_descriptio...
| Entity Passport | |
| Registry ID | hf-dataset--allenai--molmo2-synmultiimageqa |
| Provider | huggingface |
Cite this dataset
Academic & Research Attribution
@misc{hf_dataset__allenai__molmo2_synmultiimageqa,
author = {allenai},
title = {Molmo2 Synmultiimageqa Dataset},
year = {2026},
howpublished = {\url{https://huggingface.co/datasets/allenai/Molmo2-SynMultiImageQA}},
note = {Accessed via Free2AITools Knowledge Fortress}
} π¬Technical Deep Dive
Full Specifications [+]βΎ
βοΈ Nexus Index V2.0
π¬ Index Insight
FNI V2.0 for Molmo2 Synmultiimageqa: Semantic (S:50), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).
Verification Authority
ποΈ Data Preview
Row-level preview not available for this dataset.
Schema structure is shown in the Field Logic panel when available.
π Explore Full Dataset β𧬠Field Logic
Schema not yet indexed for this dataset.
Dataset Specification
dataset_info:
- config_name: chart
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 31377766981.464
num_examples: 48988 - name: validation
num_bytes: 664539837.896
num_examples: 1024
download_size: 31320308759
dataset_size: 32042306819.36
- name: id
- config_name: chemical
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 424964508.464
num_examples: 4872 - name: validation
num_bytes: 24206206
num_examples: 256
download_size: 451957736
dataset_size: 449170714.464
- name: id
- config_name: circuit
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 1116565068.592
num_examples: 3416 - name: validation
num_bytes: 93863865
num_examples: 256
download_size: 1211487359
dataset_size: 1210428933.592
- name: id
- config_name: diagram
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 14826450271.401
num_examples: 22753 - name: validation
num_bytes: 347717025
num_examples: 512
download_size: 15269447929
dataset_size: 15174167296.401
- name: id
- config_name: doc
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 183673493107.72
num_examples: 56454 - name: validation
num_bytes: 3359296700.016
num_examples: 1024
download_size: 190039981963
dataset_size: 187032789807.736
- name: id
- config_name: graphic
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 2044986784.82
num_examples: 11956 - name: validation
num_bytes: 86888121
num_examples: 512
download_size: 1911609124
dataset_size: 2131874905.82
- name: id
- config_name: music
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 406399250.29
num_examples: 4785 - name: validation
num_bytes: 23272894
num_examples: 256
download_size: 414041290
dataset_size: 429672144.29
- name: id
- config_name: table
features:- name: id
dtype: string - name: images
sequence: image - name: code
sequence: string - name: qa_pairs
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: qa_pairs_raw
sequence:- name: question
dtype: string - name: explanation
dtype: string - name: answer
dtype: string
- name: question
- name: metadata
struct:- name: content_type
dtype: string - name: persona
dtype: string - name: overall_description
dtype: string - name: num_images
dtype: int32
splits:
- name: content_type
- name: train
num_bytes: 55434259170.648
num_examples: 33736 - name: validation
num_bytes: 1673423925.832
num_examples: 1024
download_size: 55204600074
dataset_size: 57107683096.48
configs:
- name: id
- config_name: chart
data_files:- split: train
path: chart/train-* - split: validation
path: chart/validation-*
- split: train
- config_name: chemical
data_files:- split: train
path: chemical/train-* - split: validation
path: chemical/validation-*
- split: train
- config_name: circuit
data_files:- split: train
path: circuit/train-* - split: validation
path: circuit/validation-*
- split: train
- config_name: diagram
data_files:- split: train
path: diagram/train-* - split: validation
path: diagram/validation-*
- split: train
- config_name: doc
data_files:- split: train
path: doc/train-* - split: validation
path: doc/validation-*
- split: train
- config_name: graphic
data_files:- split: train
path: graphic/train-* - split: validation
path: graphic/validation-*
- split: train
- config_name: music
data_files:- split: train
path: music/train-* - split: validation
path: music/validation-*
- split: train
- config_name: table
data_files:- split: train
path: table/train-* - split: validation
path: table/validation-*
license: odc-by
task_categories:
- split: train
- visual-question-answering
size_categories: - 100K<n<1M
Molmo2-SynMultiImageQA
Molmo2-SynMultiImageQA is a collection of synthetic multi-image question-answer pairs about various kinds of text-rich images, including charts, tables, documents, diagrams, etc.
The synthetic data is generated by extending the CoSyn framework into multi-image settings,
with Claude-sonnet-4-5 as the coding LLM to generate code that can be executed to render an image.
Then, we use GPT-5 to generate question-answer pairs with code (without using the rendered image).
Molmo2-SynMultiImageQA is part of the Molmo2 dataset collection and was used to train the Molmo2 family of models.
Quick links:
- π Paper
- π₯ Blog with Videos
Loading
The dataset has eight subsets:
chart: charts and plotschemical: chemical structurescircuit: diagrams of electrical circuitsdiagram: diagram and graphsdocument: various types of documentsgraphic: vector graphicsmusic: music sheetstable: tables and sheets
Use config_name to specify which one to load. By default, chart will be loaded. For example:
table_dataset = datasets.load_dataset("allenai/Molmo2-SynMultiImageQA", "table", split="train")
Data Format
Each row of the example has the following information:
id: the unique ID of each exampleimages: a list of rendered images from the codecode: a list of the source code for each imageqa_pairs: a list of questions, answers, and chain-of-thought explanationsqa_pairs_raw: the raw format of QA pairs without replacing the image reference (<IMAGE-N>)to natural format.metadata: metadata of each example, including the content type, persona, overall descriptions, and the number of images.
Splits
The data is divided into validation and train splits. These splits are "unofficial" because we do not generally use this data for evaluation anyway.
However, they reflect what was used when training the Molmo2 models, which were only trained on the train splits.
License
This dataset is licensed under ODC-BY. It is intended for research and educational use in accordance with Ai2βs Responsible Use Guidelines.
This dataset includes synthetic images from model outputs using code generated from Claude-Sonnet-4.5, which is subject to Anthropic's Terms of Service.
The questions are generated from GPT-5, which is subject to OpenAIβs Terms of Use.
Social Proof
AI Summary: Based on Hugging Face metadata. Not a recommendation.
π‘οΈ Dataset Transparency Report
Verified data manifest for traceability and transparency.
π Identity & Source
- id
- hf-dataset--allenai--molmo2-synmultiimageqa
- source
- huggingface
- author
- allenai
- tags
- task_categories:visual-question-answeringlicense:odc-bysize_categories:100k
format:parquetmodality:imagemodality:textlibrary:datasetslibrary:dasklibrary:mlcroissantlibrary:polarsregion:us
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
π Engagement & Metrics
- likes
- 5
- downloads
- 15,761
Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)