📊
Dataset

Maptrace 20k

by Voxel51 hf-dataset--voxel51--maptrace_20k
Nexus Index
41.0 Top 0%
P / V / C / U Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%

!image/png This is a FiftyOne dataset with 20000 samples. If you haven't already, install FiftyOne: MapTrace is a synthetic dataset for path tracing on maps. The dataset contains annotated paths design...

Data Integrity 41 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--voxel51--maptrace_20k
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__voxel51__maptrace_20k,
  author = {Voxel51},
  title = {Maptrace 20k Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/Voxel51/maptrace_20k}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
Voxel51. (2026). Maptrace 20k [Dataset]. Free2AITools. https://huggingface.co/datasets/Voxel51/maptrace_20k

🔬Technical Deep Dive

Full Specifications [+]

⚖️ Nexus Index V16.5

41.0
ESTIMATED IMPACT TIER
Popularity (P) 0
Freshness (F) 0
Completeness (C) 0
Utility (U) 0

💬 Index Insight

The Free2AITools Nexus Index for Maptrace 20k aggregates Popularity (P:0), Freshness (F:0), and Completeness (C:0). The Utility score (U:0) represents deployment readiness and ecosystem adoption.

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
⬇️
Downloads
16,152
❤️
Likes
1

👁️ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification


annotations_creators: []
language: en
size_categories:

  • 10K<n<100K
    task_categories: []
    task_ids: []
    pretty_name: maptrace_20k
    tags:

  • fiftyone

  • image
    dataset_summary: '

    This is a FiftyOne dataset with 20000 samples.

    Installation

    If you haven''t already, install FiftyOne:

    
    pip install -U fiftyone
    

    Usage

    
    import fiftyone as fo
    
    

    from fiftyone.utils.huggingface import load_from_hub

    Load the dataset

    Note: other available arguments include ''max_samples'', etc

    dataset = load_from_hub("Voxel51/maptrace_20k")

    Launch the App

    session = fo.launch_app(dataset)

    '


Dataset Card for MapTrace-20k

image/png

This is a FiftyOne dataset with 20000 samples.

Installation

If you haven't already, install FiftyOne:

pip install -U fiftyone

Usage

import fiftyone as fo
from fiftyone.utils.huggingface import load_from_hub

Load the dataset

Note: other available arguments include 'max_samples', etc

dataset = load_from_hub("Voxel51/maptrace_20k")

Launch the App

session = fo.launch_app(dataset)

Dataset Details

Dataset Description

MapTrace is a synthetic dataset for path tracing on maps. The dataset contains annotated paths designed to train vision-language models on route-tracing tasks. Each sample consists of a map image annotated with start (green) and end (red) positions, along with a natural language prompt and ground truth path coordinates.

The maptrace_20k split used here contains paths on stylized maps such as those found in brochures, park directories, or shopping malls.

  • Curated by: Google
  • Language(s) (NLP): English
  • License: CC-BY-4.0

Dataset Sources

Uses

Direct Use

This dataset is intended for training and evaluating vision-language models on spatial reasoning and path-tracing tasks. Models are expected to interpret map images with marked start/end locations and output coordinate sequences representing valid paths between those points.

Dataset Structure

Original Schema (Hugging Face)

The maptrace_20k split contains the following fields:

  • image: The image bytes of the map, annotated with start and end positions
  • label: A string representation of a list of (x, y) coordinate tuples defining the target path (normalized between 0 and 1)
  • input: A natural language prompt asking the model to find the path

FiftyOne Schema

The FiftyOne dataset converts the original format into the following structure:

Sample Fields:

  • filepath: Path to the PNG image file
  • input (StringField): The natural language prompt describing the task
  • ground_truth (Keypoints): The path represented as keypoints with the following properties:
    • Each keypoint is labeled alphabetically (A, B, C, ..., Z, AA, AB, etc.)
    • Points are normalized coordinates in [0, 1] range
    • The number of keypoints varies per sample

Dataset-Level Attributes:

  • default_skeleton: A KeypointSkeleton that connects sequential keypoints (A→B→C→D...) to visualize the path as a connected polyline in the FiftyOne App

Dataset Creation

Source Data

Data Collection and Processing

The dataset is synthetically generated. Maps are created using text-to-image generation models from natural language map descriptions. Paths are then annotated on these synthetic map images with start positions marked in green and end positions marked in red.

Citation

BibTeX:

@dataset{maptrace2024,
  title={MapTrace: A 2M-Sample Synthetic Dataset for Path Tracing on Maps},
  author={Google},
  year={2024},
  publisher={Hugging Face},
  url={https://huggingface.co/datasets/google/MapTrace}
}
Top Tier

Social Proof

HuggingFace Hub
1Likes
16.2KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--voxel51--maptrace_20k
source
huggingface
author
Voxel51
tags
language:ensize_categories:1kformat:imagefoldermodality:imagelibrary:datasetslibrary:mlcroissantlibrary:fiftyoneregion:usfiftyoneimage

⚙️ Technical Specs

architecture
null
params billions
null
context length
20,480

📊 Engagement & Metrics

likes
1
downloads
16,152

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)