📊
Dataset

Image As An Imu Finetuning

by jerredchen00 hf-dataset--jerredchen00--image-as-an-imu-finetuning
Nexus Index
36.4 Top 100%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%
Data Integrity 36.4 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--jerredchen00--image-as-an-imu-finetuning
License Apache-2.0
Provider huggingface
📜

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__jerredchen00__image_as_an_imu_finetuning,
  author = {jerredchen00},
  title = {Image As An Imu Finetuning Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/jerredchen00/image-as-an-imu-finetuning}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
jerredchen00. (2026). Image As An Imu Finetuning [Dataset]. Free2AITools. https://huggingface.co/datasets/jerredchen00/image-as-an-imu-finetuning

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

âš–ī¸ Nexus Index V2.0

36.4
ESTIMATED IMPACT TIER
Semantic (S) 0
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

đŸ’Ŧ Index Insight

FNI V2.0 for Image As An Imu Finetuning: Semantic (S:0), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
âŦ‡ī¸
Downloads
90,080

đŸ‘ī¸ Data Preview

📊

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

🔗 Explore Full Dataset ↗

đŸ§Ŧ Field Logic

đŸ§Ŧ

Schema not yet indexed for this dataset.

Dataset Specification

Image as an IMU: Real-world Finetuning Dataset

Official real-world finetuning dataset from Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image (ICCV 2025 Oral).

[arXiv] [Webpage] [GitHub]

PIXL, University of Oxford

Jerred Chen, Ronald Clark


Dataset Details

This dataset consists of 32 sequences of real-world motion-blurred videos in various indoor scenes, captured using the iPhone 13 camera.

dataset_train_real-world.csv and dataset_val_real-world.csv are the CSV files used for training/validating the model in the paper. These can be directly plugged into the provided dataloader in the GitHub.

The CSVs provide the following:

  • blurred: the relative path to the (resized 320x224) motion-blurred RGB image
  • ts1,ts2: the frame timestamps between the previous RGB and next RGB image
  • fx,fy,cx,cy: the scaled camera intrinsics, corresponding to the 320x224 image
  • bRa_qx,bRa_qy,bRa_qz,bRa_qw: body-frame rotational velocity, parameterized as a quaternion
  • bta_x,bta_y,bta_z: body-frame translational velocity
  • exposure: exposure time at the given image
  • sequence: the sequence name

Sequence Details

Each sequence consists of the following:

text
sequence1/
├─ blurry_frames_320x224
│  ├─ XXXXXX.jpg
│  └─ ...
├─ confidence
│  ├─ XXXXXX.png
│  └─ ...
├─ depth
│  ├─ XXXXXX.png
│  └─ ...
├─ rgb
│  ├─ XXXXXX.jpg
│  └─ ...
├─ rgb_320x224
│  ├─ XXXXXX.jpg
│  └─ ...
└─ blurred_frames_320x224.csv
└─ camera_matrix.csv
└─ camera_matrix_320x224.csv
└─ imu.csv
└─ odometry.csv
└─ velocities.csv

Sequences were recorded using the StrayScanner app, slightly modified to also obtain the exposure time from ARKit. confidence, depth, rgb, camera_matrix.csv, imu.csv, and odometry.csv are the original outputs from StrayScanner.

We provide the following data in addition to the StrayScanner outputs:

  • rgb_320x224 are the resized recorded RGB images
  • blurry_frames_320x224 are the identified frames with more extensive blur using FFT as described in Liu et al.
  • camera_matrix_320x224.csv are the corresponding scaled camera intrinsics
  • velocities.csv consist of the translational velocities computed from ARKit poses in odometry.csv and the rotational velocities directly from the gyroscope.

Of course, the RGB images/camera intrinsics can be resized/scaled online during training; we provide this to maintain consistency with our own training.

Since the ARKit computed poses can have very large errors, dataset_train_real-world.csv consists of manually filtered samples without large outlier pose estimates.

📊 Structured Schema (Zero-Fabrication)

Feature Key Data Type
blurred string
ts1 float64
ts2 float64
fx float64
fy float64
cx float64
cy float64
bRa_qx float64
bRa_qy float64
bRa_qz float64
bRa_qw float64
bta_x float64
bta_y float64
bta_z float64
exposure float64
sequence string

Estimated Rows: 13,232

Social Proof

HuggingFace Hub
90.1KDownloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

🆔 Identity & Source

id
hf-dataset--jerredchen00--image-as-an-imu-finetuning
slug
jerredchen00--image-as-an-imu-finetuning
source
huggingface
author
jerredchen00
license
Apache-2.0
tags
license:apache-2.0, size_categories:10k<n<100k, format:csv, modality:image, modality:tabular, modality:text, library:datasets, library:pandas, library:mlcroissant, library:polars, arxiv:2503.17358, region:us

âš™ī¸ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

📊 Engagement & Metrics

downloads
90,080
stars
0
forks
0

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)