Raw Rain Bayer
| Entity Passport | |
| Registry ID | hf-dataset--realrainmaker--raw-rain-bayer |
| License | CC-BY-4.0 |
| Provider | huggingface |
Cite this dataset
Academic & Research Attribution
@misc{hf_dataset__realrainmaker__raw_rain_bayer,
author = {realrainmaker},
title = {Raw Rain Bayer Dataset},
year = {2026},
howpublished = {\url{https://huggingface.co/datasets/realrainmaker/raw-rain-bayer}},
note = {Accessed via Free2AITools Knowledge Fortress}
} π¬Technical Deep Dive
Full Specifications [+]βΎ
βοΈ Nexus Index V2.0
π¬ Index Insight
FNI V2.0 for Raw Rain Bayer: Semantic (S:50), Authority (A:0), Popularity (P:57), Recency (R:47), Quality (Q:30).
Verification Authority
ποΈ Data Preview
Row-level preview not available for this dataset.
Schema structure is shown in the Field Logic panel when available.
π Explore Full Dataset β𧬠Field Logic
Schema not yet indexed for this dataset.
Dataset Specification
Dataset Card for RΒ³ Bayer Dataset
Table of Contents
Dataset Description
- Homepage: https://arxiv.org/abs/2509.24022
- Paper: RΒ³: Reconstruction, Raw, and Rain: Deraining Directly in the Bayer Domain
Dataset Summary
The RΒ³ (Reconstruction, Raw, and Rain) Bayer Dataset is a large-scale, real-world stereo dataset designed for deraining tasks directly in the raw Bayer domain. It was collected using a dual-camera synchronized setup capturing raw Bayer images. Unlike popular synthetic datasets, RΒ³ uses a real-world rain simulation system to generate realistic rain artifacts, including scene depth effects and droplets on windshields. The dataset captures diverse scenarios, including varying lighting conditions (day to low-light evening), scenes behind glass, moving vehicles, and active windshield wipers, enabling the evaluation of model robustness to dynamic occlusions and motion artifacts.
The RAW-RAIN-bayer Dataset is one part of a two part dataset, the other being RAW-RAIN-rgb Dataset.
Both Datasets are released within the context of research done for our paper: https://arxiv.org/abs/2509.24022
This dataset contains the raw .pgm images capturing the Bayer pattern data before ISP processing.
Supported Tasks and Leaderboards
- Raw Image Deraining: Removing rain streaks and droplets directly from raw sensor data.
- Stereo Vision/Depth Estimation in Adverse Weather: utilizing the stereo pairs for reconstruction tasks under rainy conditions.
- Multi-camera low-level vision.
Dataset Structure
Data Instances
A typical data instance represents a single synchronized stereo frame. It includes the raw Bayer image corrupted by rain, its corresponding ground truth raw Bayer image (static scene), and associated metadata.
Data Fields
Under each scene gt/rain you can find the metadata.csv which holds the following fields:
frame: integer, the sequence number of the frame within the scene.camera_id: string, "camera_0" (right camera) or "camera_1" (left camera).timestamp: float, capture time from metadata.exposure: float, exposure time in ms.analog_gain: float, fixed gain value (set to 1 during collection).
Data Splits
The dataset is divided into Train, Identity, Test, and Validation splits.
| Split | Scenes | Total Frames | Rain Density | Illumination | Notes |
|---|---|---|---|---|---|
| Train | 89 | 48,000 | lightβheavy | day+evening | Diverse scenarios (glass, vehicle, wipers). |
| Identity | 30 | 18,000 | None | day | Used for reconstruction/identity tasks. |
| Test | 10 | 6,000 | lightβheavy | day+evening | Held-out evaluation set. |
File Structure Mapping
The underlying data is stored in a nested folder structure. This dataset loader flattens the structure into individual examples. The original structure is as follows:
root/
βββ [split_name]/ # e.g., train, test, identity
β βββ [split_name]_[scene_number]/ # e.g., train_1, test_3
β βββ metadata.csv # Contains: frameId, timestamp, exposure, analogGain
β βββ gt/ # Ground Truth - Static Scene
β β βββ camera_0/ # Right Camera
β β β βββ [raw_images].pgm # Any image here serves as static GT
β β βββ camera_1/ # Left Camera
β β βββ [raw_images].pgm
β βββ rain/ # Rainy Scenario - Video Sequence
β βββ camera_0/ # Right Camera
β β βββ raw_9.pgm
β β βββ raw_10.pgm
β β βββ ...
β βββ camera_1/ # Left Camera
β βββ raw_9.pgm
β βββ raw_10.pgm
β βββ ...
Note: For stereo matching, gt/camera_0 corresponds to rain/camera_0, and gt/camera_1 corresponds to rain/camera_1.
Dataset Creation
Curation Rationale
To support studies on deraining directly in raw space, a custom stereo dataset was required that captures real-world rain physics rather than synthetic overlays, while ensuring temporal synchronization and providing ground truth.
Source Data
Data Collection Process: The dataset was collected using a rig with two synchronized FLIR Blackfly S 2.8MP cameras capable of capturing raw Bayer images.
- Protocol: Ground truth images were collected first using a spatially stable rig. Immediately after, a custom rain system was triggered to scatter water-rain-like droplets on the windshield and in the scene depth while recording the "rain" frames.
- Settings: Fixed analog gain of 1. Exposure times varied (300ms - 10,000ms) to capture bright daylight to low-light evening conditions.
- Scenarios: Included recordings through stationary glass windows, from within moving vehicles, and sessions with windshield wipers operating at varying speeds.
Additional Information
Licensing Information
CC-BY-4.0
Citation Information
@misc{rothschild2025mathbfr3reconstructionrawrain,
title={$\mathbf{R}^3$: Reconstruction, Raw, and Rain: Deraining Directly in the Bayer Domain},
author={Nate Rothschild and Moshe Kimhi and Avi Mendelson and Chaim Baskin},
year={2025},
eprint={2509.24022},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={[https://arxiv.org/abs/2509.24022](https://arxiv.org/abs/2509.24022)},
}
Social Proof
AI Summary: Based on Hugging Face metadata. Not a recommendation.
π‘οΈ Dataset Transparency Report
Technical metadata sourced from upstream repositories.
π Identity & Source
- id
- hf-dataset--realrainmaker--raw-rain-bayer
- slug
- realrainmaker--raw-rain-bayer
- source
- huggingface
- author
- realrainmaker
- license
- CC-BY-4.0
- tags
- task_categories:image-to-image, annotations_creators:machine-generated, multilinguality:monolingual, source_datasets:original, language:en, license:cc-by-4.0, size_categories:10k<n<100k, arxiv:2509.24022, region:us, computer-vision, raw-image, bayer-pattern, stereo-vision, image-deraining, rain-removal, real-world-rain
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
π Engagement & Metrics
- downloads
- 86,277
- stars
- 0
- forks
- 0
Data indexed from public sources. Updated daily.