Sarlo 80
| Entity Passport | |
| Registry ID | hf-dataset--onera--sarlo-80 |
| Provider | huggingface |
Cite this dataset
Academic & Research Attribution
@misc{hf_dataset__onera__sarlo_80,
author = {ONERA},
title = {Sarlo 80 Dataset},
year = {2026},
howpublished = {\url{https://huggingface.co/datasets/onera/sarlo-80}},
note = {Accessed via Free2AITools Knowledge Fortress}
}
Technical Deep Dive
Nexus Index V2.0
Index Insight
FNI V2.0 for Sarlo 80: Semantic (S:50), Authority (A:0), Popularity (P:50), Recency (R:51), Quality (Q:30).
Verification Authority
Data Preview
Row-level preview not available for this dataset.
Schema structure is shown in the Field Logic panel when available.
Field Logic
Schema not yet indexed for this dataset.
Dataset Specification
SARLO-80: Worldwide Slant SAR Language Optic Dataset - 80cm
Description
This dataset provides 119,566 paired Umbra SAR and optical images in slant-range geometry, each with text descriptions.
- SAR data come from Umbra SICD products, resampled to a fixed 80 cm × 80 cm resolution (azimuth and range) in slant-range geometry.
- Optical images are high-resolution optical data reprojected into the slant-range geometry of the corresponding SAR acquisition.
- Each pair also includes three captions generated automatically from the optical image with CogVLM2, then cleaned and post-processed with the Qwen LLM.
Format
- Resolution: 80 cm (azimuth × range), slant-range geometry
- Modality pairing: Umbra SAR × optical (slant-range reprojected)
- Text: 3 captions per pair (SHORT, MID, LONG)
Sample contents (per WebDataset shard)
Each sample contains:
| File | Description | Notes |
|---|---|---|
| <id>.optic.png | Optical image in slant-range geometry | 80 cm, (1024, 1024) |
| <id>.sar.png | SAR amplitude image in slant-range | 80 cm, (1024, 1024) |
| <id>.sar.npy | Complex SAR array (slant-range) | complex64, (1024, 1024) |
| <id>.meta.json | Metadata | UTF-8 JSON |
| <id>.__key__ | Unique WebDataset key | String |
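As a rough sizing aid, the figures in the table can be turned into per-sample and whole-dataset estimates. This is a minimal sketch: it treats the stated 80 cm as the pixel spacing (an assumption; the per-sample spacing_eff_az and spacing_eff_rg metadata fields may be the better source) and counts only the raw complex64 payload, ignoring the .npy header and the tar container.

```python
import numpy as np

# Values taken from the sample table; using 80 cm as pixel spacing is an assumption.
TILE_SHAPE = (1024, 1024)      # shape of sar.npy
PIXEL_SPACING_M = 0.80         # metres per pixel (assumed)
NUM_PAIRS = 119_566            # number of SAR/optical pairs in the dataset

# complex64 stores two float32 values, so 8 bytes per pixel.
bytes_per_tile = int(np.prod(TILE_SHAPE)) * np.dtype(np.complex64).itemsize

# Extent of one tile along each axis, in the slant plane.
extent_m = tuple(n * PIXEL_SPACING_M for n in TILE_SHAPE)

# Raw payload of every sar.npy array combined.
total_gib = NUM_PAIRS * bytes_per_tile / 2**30

print(f"one sar.npy payload: {bytes_per_tile / 2**20:.1f} MiB")
print(f"tile extent: {extent_m[0]:.1f} m x {extent_m[1]:.1f} m")
print(f"all sar.npy payloads: about {total_gib:.0f} GiB")
```

Under these assumptions each complex array is 8 MiB and covers roughly 819 m per side, so the complex arrays alone come to several hundred GiB. This is one reason to work shard by shard rather than downloading everything.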
The meta.json file describes optic.png, city, city_id_satellite_pass, the SAR geometry (Nb_pixel_a, Nb_pixel_r, spacing_eff_az, spacing_eff_rg, operation_sampling, bbox_ecf, bbox_llh), and caption: SHORT (short), MID (medium-length), and LONG (descriptive caption).
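The three caption levels could be pulled out of a decoded meta.json payload along these lines. This is only a sketch: the exact layout of meta.json is not shown here, so the nesting (a `caption` object keyed by SHORT, MID, LONG), the helper name `get_captions`, and the example payload are all assumptions to check against a real sample.

```python
import json

# Hypothetical payload: the layout and values are assumptions for illustration,
# not copied from a real meta.json.
example_meta_bytes = json.dumps({
    "city": "example-city",
    "caption": {
        "SHORT": "a port",
        "MID": "a port with docks",
        "LONG": "a port with docks and several moored ships",
    },
}).encode("utf-8")


def get_captions(meta_bytes, field="caption"):
    """Return the SHORT/MID/LONG captions present in a meta.json payload."""
    meta = json.loads(meta_bytes.decode("utf-8"))
    captions = meta.get(field, {})
    if not isinstance(captions, dict):
        # The assumed nesting does not hold; return nothing rather than guess.
        return {}
    return {level: captions[level]
            for level in ("SHORT", "MID", "LONG")
            if level in captions}


captions = get_captions(example_meta_bytes)
for level, text in captions.items():
    print(f"{level}: {text}")
```

Returning an empty dict when the field is missing or shaped differently keeps a training loop from crashing on an unexpected sample, at the cost of silently skipping its text.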
Usage (WebDataset / Python)
Example that downloads one shard and reads its first sample with webdataset:
import io
import json
import os

import numpy as np
from PIL import Image
from huggingface_hub import hf_hub_download
import webdataset as wds

# Read the Hugging Face access token from the environment (set HF_TOKEN beforehand).
token = os.environ.get("HF_TOKEN")
repo_id = "ONERA/SARLO-80"
chunk_id = 0
shard_name = "shard-00027.tar"  # exact name
filename = f"train/chunk_{chunk_id:03d}/{shard_name}"

# Download one shard into the local Hugging Face cache.
local_tar = hf_hub_download(
    repo_id=repo_id,
    repo_type="dataset",
    filename=filename,
    token=token,
)

# Open the shard and take its first sample.
ds = wds.WebDataset(local_tar, shardshuffle=False)
sample = next(iter(ds))

# Each entry is raw bytes keyed by file extension.
img_optic = Image.open(io.BytesIO(sample["optic.png"]))
print(" optic.png size (W,H):", img_optic.size)
img_sar = Image.open(io.BytesIO(sample["sar.png"]))
print(" sar.png size (W,H):", img_sar.size)
sar_complex_slc = np.load(io.BytesIO(sample["sar.npy"]), allow_pickle=False)
print(" sar.npy dtype/shape :", sar_complex_slc.dtype, sar_complex_slc.shape)
meta_bytes = sample["meta.json"]
meta = json.loads(meta_bytes.decode("utf-8"))
print(f'Metadata: {meta}')
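Once the complex array is loaded, a quick-look amplitude image can be derived from it. The sketch below uses a common display recipe (magnitude, clipping at a high percentile, scaling to 8 bits); it is not necessarily how the dataset's own sar.png files were produced, and the function name `sar_to_uint8`, the 99th-percentile ceiling, and the synthetic input are illustrative assumptions.

```python
import numpy as np


def sar_to_uint8(slc, percentile=99.0):
    """Convert a complex SAR array to an 8-bit amplitude image for display."""
    amplitude = np.abs(slc).astype(np.float32)
    # Clip at a high percentile so a few bright scatterers do not darken the rest.
    ceiling = float(np.percentile(amplitude, percentile))
    if ceiling <= 0.0:
        # All-zero input: return a black image instead of dividing by zero.
        return np.zeros(amplitude.shape, dtype=np.uint8)
    scaled = np.clip(amplitude / ceiling, 0.0, 1.0)
    return (scaled * 255).astype(np.uint8)


# Synthetic stand-in for sample["sar.npy"]; real tiles are complex64 (1024, 1024).
rng = np.random.default_rng(0)
fake_slc = (rng.standard_normal((64, 64))
            + 1j * rng.standard_normal((64, 64))).astype(np.complex64)

preview = sar_to_uint8(fake_slc)
print(preview.dtype, preview.shape)
```

With a real sample, `sar_to_uint8(sar_complex_slc)` can be passed to `Image.fromarray` for viewing. Percentile clipping is a pragmatic choice for speckled data; a logarithmic (dB) scale is a common alternative.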
Acknowledgments
This work was carried out as part of the PhD of Solène Debuysère at DEMR-ONERA - Université de Paris Saclay, under the supervision of Nicolas Trouvé, Nathan Letheule, and Elise Colin. We gratefully acknowledge ONERA, and especially the DEMR-SEM team, for providing computational and research resources; Umbra for the SAR data collections and open-access initiatives enabling research use; and Hugging Face, in particular Georgia Channing, for her help on this project.
Contacts: [email protected] or [email protected]
License
SARLO-80 is released under the Creative Commons Attribution-ShareAlike 4.0 (CC BY-SA 4.0) license.
Citation information will be made available once the paper is published.
Dataset Transparency Report
Technical metadata sourced from upstream repositories.
Identity & Source
- id: hf-dataset--onera--sarlo-80
- slug: onera--sarlo-80
- source: huggingface
- author: ONERA
- license:
- tags: region:us
Technical Specs
- architecture: null
- params billions: null
- context length: null
- pipeline tag:
Engagement & Metrics
- downloads: 26,528
- stars: 9
- forks: 0
Data indexed from public sources. Updated daily.