πŸ“Š
Dataset

Sarlo 80

by ONERA hf-dataset--onera--sarlo-80
Nexus Index
30.4 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 50
R: Recency 51
Q: Quality 30
Tech Context
Vital Performance
0 DL / 30D
0.0%
Data Integrity 30.4 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--onera--sarlo-80
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__onera__sarlo_80,
  author = {ONERA},
  title = {Sarlo 80 Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/onera/sarlo-80}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
ONERA. (2026). Sarlo 80 [Dataset]. Free2AITools. https://huggingface.co/datasets/onera/sarlo-80

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Nexus Index V2.0

30.4
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 50
Recency (R) 51
Quality (Q) 30

πŸ’¬ Index Insight

FNI V2.0 for Sarlo 80: Semantic (S:50), Authority (A:0), Popularity (P:50), Recency (R:51), Quality (Q:30).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
⬇️
Downloads
26,528

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

SARLO-80 logo

SARLO-80: Worldwide Slant SAR Language Optic Dataset - 80cm

Description

This dataset provides 119566 paired Umbra SAR, optical images in slant-range geometry and text description.

  • SAR data come from Umbra SICD products, resampled to a fixed 80 cm Γ— 80 cm resolution (azimuth and range) in slant-range geometry.
  • Optical images are high-resolution optical data reprojected into the slant-range geometry of the corresponding SAR acquisition.
  • Each pair also includes three automatically generated captions from the optical image using CogVLM2, then cleaned and post-processed with the Qwen LLM.

Format

  • Resolution: 80 cm (azimuth Γ— range), slant-range geometry
  • Modality pairing: Umbra SAR Γ— optical (slant-range reprojected)
  • Text: 3 captions per pair (SHORT, MID, LONG)

Sample contents (per WebDataset shard)

Each sample contains:

File Description Notes
<id>.optic.png Optical image in slant-range geometry 80 cm, (1024, 1024)
<id>.sar.png SAR amplitude image in slant-range 80 cm, (1024, 1024)
<id>.sar.npy Complex SAR array (slant-range) complex64 (1024, 1024)
<id>.meta.json Metadata UTF-8 JSON
<id>.__key__ Unique WebDataset key String

The meta.json contains the information of optic.png, city, city_id_satellite_pass, SAR geometry: Nb_pixel_a, Nb_pixel_r, spacing_eff_az, spacing_eff_rg, operation_sampling, bbox_ecf, bbox_llh and caption: SHORT – short, MID – medium-length and LONG – descriptive caption.

pairs_example

More examples

images_examples

Usage (WebDataset / Python)

Example to stream the dataset with webdataset:

python

import os, io
import numpy as np
from PIL import Image
from huggingface_hub import HfApi, hf_hub_download
import webdataset as wds

token = os.environ.get("HF_TOKEN") #put your token_hf

repo_id = "ONERA/SARLO-80"

chunk_id = 0
shard_name = "shard-00027.tar"  # exact name
filename = f"train/chunk_{chunk_id:03d}/{shard_name}"


local_tar = hf_hub_download(
            repo_id=repo_id,
            repo_type="dataset",
            filename=filename,
            token=token,
        )

ds = wds.WebDataset(local_tar, shardshuffle=False)
sample = next(iter(ds))

img_optic = Image.open(io.BytesIO(sample["optic.png"]))
print("  optic.png size (W,H):", img_optic.size)

img_sar = Image.open(io.BytesIO(sample["sar.png"]))
print("  sar.png   size (W,H):", img_sar.size)

sar_complex_slc = np.load(io.BytesIO(sample["sar.npy"]), allow_pickle=False)
print("  sar.npy dtype/shape :", sar_complex_slc.dtype, sar_complex_slc.shape)

meta_bytes = sample["meta.json"]          
meta = json.loads(meta_bytes.decode("utf-8"))
print(f'Metadata: {meta}')

Acknowledgments

This work was carried out as part of the PhD of Solène Debuysère at DEMR-ONERA - Université de Paris Saclay, under the supervision of Nicolas Trouvé, Nathan Letheule, and Elise Colin. We gratefully acknowledge ONERA, and especially DEMR-SEM Team for providing computational and research resources, Umbra for the SAR data collections and open-access initiatives enabling research use, and Hugging Face, in particular Georgia Channing, for her help on this project.

Contacts: [email protected] or [email protected]

License

SARLO-80 is released under the Creative Commons Attribution‑ShareAlike 4.0 (CC‑BY‑SA‑4.0) license.

Citation information will be made available upon soon paper publication.

Social Proof

HuggingFace Hub
26.5KDownloads
πŸ”„ Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

πŸ†” Identity & Source

id
hf-dataset--onera--sarlo-80
slug
onera--sarlo-80
source
huggingface
author
ONERA
license
tags
region:us

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

πŸ“Š Engagement & Metrics

downloads
26,528
stars
9
forks
0

Data indexed from public sources. Updated daily.