πŸ“Š
Dataset

RenderedText

by wendlerc hf-dataset--wendlerc--renderedtext
Nexus Index
34.8 Top 100%
S / A / P / R / Q Breakdown Calibration Pending

Pillar scores are computed during the next indexing cycle.

Tech Context
Vital Performance
0 DL / 30D
0.0%
Data Integrity 34.8 FNI Score
- Size
- Rows
Parquet Format
- Tokens
Dataset Information Summary
Entity Passport
Registry ID hf-dataset--wendlerc--renderedtext
Provider huggingface
πŸ“œ

Cite this dataset

Academic & Research Attribution

BibTeX
@misc{hf_dataset__wendlerc__renderedtext,
  author = {wendlerc},
  title = {RenderedText Dataset},
  year = {2026},
  howpublished = {\url{https://huggingface.co/datasets/wendlerc/renderedtext}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
wendlerc. (2026). RenderedText [Dataset]. Free2AITools. https://huggingface.co/datasets/wendlerc/renderedtext

πŸ”¬Technical Deep Dive

Full Specifications [+]

βš–οΈ Nexus Index V2.0

34.8
ESTIMATED IMPACT TIER
Semantic (S) 0
Authority (A) 0
Popularity (P) 0
Recency (R) 0
Quality (Q) 0

πŸ’¬ Index Insight

FNI V2.0 for RenderedText: Semantic (S:0), Authority (A:0), Popularity (P:0), Recency (R:0), Quality (Q:0).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
⬇️
Downloads
46,696

πŸ‘οΈ Data Preview

πŸ“Š

Row-level preview not available for this dataset.

Schema structure is shown in the Field Logic panel when available.

πŸ”— Explore Full Dataset β†—

🧬 Field Logic

🧬

Schema not yet indexed for this dataset.

Dataset Specification

This dataset has been created by Stability AI and LAION.

This dataset contains 12 million 1024x1024 images of handwritten text written on a digital 3D sheet of paper generated using Blender geometry nodes and rendered using Blender Cycles. The text has varying font size, color, and rotation, and the paper was rendered under random lighting conditions. Note that, the first 10 million examples are in the root folder of this dataset repository and the remaining 2 million are in ./remaining (due to the constraint on number of files per directory).

It was generated with the script https://github.com/GbotHQ/ocr-dataset-rendering/, which utilizes:

Line level annotations Character level annotations

The dataset contains both line-level, as well as character level annotations for each example. The annotations are stored in the accompanying json files and are of the following form:

text
{
'ocr_annotation':
{'bounding_boxes': [[[145.0, 370.0], [788.0, 353.0], [827.0, 633.0], [182.0, 669.0]]], 
 'text': ['Joe.'], 
 'bb_relative': [[[0.1416015625, 0.361328125], [0.76953125, 0.3447265625], [0.8076171875, 0.6181640625], [0.177734375, 0.6533203125]]], 
 'char': ['J', 'o', 'e', '.'], 
 'char_idx': [0, 1, 2, 3], 
 'bb_character_level': [[[145.0, 370.0], [346.0, 365.0], [382.0, 651.0], [181.0, 662.0]], [[375.0, 438.0], [557.0, 431.0], [585.0, 640.0], [402.0, 650.0]], [[578.0, 440.0], [744.0, 434.0], [771.0, 629.0], [604.0, 638.0]], [[778.0, 591.0], [821.0, 589.0], [827.0, 633.0], [784.0, 635.0]]], 
 'font_path': '/fsx/home-wendlerc/blender-dataset/assets/fonts/fontcollection/HelloScribbles-axapm.ttf', 
 'font_color': [17, 25, 231], 
 'text_rotation_angle': 7},
'width':1024,
'height':1024,
}

Browse a few more examples here: https://colab.research.google.com/drive/1o0rZhtY9aeurzNrAbu6nJypULSIIcf1v?usp=sharing

πŸ“Š Structured Schema (Zero-Fabrication)

Feature Key Data Type
__key__ string
__url__ string
json unknown
png Image

Estimated Rows: 4,700

Social Proof

HuggingFace Hub
46.7KDownloads
πŸ”„ Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

πŸ“Š FNI Methodology πŸ“š Knowledge Baseℹ️ Verify with original source

πŸ›‘οΈ Dataset Transparency Report

Verified data manifest for traceability and transparency.

100% Data Disclosure Active

πŸ†” Identity & Source

id
hf-dataset--wendlerc--renderedtext
slug
wendlerc--renderedtext
source
huggingface
author
wendlerc
license
tags
task_categories:text-to-image, task_categories:image-to-text, language:en, size_categories:10m

βš™οΈ Technical Specs

architecture
null
params billions
null
context length
null
pipeline tag

πŸ“Š Engagement & Metrics

downloads
46,696
stars
55
forks
0

Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)