đ
Fineweb Edu 100b Shuffle dataset by karpathy
â 31.5
đŦTechnical Deep Dive
Full Specifications [+]
đ Daily sync (03:00 UTC)
AI Summary: Based on Hugging Face metadata. Not a recommendation.
đĄī¸ Dataset Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
đ Identity & Source
- id
- hf-dataset--karpathy--fineweb-edu-100b-shuffle
- slug
- karpathy--fineweb-edu-100b-shuffle
- source
- huggingface
- author
- karpathy
- license
- odc-by
- tags
- license:odc-by, size_categories:10m<n<100m, format:parquet, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, region:us
âī¸ Technical Specs
- architecture
- null
- params billions
- 100
- context length
- 4,096
- pipeline tag
đ Engagement & Metrics
- downloads
- 45,444
- stars
- 161
- forks
- 0
Data indexed from public sources. Updated daily.