đ
Fineweb Edu 100b Shuffle dataset by karpathy
â 23
đŦTechnical Deep Dive
Full Specifications [+]
đ Daily sync (03:00 UTC)
AI Summary: Based on Hugging Face metadata. Not a recommendation.
đĄī¸ Dataset Transparency Report
Verified data manifest for traceability and transparency.
100% Data Disclosure Active
đ Identity & Source
- id
- hf-dataset--karpathy--fineweb-edu-100b-shuffle
- source
- huggingface
- author
- karpathy
- tags
- license:odc-bysize_categories:10m
format:parquetmodality:textlibrary:datasetslibrary:dasklibrary:polarslibrary:mlcroissantregion:us
âī¸ Technical Specs
- architecture
- null
- params billions
- 100
- context length
- 4,096
đ Engagement & Metrics
- likes
- 152
- downloads
- 40,532
Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)