Github Code 2025 Language Split dataset by lumees
37.3
Technical Deep Dive
Full Specifications
Daily sync (03:00 UTC)
AI Summary: Based on Hugging Face metadata. Not a recommendation.
Dataset Transparency Report
Verified data manifest for traceability and transparency.
100% Data Disclosure Active
Identity & Source
- id: hf-dataset--lumees--github-code-2025-language-split
- slug: lumees--github-code-2025-language-split
- source: huggingface
- author: lumees
- license: other
- tags: source_datasets:nick007x/github-code-2025, license:other, size_categories:100m<n<1b, format:parquet, modality:text, library:datasets, library:dask, library:polars, library:mlcroissant, region:us
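The tag string above packs several `key:value` pairs into one line, with `library` appearing more than once. A minimal sketch of grouping those tags by key follows. The helper names `parse_tags` and `slug_to_repo_id` are hypothetical, and the slug-to-repository mapping assumes that the first `--` in the slug stands for the `/` between author and dataset name, which this page does not state.

```python
# Tag string copied verbatim from the listing above.
TAGS = (
    "source_datasets:nick007x/github-code-2025, license:other, "
    "size_categories:100m<n<1b, format:parquet, modality:text, "
    "library:datasets, library:dask, library:polars, "
    "library:mlcroissant, region:us"
)


def parse_tags(raw: str) -> dict[str, list[str]]:
    """Group comma-separated 'key:value' tags by key.

    Tags without a colon are collected under the empty-string key.
    """
    grouped: dict[str, list[str]] = {}
    for tag in raw.split(","):
        tag = tag.strip()
        if not tag:
            continue
        # Split on the first colon only, so values may contain ':' or '/'.
        key, sep, value = tag.partition(":")
        if not sep:
            key, value = "", tag
        grouped.setdefault(key, []).append(value)
    return grouped


def slug_to_repo_id(slug: str) -> str:
    """Assumption: the first '--' in the slug separates author and name."""
    return slug.replace("--", "/", 1)


if __name__ == "__main__":
    tags = parse_tags(TAGS)
    print(tags["library"])
    print(slug_to_repo_id("lumees--github-code-2025-language-split"))
```

Grouping by key keeps repeated keys such as `library` together in one list instead of letting later values overwrite earlier ones.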
Technical Specs
- architecture: null
- params (billions): null
- context length: null
- pipeline tag:
Engagement & Metrics
- downloads: 218,748
- stars: 6
- forks: 0
Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)