Sarvam 30B GPTQ W8A16 model by Orange
41.4
Technical Deep Dive
Full Specifications
Daily sync (03:00 UTC)
AI Summary: Based on Hugging Face metadata. Not a recommendation.
Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
Identity & Source
- id: hf-model--orange--sarvam-30b-gptq-w8a16
- slug: orange--sarvam-30b-gptq-w8a16
- source: huggingface
- author: Orange
- license: Apache-2.0
- tags: transformers, safetensors, sarvam_moe, text-generation, gptq, quantized, 8-bit, w8a16, moe, vllm, compressed-tensors, conversational, custom_code, base_model:sarvamai/sarvam-30b, base_model:quantized:sarvamai/sarvam-30b, license:apache-2.0, region:eu
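The tag list mixes plain labels (such as gptq or moe) with namespaced entries of the form key:value (such as base_model:sarvamai/sarvam-30b). A minimal sketch of separating the two, assuming the tags arrive as one comma-separated string and that everything before the first colon is the namespace; the function name `split_tags` is hypothetical and not part of any library:

```python
from collections import defaultdict

# Tag string copied from the listing above.
TAGS = (
    "transformers, safetensors, sarvam_moe, text-generation, gptq, quantized, "
    "8-bit, w8a16, moe, vllm, compressed-tensors, conversational, custom_code, "
    "base_model:sarvamai/sarvam-30b, base_model:quantized:sarvamai/sarvam-30b, "
    "license:apache-2.0, region:eu"
)


def split_tags(raw):
    """Split a comma-separated tag string into plain and namespaced tags.

    Assumption: the text before the first ':' is the namespace, and the
    remainder (which may itself contain ':') is the value.
    """
    plain = []
    namespaced = defaultdict(list)
    for tag in (t.strip() for t in raw.split(",")):
        if not tag:
            continue
        key, sep, value = tag.partition(":")
        if sep:
            namespaced[key].append(value)
        else:
            plain.append(tag)
    return plain, dict(namespaced)


plain, namespaced = split_tags(TAGS)
print(plain)
print(namespaced)
```

Under this splitting rule the base_model namespace carries two values, one naming the base model directly and one marking this repository as a quantized derivative of it.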
Technical Specs
- architecture: null
- params billions: 30
- context length: 4,096
- pipeline tag: text-generation
- vram gb: 23.8
- vram is estimated: true
- vram formula: VRAM ≈ (params * 0.75) + 0.8 GB (KV) + 0.5 GB (OS)
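The listed VRAM figure follows directly from the formula: with 30 billion parameters, 30 × 0.75 + 0.8 + 0.5 = 23.8 GB. A minimal sketch of that arithmetic, assuming the constants are exactly those in the formula; the function and parameter names are hypothetical, and the result is the page's heuristic estimate, not a measured footprint:

```python
def estimate_vram_gb(
    params_billions,
    gb_per_billion_params=0.75,  # weight term from the listed formula
    kv_cache_gb=0.8,             # fixed KV-cache allowance from the formula
    os_overhead_gb=0.5,          # fixed OS allowance from the formula
):
    """Return the heuristic VRAM estimate in GB, rounded to one decimal."""
    total = params_billions * gb_per_billion_params + kv_cache_gb + os_overhead_gb
    return round(total, 1)


print(estimate_vram_gb(30))  # 23.8, matching the "vram gb" field
```

The KV and OS terms are constants here, so the estimate does not change with context length or batch size; actual usage at long contexts may differ.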
Engagement & Metrics
- downloads: 315
- stars: 0
- forks: 0
Data indexed from public sources. Updated daily.