GLM 5.1 MLX Q4.8 Inf model by inferencerlabs
55.9
Technical Deep Dive
Full Specifications
Updated daily
Source summary: Based on Hugging Face metadata. Not a recommendation.
Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
Identity & Source
- id: hf-model--inferencerlabs--glm-5.1-mlx-q4.8-inf
- slug: inferencerlabs--glm-5.1-mlx-q4.8-inf
- source: huggingface
- author: inferencerlabs
- license: (not listed)
- tags: mlx, safetensors, glm_moe_dsa, quantized, text-generation, conversational, en, base_model:zai-org/glm-5.1, base_model:quantized:zai-org/glm-5.1, region:us
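The id and slug listed here appear to follow a `--`-separated layout (prefix, author, model name). A minimal Python sketch of splitting one, assuming that layout holds; the function name `parse_model_id` and the dictionary keys are hypothetical and not part of the source metadata:

```python
def parse_model_id(model_id: str) -> dict:
    """Split an id of the assumed form 'prefix--author--name' into parts."""
    # Split at most twice so any later '--' stays inside the model name.
    prefix, author, name = model_id.split("--", 2)
    return {
        "prefix": prefix,
        "author": author,
        "name": name,
        # The slug is assumed to be the id without its prefix.
        "slug": f"{author}--{name}",
    }


parts = parse_model_id("hf-model--inferencerlabs--glm-5.1-mlx-q4.8-inf")
print(parts["slug"])  # prints "inferencerlabs--glm-5.1-mlx-q4.8-inf"
```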
Technical Specs
- architecture: GlmMoeDsaForCausalLM
- params (billions): 743.91
- context length: 8,192
- pipeline tag: text-generation
- vram (GB): 560.4
- vram is estimated: true
- vram formula: VRAM ≈ (params * 0.75) + 2 GB (KV) + 0.5 GB (OS)
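The listed VRAM figure can be checked against the formula above. A minimal Python sketch, assuming `params` is the parameter count in billions and that the 0.75 factor means GB per billion parameters; the function and constant names are illustrative and not part of the source metadata:

```python
# Constants taken from the listed formula; the names are illustrative.
GB_PER_BILLION_PARAMS = 0.75  # assumed unit: GB per billion parameters
KV_CACHE_GB = 2.0             # fixed KV-cache allowance from the formula
OS_OVERHEAD_GB = 0.5          # fixed OS allowance from the formula


def estimate_vram_gb(params_billions: float) -> float:
    """Return the estimated VRAM in GB for a model of the given size."""
    return params_billions * GB_PER_BILLION_PARAMS + KV_CACHE_GB + OS_OVERHEAD_GB


estimate = estimate_vram_gb(743.91)
print(f"{estimate:.1f} GB")  # prints "560.4 GB"
```

With 743.91 billion parameters this gives 557.9325 + 2 + 0.5 = 560.4325 GB, which rounds to the listed 560.4. Both allowances are fixed constants, so the result does not change with context length, and the page itself flags the figure as estimated.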
Engagement & Metrics
- downloads: 4,316
- stars: null
- forks: null
Data indexed from public sources. Updated daily.