Subtext Arena GRPO model by aamrinder
37.7
Technical Deep Dive
Daily sync (03:00 UTC)
AI Summary: Based on Hugging Face metadata. Not a recommendation.
Model Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
Identity & Source
- id: hf-model--aamrinder--subtext-arena-grpo
- slug: aamrinder--subtext-arena-grpo
- source: huggingface
- author: aamrinder
- license:
- tags: peft, safetensors, base_model:adapter:qwen/qwen2.5-3b-instruct, grpo, lora, transformers, trl, text-generation, conversational, arxiv:2402.03300, base_model:qwen/qwen2.5-3b-instruct, region:us
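The tags above mix plain labels (`peft`, `lora`, `grpo`) with `prefix:value` entries (`base_model:...`, `arxiv:...`, `region:...`). As a minimal sketch, assuming that reading of the tag string is correct, the snippet below separates the two kinds. `parse_tags` is a hypothetical helper written for this illustration and is not part of any Hugging Face library.

```python
# Tag string copied verbatim from the listing above.
TAGS = (
    "peft, safetensors, base_model:adapter:qwen/qwen2.5-3b-instruct, grpo, lora, "
    "transformers, trl, text-generation, conversational, arxiv:2402.03300, "
    "base_model:qwen/qwen2.5-3b-instruct, region:us"
)


def parse_tags(raw: str) -> dict:
    """Split a comma-separated tag string into plain tags and prefix:value tags.

    Hypothetical helper: the prefix:value interpretation is an assumption
    based only on the tags shown in this listing.
    """
    plain, keyed = [], {}
    for tag in (t.strip() for t in raw.split(",")):
        if not tag:
            continue
        if ":" in tag:
            # Split on the first colon only, so "adapter:<repo>" stays intact.
            key, value = tag.split(":", 1)
            keyed.setdefault(key, []).append(value)
        else:
            plain.append(tag)
    return {"plain": plain, "keyed": keyed}


parsed = parse_tags(TAGS)
print(parsed["keyed"]["base_model"])
print(parsed["keyed"]["arxiv"])
print(parsed["plain"])
```

Read this way, the two `base_model` entries both point at qwen/qwen2.5-3b-instruct, one of them marked `adapter`, which is consistent with the `peft` and `lora` tags in the same list.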
Technical Specs
- architecture: null
- params (billions): null
- context length: null
- pipeline tag: text-generation
Engagement & Metrics
- downloads: 16
- stars: 0
- forks: null
Data indexed from public sources. Updated daily.