DeepSeek-Coder-V2-Lite-Instruct
by deepseek-ai
DeepSeek-Coder-V2-Lite-Instruct is an open-source AI model by deepseek-ai
Technical Specifications
View Config (4 entries)
{
"architectures": [
"DeepseekV2ForCausalLM"
],
"auto_map": {
"AutoConfig": "configuration_deepseek.DeepseekV2Config",
"AutoModel": "modeling_deepseek.DeepseekV2Model",
"AutoModelForCausalLM": "modeling_deepseek.DeepseekV2ForCausalLM"
},
"model_type": "deepseek_v2",
"tokenizer_config": {
"bos_token": {
"__type": "AddedToken",
"content": "<๏ฝbeginโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"eos_token": {
"__type": "AddedToken",
"content": "<๏ฝendโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"pad_token": {
"__type": "AddedToken",
"content": "<๏ฝendโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"unk_token": null,
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{{ bos_token }}{% for message in messages %}{% if message['role'] == 'user' %}{{ 'User: ' + message['content'] + '\n\n' }}{% elif message['role'] == 'assistant' %}{{ 'Assistant: ' + message['content'] + eos_token }}{% elif message['role'] == 'system' %}{{ message['content'] + '\n\n' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ 'Assistant:' }}{% endif %}"
}
}
Est. VRAM Required
~12 GB
Estimation Formula
VRAM = params ร 0.6 + 2 GB
Based on FP16 precision.
โ ๏ธ Does not account for KV cache or parallel overhead.
๐ Estimate only. Actual requirements may vary.
Based on open-source metadata snapshot. Last synced: Dec 31, 2025
๐ง Architecture Explorer
Neural network architecture
Technical Specifications
View Config (4 entries)
{
"architectures": [
"DeepseekV2ForCausalLM"
],
"auto_map": {
"AutoConfig": "configuration_deepseek.DeepseekV2Config",
"AutoModel": "modeling_deepseek.DeepseekV2Model",
"AutoModelForCausalLM": "modeling_deepseek.DeepseekV2ForCausalLM"
},
"model_type": "deepseek_v2",
"tokenizer_config": {
"bos_token": {
"__type": "AddedToken",
"content": "<๏ฝbeginโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"eos_token": {
"__type": "AddedToken",
"content": "<๏ฝendโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"pad_token": {
"__type": "AddedToken",
"content": "<๏ฝendโofโsentence๏ฝ>",
"lstrip": false,
"normalized": true,
"rstrip": false,
"single_word": false
},
"unk_token": null,
"chat_template": "{% if not add_generation_prompt is defined %}{% set add_generation_prompt = false %}{% endif %}{{ bos_token }}{% for message in messages %}{% if message['role'] == 'user' %}{{ 'User: ' + message['content'] + '\n\n' }}{% elif message['role'] == 'assistant' %}{{ 'Assistant: ' + message['content'] + eos_token }}{% elif message['role'] == 'system' %}{{ message['content'] + '\n\n' }}{% endif %}{% endfor %}{% if add_generation_prompt %}{{ 'Assistant:' }}{% endif %}"
}
}
๐ Limitations & Considerations
- โข Benchmark scores may vary based on evaluation methodology and hardware configuration.
- โข VRAM requirements are estimates; actual usage depends on quantization and batch size.
- โข FNI scores are relative rankings and may change as new models are added.
- โ License Unknown: Verify licensing terms before commercial use.
- โข Source: Huggingface
๐ Related Resources
๐ Related Papers
No related papers linked yet. Check the model's official documentation for research papers.
๐ Training Datasets
Training data information not available. Refer to the original model card for details.
๐ Related Models
Data unavailable