🧠 Model

Mixtral-8x7B-v0.1

by mistralai

--- library_name: vllm license: apache-2.0 language: - fr - it - de - es - en tags: - moe - mistral-common extra_gated_description: >- If you want to learn more

🕐 Updated 12/19/2025

🧠 Architecture Explorer

Neural network architecture

1 Input Layer

2 Hidden Layers

3 Attention

4 Output Layer

Learn about Transformers →

About

The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. The Mistral-8x7B outperforms Llama 2 70B on most benchmarks we tested. For full details of this model please read our ...

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
• Data source: [{"source_platform":"huggingface","source_url":"https://huggingface.co/mistralai/Mixtral-8x7B-v0.1","fetched_at":"2025-12-19T07:41:01.176Z","adapter_version":"3.2.0"}]

📚 Related Resources

📄 Related Papers

No related papers linked yet. Check the model's official documentation for research papers.

📊 Training Datasets

Training data information not available. Refer to the original model card for details.

🔗 Related Models V6.2

Mixtral-8x7B-Instruct-v0.1📦 Sibling Mistral-7B-v0.1📦 Sibling Mistral-7B-Instruct-v0.2📦 Sibling Mistral-7B-Instruct-v0.3📦 Sibling Mistral-7B-Instruct-v0.1📦 Sibling

Model Information Summary
Model Name	Mixtral-8x7B-v0.1
Author	mistralai
Type	other
Downloads	57,898
Likes	1,772
Source	Hugging Face
Last Updated	December 19, 2025

Graph Overview

200 Models

460 Connections

Explore Full Graph →

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Learn About Deployment

Understand deployment options