🧠
Model

Exaone 3.5 2.4b Instruct Gguf

by mobilint hf-model--mobilint--exaone-3.5-2.4b-instruct-gguf
Nexus Index
37.7 Top 100%
S: Semantic 50
A: Authority 0
P: Popularity 20
R: Recency 96
Q: Quality 50
Tech Context
2.4 Params
4.096K Ctx
Vital Performance
468 DL / 30D
0.0%
Audited 37.7 FNI Score
Tiny 2.4B Params
4k Context
468 Downloads
8G GPU ~4GB Est. VRAM
Restricted OTHER License
Model Information Summary
Entity Passport
Registry ID hf-model--mobilint--exaone-3.5-2.4b-instruct-gguf
License Other
Provider huggingface
💾

Compute Threshold

~3.1GB VRAM

Interactive
Analyze Hardware
â–ŧ

* Static estimation for 4-Bit Quantization.

📜

Cite this model

Academic & Research Attribution

BibTeX
@misc{hf_model__mobilint__exaone_3.5_2.4b_instruct_gguf,
  author = {mobilint},
  title = {Exaone 3.5 2.4b Instruct Gguf Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/mobilint/exaone-3.5-2.4b-instruct-gguf}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}
APA Style
mobilint. (2026). Exaone 3.5 2.4b Instruct Gguf [Model]. Free2AITools. https://huggingface.co/mobilint/exaone-3.5-2.4b-instruct-gguf

đŸ”ŦTechnical Deep Dive

Full Specifications [+]

Quick Commands

đŸĻ™ Ollama Run
ollama run exaone-3.5-2.4b-instruct-gguf
🤗 HF Download
huggingface-cli download mobilint/exaone-3.5-2.4b-instruct-gguf

âš–ī¸ Nexus Index V2.0

37.7
TOP 100% SYSTEM IMPACT
Semantic (S) 50
Authority (A) 0
Popularity (P) 20
Recency (R) 96
Quality (Q) 50

đŸ’Ŧ Index Insight

FNI V2.0 for Exaone 3.5 2.4b Instruct Gguf: Semantic (S:50), Authority (A:0), Popularity (P:20), Recency (R:96), Quality (Q:50).

Free2AITools Nexus Index

Verification Authority

Unbiased Data Node Refresh: VFS Live
---

🚀 What's Next?

Technical Deep Dive

EXAONE 3.5 2.4B Instruct — GGUF + MXQ for llama-cli-mblt

This repository provides EXAONE 3.5 2.4B Instruct compiled and optimized for Mobilint NPU hardware, packaged for use with llama.cpp-mblt.

Branches

Branch Contents Description
main Body model only Standard autoregressive decoding
eagle3 Body + FC + Draft models EAGLE3 speculative decoding (~2-4x faster)

Quick Start

bash
# Simple decoding
llama-cli-mblt -hf mobilint/EXAONE-3.5-2.4B-Instruct-GGUF -p "Hello!" -n 128

# EAGLE3 speculative decoding
llama-cli-mblt -hf mobilint/EXAONE-3.5-2.4B-Instruct-GGUF --eagle3 -p "Hello!" -n 128

# Interactive chat
llama-cli-mblt -hf mobilint/EXAONE-3.5-2.4B-Instruct-GGUF --eagle3

Files

main branch

File Size Description
exaone-3.5-2.4b-instruct-vocab.gguf 4.0 MB Tokenizer (vocab-only GGUF)
target_emb.bin 1.0 GB Body embedding weights (float32)
EXAONE-3.5-2.4B-Instruct.mxq 1.4 GB Body model for NPU
config.json — Model configuration

eagle3 branch (adds)

File Size Description
single_Fc_EXAONE-3.5-2.4B-Instruct.mxq 19 MB FC dimension converter model
Draft_EXAONE-3.5-2.4B-Instruct.mxq 87 MB EAGLE3 draft model
draft_emb.bin 1.0 GB Draft embedding weights
d2t.bin 250 KB Draft-to-target vocabulary mapping

About

This model is compiled and optimized for Mobilint NPU hardware. It is intended to be used with llama-cli-mblt from llama.cpp-mblt.

âš ī¸ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

  • â€ĸ Benchmark scores may vary based on evaluation methodology and hardware configuration.
  • â€ĸ VRAM requirements are estimates; actual usage depends on quantization and batch size.
  • â€ĸ FNI scores are relative rankings and may change as new models are added.
  • ⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

HuggingFace Hub
468Downloads
🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseâ„šī¸ Verify with original source

đŸ›Ąī¸ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id
hf-model--mobilint--exaone-3.5-2.4b-instruct-gguf
slug
mobilint--exaone-3.5-2.4b-instruct-gguf
source
huggingface
author
mobilint
license
Other
tags
llama-cpp, gguf, mobilint-exaone, mobilint, exaone, npu, text-generation, base_model:lgai-exaone/exaone-3.5-2.4b-instruct, license:other, endpoints_compatible, region:us, conversational

âš™ī¸ Technical Specs

architecture
null
params billions
2.4
context length
4,096
pipeline tag
text-generation
vram gb
3.1
vram is estimated
true
vram formula
VRAM ≈ (params * 0.75) + 0.8GB (KV) + 0.5GB (OS)

📊 Engagement & Metrics

downloads
468
stars
0
forks
0

Data indexed from public sources. Updated daily.