🧠

Model

Falcon3 7b Instruct

Name: Falcon3 7b Instruct
Author: tiiuae

by tiiuae hf-model--tiiuae--falcon3-7b-instruct

Nexus Index

39.3 Top 6%

S: Semantic 50

A: Authority 0

P: Popularity 50

R: Recency 52

Q: Quality 65

Tech Context

7.46 Params

8.192K Ctx

Vital Performance

24.8K DL / 30D

0.0%

Source →

Audited 39.3 FNI Score

7.46B Params

8k Context

24.8K Downloads

8G GPU ~7GB Est. VRAM

Dense LLAMAFORCAUSALLM Architecture

Restricted OTHER License

Model Information Summary
Entity Passport
Registry ID	hf-model--tiiuae--falcon3-7b-instruct
License	Other
Provider	huggingface

💾

Compute Threshold

~6.9GB VRAM

Interactive

Analyze Hardware

Hardware Compatibility Test

▼

* Static estimation for 4-Bit Quantization.

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{hf_model__tiiuae__falcon3_7b_instruct,
  author = {tiiuae},
  title = {Falcon3 7b Instruct Model},
  year = {2026},
  howpublished = {\url{https://huggingface.co/tiiuae/Falcon3-7B-Instruct}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

tiiuae. (2026). Falcon3 7b Instruct [Model]. Free2AITools. https://huggingface.co/tiiuae/Falcon3-7B-Instruct

🔬Technical Deep Dive

Full Specifications [+]

Quick Commands

🦙 Ollama Run

ollama run falcon3-7b-instruct

🤗 HF Download

huggingface-cli download tiiuae/falcon3-7b-instruct

📦 Install Lib

pip install -U transformers

⚖️ Nexus Index V2.0

Methodology Index Protocol

39.3

TOP 6% SYSTEM IMPACT

Semantic (S) 50

Authority (A) 0

Popularity (P) 50

Recency (R) 52

Quality (Q) 65

💬 Index Insight

FNI V2.0 for Falcon3 7b Instruct: Semantic (S:50), Authority (A:0), Popularity (P:50), Recency (R:52), Quality (Q:65).

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Technical Deep Dive

Falcon3-7B-Instruct

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B.

This repository contains the Falcon3-7B-Instruct. It achieves state of art results (at the time of release) on reasoning, language understanding, instruction following, code and mathematics tasks. Falcon3-7B-Instruct supports 4 languages (english, french, spanish, portuguese) and a context length up to 32K.

Model Details

Architecture
- Transformer based causal decoder only architecture
- 28 decoder blocks
- Grouped query attention (GQA) for faster inference: 12 query heads and 4 key value heads
- Wider head dimension: 256
- High RoPE value to support long context understanding: 1000042
- Uses SwiGLU and RMSNorm
- 32K context length
- 131K vocab size
Pretrained on 14 Teratokens of datasets comprising of web, code, STEM, high quality and mutlilingual data using 1024 H100 GPU chips
Postrained on 1.2 million samples of STEM, conversations, code, safety and function call data
Supports EN, FR, ES, PT
Developed by Technology Innovation Institute
License: TII Falcon-LLM License 2.0
Model Release Date: December 2024

Getting started

Click to expand

python


from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "tiiuae/Falcon3-7B-Instruct"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "How many hours in one day?"
messages = [
    {"role": "system", "content": "You are a helpful friendly assistant Falcon3 from TII, try to follow instructions as much as possible."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=1024
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Benchmarks

We report the official HuggingFace leaderboard normalized evaluations Open LLM Leaderboard Evaluation Results in the following table.

Benchmark	Llama-3.1-8B-Instruct	Qwen2.5-7B-Instruct	Falcon3-7B-Instruct
IFEval	78.56	75.85	76.12
BBH (3-shot)	29.89	34.89	37.92
MATH Lvl-5 (4-shot)	19.34	0.00	31.87
GPQA (0-shot)	2.35	5.48	8.05
MUSR (0-shot)	8.41	8.45	21.17
MMLU-PRO (5-shot)	30.68	36.52	34.30

Also, we report in the following table our internal pipeline benchmarks.

We use lm-evaluation harness.
We report raw scores obtained by applying chat template and fewshot_as_multiturn.
We use same batch-size across all models.

Category	Benchmark	Llama-3.1-8B-Instruct	Qwen2.5-7B-Instruct	Falcon3-7B-Instruct
General	MMLU (5-shot)	68.2	73.5	70.5
	MMLU-PRO (5-shot)	36.4	43.1	40.7
	IFEval	78.8	74.7	76.5
Math	GSM8K (5-shot)	82.6	72.0	81.4
	GSM8K (8-shot, COT)	85.4	76.6	79.7
	MATH Lvl-5 (4-shot)	15.4	-	29.4
Reasoning	Arc Challenge (25-shot)	58.6	57.8	62.6
	GPQA (0-shot)	33.5	32	31.9
	GPQA (0-shot, COT)	9.6	13.8	22.3
	MUSR (0-shot)	38.6	41	46.4
	BBH (3-shot)	48.6	54.1	52.4
CommonSense Understanding	PIQA (0-shot)	78.9	73.7	78.8
	SciQ (0-shot)	80.2	50.9	94.7
	Winogrande (0-shot)	-	-	70.4
	OpenbookQA (0-shot)	46.2	42.4	45.8
Instructions following	MT-Bench (avg)	7.9	8.5	8.4
Instructions following	Alpaca (WC)	26.6	31.5	26.1
Tool use	BFCL AST (avg)	90.6	91.4	89.5

Useful links

View our release blogpost.
Feel free to join our discord server if you have any questions or to interact with our researchers and developers.

Technical Report

Coming soon....

Citation

If Falcon3 family were helpful to your work, feel free to give us a cite.

text

@misc{Falcon3,
    title = {The Falcon 3 family of Open Models},
    author = {TII Team},
    month = {December},
    year = {2024}
}

⚠️ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.

Top Tier

Social Proof

HuggingFace Hub

24.8KDownloads

Hub Discussions

🤗 Data Source: Hugging Face ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on Hugging Face metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: hf-model--tiiuae--falcon3-7b-instruct
slug: tiiuae--falcon3-7b-instruct
source: huggingface
author: tiiuae
license: Other
tags: transformers, safetensors, llama, text-generation, falcon3, conversational, en, fr, es, pt, base_model:tiiuae/falcon3-7b-base, base_model:finetune:tiiuae/falcon3-7b-base, license:other, text-generation-inference, endpoints_compatible, region:us, 5.4B

⚙️ Technical Specs

architecture: LlamaForCausalLM
params billions: 7.46
context length: 8,192
pipeline tag: text-generation
vram gb: 6.9
vram is estimated: true
vram formula: VRAM ≈ (params * 0.75) + 0.8GB (KV) + 0.5GB (OS)

📊 Engagement & Metrics

downloads: 24,768
stars: 0
forks: 0

Data indexed from public sources. Updated daily.

Welcome to Free2AI Tools!

Smart Search

FNI Score

You're All Set!