🧠

Model

MiniRAG

Name: MiniRAG
Author: HKUDS

by HKUDS gh-tool--hkuds--minirag

Free2AITools Nexus Index

43.8 Top 100%

S: Semantic 50

A: Authority 0

P: Popularity 62

R: Recency 67

Q: Quality 70

Tech Context

Vital Performance

0 DL / 30D

0.0%

Source →

Audited 43.8 FNI Score

Tiny - Params

- Context

0 Downloads

Commercial MIT License

Model Information Summary
Entity Passport
Registry ID	gh-tool--hkuds--minirag
License	MIT
Provider	github

📜

Cite this model

Academic & Research Attribution

BibTeX

@misc{gh_tool__hkuds__minirag,
  author = {HKUDS},
  title = {MiniRAG Model},
  year = {2026},
  howpublished = {\url{https://github.com/HKUDS/MiniRAG}},
  note = {Accessed via Free2AITools Knowledge Fortress}
}

APA Style

HKUDS. (2026). MiniRAG [Model]. Free2AITools. https://github.com/HKUDS/MiniRAG

🔬Technical Deep Dive

Full Specifications [+]

Quick Commands

🐙 Git Clone

git clone https://github.com/HKUDS/MiniRAG

⚖️ Free2AITools Nexus Index V2.0

Methodology Index Protocol

Semantic (S) 50

Authority (A) 0

Popularity (P) 62

Recency (R) 67

Quality (Q) 70

💬 Index Insight

FNI V2.0 for MiniRAG: Semantic (S:50), Authority (A:0), Popularity (P:62), Recency (R:67), Quality (Q:70).

Free2AITools Nexus Index

Verification Authority

HuggingFace API GitHub Metadata Arxiv Citation DB System Audit

Unbiased Data Node Refresh: VFS Live

---

🚀 What's Next?

📊

Find Training Datasets

Discover datasets compatible with this model

📈

Compare Benchmarks

See how this model ranks on standard tests

⚡

Technical Deep Dive

MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

MiniRAG

The Code Repository: MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

Tianyu Fan, Jingyuan Wang, Xubin Ren, Chao Huang* (*Correspondence)

🎉 News

[2025.02.27]🎯📢Now you can use pip install minirag-hku to run our code!
[2025.02.14]🎯📢Now MiniRAG supports 10+ heterogeneous graph databases, including Neo4j, PostgreSQL, TiDB, etc. Happy valentine's day!🌹🌹🌹
[2025.02.05]🎯📢Our team has released VideoRAG understanding extremely long-context videos.
[2025.02.01]🎯📢Now MiniRAG supports API&Docker deployment. see This for more details.

TLDR

MiniRAG is an extremely simple retrieval-augmented generation framework that enables small models to achieve good RAG performance through heterogeneous graph indexing and lightweight topology-enhanced retrieval.

Abstract

The growing demand for efficient and lightweight Retrieval-Augmented Generation (RAG) systems has highlighted significant challenges when deploying Small Language Models (SLMs) in existing RAG frameworks. Current approaches face severe performance degradation due to SLMs' limited semantic understanding and text processing capabilities, creating barriers for widespread adoption in resource-constrained scenarios. To address these fundamental limitations, we present MiniRAG, a novel RAG system designed for extreme simplicity and efficiency. MiniRAG introduces two key technical innovations: (1) a semantic-aware heterogeneous graph indexing mechanism that combines text chunks and named entities in a unified structure, reducing reliance on complex semantic understanding, and (2) a lightweight topology-enhanced retrieval approach that leverages graph structures for efficient knowledge discovery without requiring advanced language capabilities. Our extensive experiments demonstrate that MiniRAG achieves comparable performance to LLM-based methods even when using SLMs while requiring only 25% of the storage space. Additionally, we contribute a comprehensive benchmark dataset LiHua-World for evaluating lightweight RAG systems under realistic on-device scenarios with complex queries.

MiniRAG Framework

MiniRAG

MiniRAG employs a streamlined workflow built on the key components: heterogeneous graph indexing and lightweight graph-based knowledge retrieval. This architecture addresses the unique challenges faced by on-device RAG systems, optimizing for both efficiency and effectiveness.

Install

Install from source (Recommend)

bash

cd MiniRAG
pip install -e .

Install from PyPI (Our code is based on LightRAG, so you can install it directly)

bash

pip install lightrag-hku

Quick Start

All the code can be found in the ./reproduce.
Download the dataset you need.
Put the dataset in the ./dataset directory.
Note: We have already put the LiHua-World dataset in ./dataset/LiHua-World/data/ as LiHuaWorld.zip. If you want to use other dataset, you can put it in the ./dataset/xxx.

Then use the following bash command to index the dataset:

bash

python ./reproduce/Step_0_index.py
python ./reproduce/Step_1_QA.py

Or, use the code in ./main.py to initialize MiniRAG.

Overall Performance Table

Model	NaiveRAG		GraphRAG		LightRAG		MiniRAG
	acc↑	err↓	acc↑	err↓	acc↑	err↓	acc↑	err↓
LiHua-World
Phi-3.5-mini-instruct	41.22%	23.20%	/	/	39.81%	25.39%	53.29%	23.35%
GLM-Edge-1.5B-Chat	42.79%	24.76%	/	/	35.74%	25.86%	52.51%	25.71%
Qwen2.5-3B-Instruct	43.73%	24.14%	/	/	39.18%	28.68%	48.75%	26.02%
MiniCPM3-4B	43.42%	17.08%	/	/	35.42%	21.94%	51.25%	21.79%
gpt-4o-mini	46.55%	19.12%	35.27%	37.77%	56.90%	20.85%	54.08%	19.44%
MultiHop-RAG
Phi-3.5-mini-instruct	42.72%	31.34%	/	/	27.03%	11.78%	49.96%	28.44%
GLM-Edge-1.5B-Chat	44.44%	24.26%	/	/	/	/	51.41%	23.44%
Qwen2.5-3B-Instruct	39.48%	31.69%	/	/	21.91%	13.73%	48.55%	33.10%
MiniCPM3-4B	39.24%	31.42%	/	/	19.48%	10.41%	47.77%	26.88%
gpt-4o-mini	53.60%	27.19%	60.92%	16.86%	64.91%	19.37%	68.43%	19.41%

In the table, / means the method struggles to generate effective responses.

Reproduce

All the code can be found in the ./reproduce directory.

Code Structure

python

├── dataset
│   └── LiHua-World
│       ├── README.md
│       ├── README_CN.md
│       ├── data
│       │   ├── LiHuaWorld.zip
│       └── qa
│           ├── query_set.csv
│           └── query_set.json
├── minirag
│   ├── kg
│   │   ├── __init__.py
│   │   ├── neo4j_impl.py
│   │   └── oracle_impl.py
│   ├── __init__.py
│   ├── base.py
│   ├── exceptions.py
│   ├── llm.py
│   ├── minirag.py
│   ├── operate.py
│   ├── prompt.py
│   ├── storage.py
│   └── utils.py
├── reproduce
│   ├── Step_0_index.py
│   └── Step_1_QA.py
├── LICENSE
├── main.py
├── README.md
├── README_CN.md
├── requirements.txt
├── setup.py

Dataset: LiHua-World

LiHuaWorld

LiHua-World is a dataset specifically designed for on-device RAG scenarios, containing one year of chat records from a virtual user named LiHua. The dataset includes three types of questions: single-hop, multi-hop, and summary, with each question paired with manually annotated answers and supporting documents. For more details, please refer to the README of LiHua-World dataset.

Star History

Contribution

Thank you to all our contributors!

Acknowledgements

You may refer to related work that serves as foundations for our framework and code repository, nano-graphrag and LightRAG. Thanks for their wonderful works.

🌟Citation

python

@article{fan2025minirag,
  title={MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation},
  author={Fan, Tianyu and Wang, Jingyuan and Ren, Xubin and Huang, Chao},
  journal={arXiv preprint arXiv:2501.06713},
  year={2025}
}

Thank you for your interest in our work!

🚀 Quick Start

bash

cd MiniRAG
pip install -e .

⚠️ Incomplete Data

Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.

View Original Source →

📝 Limitations & Considerations

• Benchmark scores may vary based on evaluation methodology and hardware configuration.
• VRAM requirements are estimates; actual usage depends on quantization and batch size.
• FNI scores are relative rankings and may change as new models are added.
⚠ License Unknown: Verify licensing terms before commercial use.

Social Proof

GitHub Repository

248Forks

Repo Issues

🐙 Data Source: GitHub ↗

🔄 Daily sync (03:00 UTC)

AI Summary: Based on GitHub metadata. Not a recommendation.

📊 FNI Methodology 📚 Knowledge Baseℹ️ Verify with original source

🛡️ Model Transparency Report

Technical metadata sourced from upstream repositories.

Open Metadata

🆔 Identity & Source

id: gh-tool--hkuds--minirag
slug: hkuds--minirag
source: github
author: HKUDS
license: MIT
tags: large-language-models, rag, retrieval-augmented-generation, python

⚙️ Technical Specs

architecture: null
params billions: null
context length: null
pipeline tag: other

📊 Engagement & Metrics

downloads: 0
stars: 0
forks: 248

Data indexed from public sources. Updated daily.

MiniRAG

Cite this model

🔬Technical Deep Dive

Quick Commands

⚖️ Free2AITools Nexus Index V2.0

💬 Index Insight

Verification Authority

🚀 What's Next?

Find Training Datasets

Compare Benchmarks

Deployment Guide

Technical Deep Dive

MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

🌍 README Translations

🎉 News

TLDR

Abstract

MiniRAG Framework

Install

Quick Start

Overall Performance Table

Reproduce

Code Structure

Dataset: LiHua-World

Star History

Contribution

Acknowledgements

🌟Citation

🚀 Quick Start

⚠️ Incomplete Data

📝 Limitations & Considerations

Social Proof

🛡️ Model Transparency Report

🆔 Identity & Source

⚙️ Technical Specs

📊 Engagement & Metrics