MiniRAG
| Entity Passport | |
| Registry ID | gh-tool--hkuds--minirag |
| License | MIT |
| Provider | github |
Cite this model
Academic & Research Attribution
@misc{gh_tool__hkuds__minirag,
author = {HKUDS},
title = {MiniRAG Model},
year = {2026},
howpublished = {\url{https://github.com/HKUDS/MiniRAG}},
note = {Accessed via Free2AITools Knowledge Fortress}
} π¬Technical Deep Dive
Full Specifications [+]βΎ
Quick Commands
git clone https://github.com/HKUDS/MiniRAG βοΈ Free2AITools Nexus Index V2.0
π¬ Index Insight
FNI V2.0 for MiniRAG: Semantic (S:50), Authority (A:0), Popularity (P:62), Recency (R:67), Quality (Q:70).
Verification Authority
π What's Next?
Technical Deep Dive
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation

The Code Repository: MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation
Tianyu Fan, Jingyuan Wang, Xubin Ren, Chao Huang* (*Correspondence)
π README Translations
π News
- [2025.02.27]π―π’Now you can use
pip install minirag-hkuto run our code! - [2025.02.14]π―π’Now MiniRAG supports 10+ heterogeneous graph databases, including Neo4j, PostgreSQL, TiDB, etc. Happy valentine's day!πΉπΉπΉ
- [2025.02.05]π―π’Our team has released VideoRAG understanding extremely long-context videos.
- [2025.02.01]π―π’Now MiniRAG supports API&Docker deployment. see This for more details.
TLDR
MiniRAG is an extremely simple retrieval-augmented generation framework that enables small models to achieve good RAG performance through heterogeneous graph indexing and lightweight topology-enhanced retrieval.
Abstract
The growing demand for efficient and lightweight Retrieval-Augmented Generation (RAG) systems has highlighted significant challenges when deploying Small Language Models (SLMs) in existing RAG frameworks. Current approaches face severe performance degradation due to SLMs' limited semantic understanding and text processing capabilities, creating barriers for widespread adoption in resource-constrained scenarios. To address these fundamental limitations, we present MiniRAG, a novel RAG system designed for extreme simplicity and efficiency. MiniRAG introduces two key technical innovations: (1) a semantic-aware heterogeneous graph indexing mechanism that combines text chunks and named entities in a unified structure, reducing reliance on complex semantic understanding, and (2) a lightweight topology-enhanced retrieval approach that leverages graph structures for efficient knowledge discovery without requiring advanced language capabilities. Our extensive experiments demonstrate that MiniRAG achieves comparable performance to LLM-based methods even when using SLMs while requiring only 25% of the storage space. Additionally, we contribute a comprehensive benchmark dataset LiHua-World for evaluating lightweight RAG systems under realistic on-device scenarios with complex queries.
MiniRAG Framework

MiniRAG employs a streamlined workflow built on the key components: heterogeneous graph indexing and lightweight graph-based knowledge retrieval. This architecture addresses the unique challenges faced by on-device RAG systems, optimizing for both efficiency and effectiveness.
Install
- Install from source (Recommend)
cd MiniRAG
pip install -e .
- Install from PyPI (Our code is based on LightRAG, so you can install it directly)
pip install lightrag-hku
Quick Start
- All the code can be found in the
./reproduce. - Download the dataset you need.
- Put the dataset in the
./datasetdirectory. - Note: We have already put the LiHua-World dataset in
./dataset/LiHua-World/data/asLiHuaWorld.zip. If you want to use other dataset, you can put it in the./dataset/xxx.
Then use the following bash command to index the dataset:
python ./reproduce/Step_0_index.py
python ./reproduce/Step_1_QA.py
Or, use the code in ./main.py to initialize MiniRAG.
Overall Performance Table
| Model | NaiveRAG | GraphRAG | LightRAG | MiniRAG | ||||
|---|---|---|---|---|---|---|---|---|
| accβ | errβ | accβ | errβ | accβ | errβ | accβ | errβ | |
| LiHua-World | ||||||||
| Phi-3.5-mini-instruct | 41.22% | 23.20% | / | / | 39.81% | 25.39% | 53.29% | 23.35% |
| GLM-Edge-1.5B-Chat | 42.79% | 24.76% | / | / | 35.74% | 25.86% | 52.51% | 25.71% |
| Qwen2.5-3B-Instruct | 43.73% | 24.14% | / | / | 39.18% | 28.68% | 48.75% | 26.02% |
| MiniCPM3-4B | 43.42% | 17.08% | / | / | 35.42% | 21.94% | 51.25% | 21.79% |
| gpt-4o-mini | 46.55% | 19.12% | 35.27% | 37.77% | 56.90% | 20.85% | 54.08% | 19.44% |
| MultiHop-RAG | ||||||||
| Phi-3.5-mini-instruct | 42.72% | 31.34% | / | / | 27.03% | 11.78% | 49.96% | 28.44% |
| GLM-Edge-1.5B-Chat | 44.44% | 24.26% | / | / | / | / | 51.41% | 23.44% |
| Qwen2.5-3B-Instruct | 39.48% | 31.69% | / | / | 21.91% | 13.73% | 48.55% | 33.10% |
| MiniCPM3-4B | 39.24% | 31.42% | / | / | 19.48% | 10.41% | 47.77% | 26.88% |
| gpt-4o-mini | 53.60% | 27.19% | 60.92% | 16.86% | 64.91% | 19.37% | 68.43% | 19.41% |
In the table, / means the method struggles to generate effective responses.
Reproduce
All the code can be found in the ./reproduce directory.
Code Structure
βββ dataset
β βββ LiHua-World
β βββ README.md
β βββ README_CN.md
β βββ data
β β βββ LiHuaWorld.zip
β βββ qa
β βββ query_set.csv
β βββ query_set.json
βββ minirag
β βββ kg
β β βββ __init__.py
β β βββ neo4j_impl.py
β β βββ oracle_impl.py
β βββ __init__.py
β βββ base.py
β βββ exceptions.py
β βββ llm.py
β βββ minirag.py
β βββ operate.py
β βββ prompt.py
β βββ storage.py
β βββ utils.py
βββ reproduce
β βββ Step_0_index.py
β βββ Step_1_QA.py
βββ LICENSE
βββ main.py
βββ README.md
βββ README_CN.md
βββ requirements.txt
βββ setup.py
Dataset: LiHua-World

LiHua-World is a dataset specifically designed for on-device RAG scenarios, containing one year of chat records from a virtual user named LiHua. The dataset includes three types of questions: single-hop, multi-hop, and summary, with each question paired with manually annotated answers and supporting documents. For more details, please refer to the README of LiHua-World dataset.
Star History
Contribution
Thank you to all our contributors!
Acknowledgements
You may refer to related work that serves as foundations for our framework and code repository, nano-graphrag and LightRAG. Thanks for their wonderful works.
πCitation
@article{fan2025minirag,
title={MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation},
author={Fan, Tianyu and Wang, Jingyuan and Ren, Xubin and Huang, Chao},
journal={arXiv preprint arXiv:2501.06713},
year={2025}
}
Thank you for your interest in our work!
π Quick Start
cd MiniRAG
pip install -e .
β οΈ Incomplete Data
Some information about this model is not available. Use with Caution - Verify details from the original source before relying on this data.
View Original Source βπ Limitations & Considerations
- β’ Benchmark scores may vary based on evaluation methodology and hardware configuration.
- β’ VRAM requirements are estimates; actual usage depends on quantization and batch size.
- β’ FNI scores are relative rankings and may change as new models are added.
- β License Unknown: Verify licensing terms before commercial use.
Social Proof
AI Summary: Based on GitHub metadata. Not a recommendation.
π‘οΈ Model Transparency Report
Technical metadata sourced from upstream repositories.
π Identity & Source
- id
- gh-tool--hkuds--minirag
- slug
- hkuds--minirag
- source
- github
- author
- HKUDS
- license
- MIT
- tags
- large-language-models, rag, retrieval-augmented-generation, python
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
- other
π Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- 248
Data indexed from public sources. Updated daily.