Deploy Agentic Rag
A powerful Retrieval-Augmented Generation (RAG) system built with CrewAI agents, featuring a REST API and a beautiful Gradio web interface. This project demonstrates how to create intelligent AI agents that can research queries, synthesize information, and provide comprehensive responses. !Agentic RAG Demo - **Multi-Agent Architecture**: Researcher and Writer agents working together - **Web Search Integration**: Powered by SerperDev tools for real-time information - **REST API**: Clean API en...
| Entity Passport | |
| Registry ID | gh-tool--eloiramos--deploy-agentic-rag |
| Provider | github |
Cite this tool
Academic & Research Attribution
@misc{gh_tool__eloiramos__deploy_agentic_rag,
author = {EloiRamos},
title = {Deploy Agentic Rag Tool},
year = {2026},
howpublished = {\url{https://github.com/EloiRamos/deploy-agentic-rag}},
note = {Accessed via Free2AITools Knowledge Fortress}
} đŦTechnical Deep Dive
Full Specifications [+]âž
Quick Commands
git clone https://github.com/EloiRamos/deploy-agentic-rag pip install deploy-agentic-rag âī¸ Nexus Index V16.5
đŦ Index Insight
The Free2AITools Nexus Index for Deploy Agentic Rag aggregates Popularity (P:0), Freshness (F:0), and Completeness (C:0). The Utility score (U:0) represents deployment readiness and ecosystem adoption.
Verification Authority
đ Specs
- Language
- Python
- License
- Open Source
- Version
- 1.0.0
Usage documentation not yet indexed for this tool.
đ View Source Code âTechnical Documentation
đ¤ Agentic RAG API
A powerful Retrieval-Augmented Generation (RAG) system built with CrewAI agents, featuring a REST API and a beautiful Gradio web interface. This project demonstrates how to create intelligent AI agents that can research queries, synthesize information, and provide comprehensive responses.
đ Features
- Multi-Agent Architecture: Researcher and Writer agents working together
- Web Search Integration: Powered by SerperDev tools for real-time information
- REST API: Clean API endpoints using LitServe
- Beautiful Web UI: Interactive Gradio interface
- Command Line Client: Simple CLI for testing and automation
- Production Ready: Proper error handling and logging
đ Privacy & Cost Benefits
This project leverages Qwen 3 4B via Ollama as a local language model, providing:
- đ Complete Data Privacy - All inference happens on your machine
- đ° Zero API Costs - No charges for LLM calls
- ⥠Fast Responses - No network latency
- đ Offline Capability - Works without internet (except web search)
đī¸ Architecture
âââââââââââââââââââ ââââââââââââââââââââ âââââââââââââââââââ
â Web Client â â REST API â â CrewAI Agents â
â (Gradio UI) âââââēâ (LitServe) âââââēâ (Researcher â
â â â â â + Writer) â
âââââââââââââââââââ ââââââââââââââââââââ âââââââââââââââââââ
â
âŧ
ââââââââââââââââââââ
â SerperDev â
â Web Search â
ââââââââââââââââââââ
đ¤ How the Agents Work
This project uses a multi-agent architecture powered by CrewAI, where specialized AI agents collaborate to provide intelligent responses:
Agent Roles and Workflow
Researcher Agent đ
- Role: Information gathering and analysis
- Tools: SerperDev web search integration
- Goal: Research the user's query and generate comprehensive insights
- Process:
- Receives the user's query
- Uses web search to find relevant, up-to-date information
- Analyzes and synthesizes findings
- Produces research insights and context
Writer Agent âī¸
- Role: Response synthesis and communication
- Goal: Transform research insights into clear, informative responses
- Process:
- Takes the researcher's insights as input
- Crafts a concise, well-structured response
- Ensures the response is accurate and easy to understand
- Formats the final answer for the user
The Collaboration Process
User Query â Researcher Agent â Web Search â Analysis â Writer Agent â Final Response
â â â â â â
"What is Researches & Searches Synthesizes Writes "Machine
machine gathers info current research clear learning
learning?" from web data results response" is..."
Key Features
- Sequential Processing: Agents work in sequence, each building on the previous agent's work
- Tool Integration: Real-time web search for current information
- Context Preservation: Information flows from researcher to writer seamlessly
- Quality Assurance: Each agent has specialized expertise for their role
- Error Handling: Robust error handling ensures reliable responses
Technical Implementation
- Framework: CrewAI for agent orchestration
- Language Model: Ollama Qwen 3 4B for intelligent responses
- Tools: SerperDev for web search capabilities
- API: LitServe for production-ready serving
- UI: Gradio for beautiful web interface
đ Quick Start
Prerequisites
- Python 3.11 or higher
- uv package manager (recommended)
- Internet connection for web search functionality
Installation
Clone the repository
bashgit clonecd deploy-agentic-rag Install dependencies
bashuv syncSet up environment variables
bash# Copy the example environment file cp .env.example .env # Edit .env and add your actual API key # Get your SerperDev API key from: https://serper.dev SERPER_API_KEY="your-actual-serper-api-key-here"
Run the Application
Start the API server
bashuv run python server.pyThe API will be available at
http://localhost:8000Start the web interface (in a new terminal)
bashuv run python gradio_ui.pyThe web UI will be available at
http://localhost:7860Test with the CLI client
bashuv run python client.py --query "What is machine learning?"
đą Screenshots
Agentic RAG Interface showcasing intelligent AI responses to complex queries about AI breakthroughs
Usage
Web Interface
- Open
http://localhost:7860in your browser - Enter your query in the text box
- Click "Submit Query" or press Enter
- View the intelligent response generated by the AI agents
REST API
Endpoint: POST /predict
Request:
{
"query": "Explain quantum computing in simple terms"
}
Response:
{
"output": {
"raw": "Quantum computing is a type of computing that uses quantum mechanics...",
"tasks_output": [...],
"token_usage": {...}
}
}
Command Line
# Simple query
uv run python client.py --query "What is the capital of France?"
# The client displays responses with a typewriter effect
đ§ Configuration
API Configuration
- Server URL:
http://localhost:8000 - Timeout: 60 seconds
- Model: Ollama Qwen 3 4B (configurable in
server.py)
Agent Configuration
Researcher Agent:
- Role: Research and gather information
- Tools: SerperDev web search
- Goal: Generate comprehensive insights
Writer Agent:
- Role: Synthesize information into clear responses
- Goal: Provide concise, informative answers
đ Project Structure
deploy-agentic-rag/
âââ server.py # Main API server using LitServe
âââ client.py # Command-line client
âââ gradio_ui.py # Web interface
âââ pyproject.toml # Project configuration
âââ README.md # This file
đ ī¸ Development
Adding New Features
- Custom Agents: Modify
server.pyto add new agent types - Additional Tools: Extend the Researcher agent with more tools
- UI Enhancements: Customize
gradio_ui.pyfor new features
Testing
# Run tests
uv run pytest
# Code formatting
uv run black .
uv run flake8 .
đ API Reference
Health Check
- GET
/health- Check server status
Predictions
- POST
/predict- Submit queries for processing
đ¤ Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
đ License
This project is licensed under the MIT License - see the LICENSE file for details.
đ Acknowledgments
- CrewAI for the amazing agent framework
- LitServe for the lightweight API serving
- Gradio for the beautiful web interface
- SerperDev for web search capabilities
Made with â¤ī¸ using CrewAI Agents
đ Quick Start
git clone
cd deploy-agentic-rag
Social Proof
AI Summary: Based on GitHub metadata. Not a recommendation.
đĄī¸ Tool Transparency Report
Verified data manifest for traceability and transparency.
đ Identity & Source
- id
- gh-tool--eloiramos--deploy-agentic-rag
- source
- github
- author
- EloiRamos
- tags
- crewaigradio-uilitserveragserperdevpython
âī¸ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
- feature-extraction
đ Engagement & Metrics
- likes
- 0
- downloads
- 0
- github stars
- 1
Free2AITools Constitutional Data Pipeline: Curated disclosure mode active. (V15.x Standard)