HyDRA: Hybrid Dynamic RAG Agents
An advanced, agentic AI framework that transforms Retrieval-Augmented Generation from a static pipeline into a dynamic, learning reasoning system.
<p align="center"> <a href="https://github.com/hassenhamdi/HyDRA/stargazers"> <img src="https://img.shields.io/github/stars/hassenhamdi/HyDRA?style=social" alt="GitHub Stars"> </a> <a href="https://github.com/hassenhamdi/HyDRA/blob/main/LICENSE"> <img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"> </a> <a href="https://www.python.org/downloads/release/python-3100/"> <img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"> </a> <a href="https://milvus.io/"> <img src="https://img.shields.io/badge/Vector%20DB-Milvus-blue" alt="Milvus"> </a> </p>

HyDRA Website
Announcement:
- New release: a more powerful overhaul. For the full changelog, see Changelog.md.
🎬 Project Demo
See HyDRA in action! This video showcases the iterative reasoning process, the dynamic TUI, and the agent's ability to learn and adapt. (😁 Rest assured, suppressing the noisy logging output is on the roadmap.)
https://github.com/user-attachments/assets/327a96a7-e45e-474c-9984-9d63032d5378
Table of Contents
- Why HyDRA?
- The HyDRA Approach
- ✨ Core Features
- Architectural Overview
- 🛠️ Technical Deep Dive
- 🚀 Installation & Setup
- 💻 Usage
- 📝 Future Roadmap
- 🤝 Contributing
- Acknowledgements & Foundational Work
- License
Why HyDRA?
The world of Retrieval-Augmented Generation is evolving at a breakneck pace. Groundbreaking ideas are published monthly, but they often exist in isolation within academic papers or specific repositories. HyDRA was born from a simple question: What would a system look like if we fused the best of these ideas into a single, cohesive, and practical framework?
HyDRA is an ambitious attempt to synthesize and build upon the core principles of several leading-edge projects:
- It adopts the robust, three-layer agentic structure from HiRA for a clean separation of strategy and execution.
- It implements the multi-agent, multi-source retrieval philosophy of HM-RAG.
- It leverages the HyDE technique to bridge the semantic gap between user queries and stored documents.
- It is powered by Milvus, used not just as a vector store but as a unified backend for hybrid search, RRF reranking, and agent memory.
- It utilizes the full potential of the BGE-M3 model for state-of-the-art dense and sparse embeddings.
HyDRA is our answer to building a RAG system that is not just powerful, but also intelligent, adaptive, and architecturally sound.
🧠 The HyDRA Approach
HyDRA's intelligence is built on three foundational pillars that work in concert:
1. Hierarchical Agents
A clear separation of concerns ensures robust and predictable behavior.
- Meta-Planner: The strategist. It analyzes the user's query and the conversation history to determine the next logical step.
- Adaptive Coordinator: The manager. It receives a task from the planner and delegates it to the most suitable specialist agent, guided by past performance data.
- Executors: The specialists. A pool of agents with distinct tools, such as the `AdvancedVectorSearchAgent` for querying the internal knowledge base or the `DeepSearchAgent` for performing live web research.
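The division of labor above can be sketched with a few minimal Python classes. The names here (`Coordinator`, `SubTask`) and the callable-based executors are illustrative assumptions, not HyDRA's actual API:

```python
# Sketch of HyDRA's planner/coordinator/executor separation of concerns.
# Class and method names are illustrative, not HyDRA's real interfaces.
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class SubTask:
    kind: str   # e.g. "vector_search" or "web_search", chosen by the planner
    query: str


class Coordinator:
    """Delegates a sub-task to the most suitable executor from a pool."""

    def __init__(self, executors: Dict[str, Callable[[str], str]]):
        self.executors = executors

    def delegate(self, task: SubTask) -> str:
        return self.executors[task.kind](task.query)


# Executors are plain callables in this sketch.
def vector_search(query: str) -> str:
    return f"[kb results for: {query}]"


def web_search(query: str) -> str:
    return f"[web results for: {query}]"


coordinator = Coordinator({"vector_search": vector_search,
                           "web_search": web_search})
print(coordinator.delegate(SubTask("web_search", "latest Milvus release")))
```

In the real system the coordinator's choice is also informed by learned policies (see the HELP section below), not just the task kind.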
2. Iterative Reasoning (ReAct)
Unlike traditional RAG pipelines that execute a fixed plan, HyDRA employs a dynamic Reasoning-Acting loop.
- The `Meta-Planner` observes the current state and decides on the single best action to take next.
- The `Coordinator` delegates this action to an `Executor`, which performs the task (e.g., a web search).
- The result, or observation, is returned and appended to the history.
- The loop repeats, with the planner using the full history of actions and observations to inform its next decision.
This allows HyDRA to tackle complex, multi-hop questions, recover from failed steps, and adjust its strategy on the fly.
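The loop described above can be sketched in a few lines; the `planner` policy here is a toy stand-in for the real `Meta-Planner`, and the function names are hypothetical:

```python
# Illustrative ReAct-style loop: the planner sees the full history of
# (action, observation) pairs and either emits the next action or stops.
# This is a simplified sketch, not HyDRA's actual control flow.

def planner(query, history):
    # Toy policy: search the web once, then finish.
    if not history:
        return ("web_search", query)
    return ("finish", None)


def execute(action, arg):
    if action == "web_search":
        return f"[search results for: {arg}]"
    raise ValueError(f"unknown action: {action}")


def react_loop(query, max_steps=5):
    history = []
    for _ in range(max_steps):
        action, arg = planner(query, history)
        if action == "finish":
            break
        observation = execute(action, arg)   # act
        history.append((action, observation))  # observe
    return history


print(react_loop("What changed in the latest Milvus release?"))
```

Because each iteration re-plans against the accumulated history, a failed step simply becomes an observation the planner can route around on the next pass.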
3. Autonomous Learning (HELP/SIMPSON)
The Heuristic Experience-based Learning Policy (HELP) system is HyDRA's long-term memory and self-improvement mechanism. After every user interaction, a four-stage learning cycle begins:
- Observe: The `PostInteractionAnalyzer` agent reviews the full transcript of the conversation.
- Critique: It evaluates the efficiency of each step, identifying which agent delegations were successful and which were not.
- Memorize: It formulates and stores a concise, actionable "policy" in its Milvus memory (e.g., "For recent events, web search is more effective than vector search"). It also learns the user's implicit preferences (e.g., "Prefers bullet-pointed lists").
- Adapt: The next time the `AdaptiveCoordinator` faces a similar task, it retrieves this learned policy as "strategic guidance," enabling it to make smarter, experience-based decisions.
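A toy version of the Memorize and Adapt steps, using simple word overlap in place of HyDRA's Milvus-backed embedding retrieval (`memorize` and `guidance_for` are hypothetical names):

```python
# Toy policy memory: the analyzer stores a short policy string per task
# description; the coordinator later retrieves the best-matching policy as
# strategic guidance. HyDRA stores these as embeddings in Milvus; this
# sketch substitutes word overlap for vector similarity.

policies = []


def memorize(task_description: str, policy: str) -> None:
    policies.append((set(task_description.lower().split()), policy))


def guidance_for(task_description: str):
    words = set(task_description.lower().split())
    scored = [(len(words & task_words), policy)
              for task_words, policy in policies]
    best = max(scored, default=(0, None))
    return best[1] if best[0] > 0 else None


memorize("question about recent events",
         "prefer web search over vector search")
print(guidance_for("summarize recent events in AI"))
# -> prefer web search over vector search
```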
✨ Core Features
- ✅ Three-Layer Agentic Architecture: `Meta-Planner` for strategy, `AdaptiveCoordinator` for delegation, and specialized `Executors` for task execution.
- ✅ Iterative ReAct-style Agents: Moves beyond static plans to dynamic, multi-step reasoning for complex problem-solving.
- ✅ Continuous Self-Improvement (HELP/SIMPSON): A long-term learning loop that analyzes past performance to optimize future agent delegation and planning.
- ✅ State-of-the-Art Retrieval Pipeline: Combines Hybrid Search (dense + sparse vectors), Reciprocal Rank Fusion (RRF), and a final BGE Reranker for maximum precision.
- ✅ Adaptive Retrieval Strategies: The `AdvancedVectorSearchAgent` can autonomously use techniques like HyDE for conceptual queries or perform multiple refined searches.
- ✅ Interactive TUI with Streaming & Knowledge Management: A rich Terminal User Interface with streaming responses and commands (`/save`, `/ingest`) to curate the agent's knowledge base.
- ✅ Configurable Deployment Profiles: Easily switch between `development`, `production_balanced`, and `hyperscale` profiles to manage performance and resource trade-offs.
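One plausible shape for the deployment profiles, using the index types mentioned in the Technical Deep Dive below; the actual keys and values in HyDRA's configuration may differ:

```python
# Hypothetical deployment-profile table: each profile trades precision and
# resource usage differently. The exact settings HyDRA uses may differ.
PROFILES = {
    "development":         {"index_type": "FLAT",       "reranker": False},
    "production_balanced": {"index_type": "HNSW_SQ8",   "reranker": True},
    "hyperscale":          {"index_type": "IVF_RABITQ", "reranker": True},
}


def load_profile(name: str) -> dict:
    return PROFILES[name]


print(load_profile("production_balanced"))
```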
🗺️ Architectural Overview
HyDRA's workflow is a dynamic loop of strategy, execution, and learning. The Meta-Planner creates a step, the Coordinator delegates it, and Executors act. The PostInteractionAnalyzer reviews completed sessions to update the MemoryAgent, creating a continuous cycle of improvement.
```mermaid
graph TD
    subgraph "Layer 1: Strategy (ReAct Loop)"
        A[User Query] --> Planner{Meta-Planner Agent};
        Planner -- "1. Devises Next Action" --> Plan([Single Sub-task]);
    end
    subgraph "Layer 2: Coordination & Memory"
        Plan --> Coord{Adaptive Coordinator};
        Coord -- "3. Get Context & Guidance" --> Memory[MemoryAgent];
        Memory -- "Personalization & Learned Policies" --> Coord;
    end
    subgraph "Layer 3: Execution & Tools"
        Executors((Executor Pool));
        VSA[AdvancedVectorSearchAgent];
        DSA["DeepSearchAgent (Web)"];
    end
    Coord -- "4. Delegate Sub-task" --> Executors;
    subgraph "Data & Knowledge Layer"
        Milvus[Unified Milvus Backend<br>- Hybrid Vectors<br>- Memory Store];
        Internet[Internet];
    end
    VSA --> Milvus;
    DSA --> Internet;
    subgraph "Synthesis & Learning"
        Executors -- "5. Result / Observation" --> Synth{Synthesis Agent};
        Synth -- "Appends to History & Planner" --> Planner;
        Synth --> FinalAnswer[Final Answer Stream];
        FinalAnswer -- "Full Transcript" --> Analyzer{Post-Interaction Analyzer};
        Analyzer -- "Learns & Updates" --> Memory;
    end
```
🛠️ Technical Deep Dive
- Hybrid Search: Combines semantic Vector Search (dense vectors for meaning) with keyword-based Lexical Search (sparse vectors for keywords) using the BGE-M3 model.
- Reciprocal Rank Fusion (RRF): Merges the dense and sparse search results efficiently within Milvus for a unified ranking.
- Reranking: A powerful BGE-Reranker cross-encoder model re-ranks the fused candidates for maximum contextual relevance, ensuring the most precise results are at the top.
- Vector Quantization: Supports database-level quantization (`HNSW_SQ8`, `IVF_RABITQ`) for scalable, cost-effective production deployments, configurable via profiles.
- Model Management: A central `ModelRegistry` ensures that large embedding and reranker models are loaded into memory only once, optimizing resource usage.
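The RRF step above can be illustrated in a few lines of Python. Milvus computes this internally when fusing dense and sparse results; the fused score of a document is the sum of 1/(k + rank) over every ranking it appears in, where k is a smoothing constant (60 is a common default):

```python
# Reciprocal Rank Fusion: merge several ranked lists into one by summing
# 1 / (k + rank) per list. Documents ranked highly in multiple lists win.

def rrf(rankings, k=60):
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores, key=scores.get, reverse=True)


dense_hits = ["doc_a", "doc_b", "doc_c"]   # from dense-vector search
sparse_hits = ["doc_b", "doc_d", "doc_a"]  # from sparse/lexical search
print(rrf([dense_hits, sparse_hits]))
# doc_b wins: rank 2 dense + rank 1 sparse beats doc_a's rank 1 + rank 3
```

Note that RRF only needs ranks, not raw scores, which is what makes it a clean way to fuse dense and sparse results whose similarity scales are incomparable.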
🚀 Installation & Setup
1. Prerequisites
- Python 3.10+
- A Google Gemini API Key.
- Docker and Docker Compose (for running Milvus).
2. Install Milvus Standalone (Recommended)
Choose the instructions for your operating system.
<details> <summary><b>🐧 For Linux & macOS</b></summary>

The quickest way to get started is with the official installation script.

```shell
# Download the script
curl -sfL https://raw.githubusercontent.com/milvus-io/milvus/master/scripts/standalone_embed.sh -o standalone_embed.sh

# Start Milvus and its dependencies
bash standalone_embed.sh start

# To stop the services later
# bash standalone_embed.sh down
```
</details>
<details>
<summary><b>❖ For Windows</b></summary>

On Windows, Milvus runs via Docker Desktop with WSL2.

- **Ensure Docker Desktop is installed** and running with the WSL2 backend enabled.
</details>