ERAG

Overview

You can use this application to:

Talk privately with your documents using Ollama or talk fast using Groq and others.
Perform Retrieval-Augmented Generation activities (RAG) using various APIs (Ollama, LLaMA, Groq, Gemini, Cohere).
Perform AI powered web search.
Talk with a specific url.
Analyze and summarize GitHub repositories.
Do AI powered Exploratory Data Analysis (EDA) with AI generated Business Intelligence and insights on excels and csv (see some examples in images below).
Utilize multiple AI models in collaboration (worker, supervisor, manager) for pre-defined complex tasks.
Generate specific knowledge entries (knol), or generate full size textbooks or use AI generated questions and answers to create datasets.

Thus, ERAG is an advanced system that combines lexical, semantic, text, and knowledge graph searches with conversation context to provide accurate and contextually relevant responses. This tool processes various document types, creates embeddings, builds knowledge graphs, and uses this information to answer user queries intelligently. It also includes modules for interacting with web content, GitHub repositories, performing exploratoru data analysis using various language models.

working on CPU only

tested on Windows 10

ERAG GUI 1 ERAG GUI 2 ERAG GUI 3

Key Features

Multi-modal Document Processing: Handles DOCX, PDF, TXT, and JSON files with intelligent chunking and table of contents extraction.
Advanced Embedding Generation: Creates and manages embeddings for efficient semantic search using sentence transformers, with support for batch processing and caching.
Knowledge Graph Creation: Builds and utilizes a knowledge graph for enhanced information retrieval using spaCy and NetworkX.
Multi-API Support: Integrates with Ollama, LLaMA, and Groq APIs for flexible language model deployment.
Retrieval-Augmented Generation (RAG): Combines retrieved context with language model capabilities for improved responses.
Web Content Processing: Implements real-time web crawling, content extraction, and summarization.
Query Routing: Intelligently routes queries to the most appropriate subsystem based on content relevance and query complexity.
Server Management: Provides a GUI for managing local LLaMA.cpp servers, including model selection and server configuration.
Customizable Settings: Offers a wide range of configurable parameters through a graphical user interface and a centralized settings management system.
Advanced Search Utilities: Implements lexical, semantic, graph-based, and text search methods with configurable weights and thresholds.
Conversation Context Management: Maintains and utilizes conversation history for more coherent and contextually relevant responses.
GitHub Repository Analysis: Provides tools for analyzing and summarizing GitHub repositories, including code analysis, dependency checking, and code smell detection.
Web Summarization: Offers capabilities to summarize web content based on user queries.
Interactive Model Chat: Allows direct interaction with various language models for general conversation and task completion.
Debug and Logging Capabilities: Provides comprehensive logging and debug information for system operations and search results.
Color-coded Console Output: Enhances user experience with color-coded console messages for different types of information.
Structured Data Analysis: Implements tools for analyzing structured data stored in SQLite databases, including value counts, grouped summary statistics, and advanced visualizations.
Exploratory Data Analysis (EDA): Offers comprehensive EDA capabilities, including distribution analysis, correlation studies, and outlier detection.
Advanced Data Visualization: Generates various types of plots and charts, such as histograms, box plots, scatter plots, and pair plots for in-depth data exploration.
Statistical Analysis: Provides tools for conducting statistical tests and generating statistical summaries of the data.
Multi-Model Collaboration: Utilizes worker, supervisor, and manager AI models to create, improve, and evaluate knowledge entries.
Iterative Knowledge Refinement: Implements an iterative process of knowledge creation, improvement, and evaluation to achieve high-quality, comprehensive knowledge entries.
Automated Quality Assessment: Includes an automated grading system for evaluating the quality of generated knowledge entries.
Structured Knowledge Format: Enforces a consistent, hierarchical structure for knowledge entries to ensure comprehensive coverage and easy navigation.
PDF Report Generation: Automatically generates comprehensive PDF reports summarizing the results of various analyses, including visualizations and AI-generated interpretations.

System Architecture

ERAG is composed of several interconnected components:

File Processing: Handles document upload and processing, including table of contents extraction.
Embedding Utilities: Manages the creation and retrieval of document embeddings.
Knowledge Graph: Creates and maintains a graph representation of document content and entity relationships.
RAG System: Implements the core retrieval-augmented generation functionality.
Query Router: Analyzes queries and routes them to the appropriate subsystem.
Server Manager: Handles the configuration and management of local LLaMA.cpp servers.
Settings Manager: Centralizes system configuration and provides easy customization options.
Search Utilities: Implements various search methods to retrieve relevant context for queries.
API Integration: Provides a unified interface for interacting with different language model APIs.
Talk2Model: Enables direct interaction with language models for general queries and tasks.
Talk2URL: Allows interaction with web content, including crawling and question-answering based on web pages.
WebRAG: Implements a web-based retrieval-augmented generation system for answering queries using internet content.
WebSum: Provides tools for summarizing web content based on user queries.
Talk2Git: Offers capabilities for analyzing and summarizing GitHub repositories.
Talk2SD: Implements tools for interacting with and analyzing structured data stored in SQLite databases.
Exploratory Data Analysis (EDA): Provides comprehensive EDA capabilities, including various statistical analyses and visualizations.
Advanced Exploratory Data Analysis: Offers more sophisticated data analysis techniques, including machine learning-based approaches and complex visualizations.
Self Knol Creator: Manages the process of creating, improving, and evaluating comprehensive knowledge entries on specific subjects.
Innovative Exploratory Data Analysis: while the individual analytical techniques are not particularly innovative on their own, the overall system's attempt to automate the entire process from data analysis to interpretation and reporting, using multiple AI models, represents a more innovative approach to data analysis automation. However, the true innovation and effectiveness of this system would depend heavily on the quality of the AI models used.

Installation

Clone the repository:

git clone https://github.com/EdwardDali/erag.git && cd erag

Install torch CPU only

pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --index-url https://download.pytorch.org/whl/cpu

Install required Python dependencies:
```
pip install -r requirements.txt
```

Download required spaCy and NLTK models:

python -m spacy download en_core_web_sm
python -m nltk.downloader punkt

Install Ollama (for using Ollama API and for embeddings) and install ollama models:
- Linux/macOS: curl https://ollama.ai/install.sh | sh
- Windows: Visit https://ollama.ai/download and follow installation instructions
- ollama run gemma2:2b
- ollama run chroma/all-minilm-l6-v2-f32:latest - for embedddings

Set up environment variables:

Create a .env file in the project root

Add the following variables (if applicable):

 GROQ_API_KEY='your_groq_api_key_here'
 GEMINI_API_KEY='your_gemini_api_key_here'
 CO_API_KEY='your_cohere_api_key_here'
 GITHUB_TOKEN='your_github_token_here'

Usage

Start the ERAG GUI:
```
python main.py
```
Use the GUI to:
- Upload and process documents
- Generate embeddings
- Create knowledge graphs
- Configure system settings
- Manage local LLaMA.cpp servers
- Run various RAG operations (Talk2Doc, WebRAG, etc.)
- Analyze structured data and perform exploratory data analysis
- Create and refine comprehensive knowledge entries (Self Knols)

Configuration

Customize ERAG's behavior through the Settings tab in the GUI or by modifying settings.py. Key configurable options include:

Chunk sizes and overlap for document processing
Embedding model selection and batch size
Knowledge graph parameters (similarity threshol

Erag

Install / Use

README