ARAGOG
ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research papers dataset. Includes modular code for easy experimentation and reusability.
Install / Use
/learn @predlico/ARAGOGREADME
ARAGOG - Advanced Retrieval Augmented Generation Output Grading :spider:
This repository contains the code, data, and analysis for our study [link later] on advanced Retrieval-Augmented Generation (RAG) techniques. It's part of our scientific paper investigating the efficacy of various RAG techniques in enhancing the precision and contextual relevance of LLMs.
Repository Structure
eval_questions/: Contains a JSON file with 107 QA pairs used in the evaluation.papers_for_questions/: Holds a collection of AI-ArXiv papers that were utilized for creating the 107 QA pairs.resources/: Includes essential resources like the prompt template and configuration files. Note: Actual config files need API keys and other settings to be filled out.main.py: The main script where experiments are defined and executed.res_analysis.ipynb: A Jupyter notebook for in-depth analysis of the final experimental results.utils.py: Helper functions supporting various operations within the repository.vector_db.py: Scripts for setting up different vector databases, such as Classic VDB, Sentence-window, and Document Summary.final_results.xlsx: Spreadsheet containing the final results from our experiments, shared for transparency and scientific verification.
Getting Started
To replicate our experiments or to analyze our results, please ensure to fill in the necessary API keys and other configurations by creating a .env file (see .sample.env) - the .env is ignored in .gitignore for security.
Setup the python environment using either venv or pyenv or your favourite python environment amanger. Call the environment aragog or anything you like.
python3 -m venv aragogand activate it usingsource venv/bin/activate(Mac/Linux) orvenv\Scripts\activate(Windows).- OR
pyenvwithpyenv virtualenv 3.12 aragog, then activate withpyenv local aragog.
Then run pip install -r requirements.txt to install all necessary dependencies.
Results examination
The res_analysis.ipynb notebook provides a detailed examination of the experimental results stored in final_results.xlsx.
Full replication
To set up vector databases for experiments, run the vector_db.py script. Subsequently, execute main.py to perform the experiments. Post-experimentation, use res_analysis.ipynb for analyzing the results. Helper functions in utils.py are employed across scripts to streamline processes.
Contribution
Contributions are welcome. For any changes or enhancements, please open an issue first to discuss what you would like to change.
License
This project is open-source and available under the MIT License.
Related Skills
best-practices-researcher
The most comprehensive Claude Code skills registry | Web Search: https://skills-registry-web.vercel.app
groundhog
399Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you can drive these tools harder (or perhaps make your own!).
codebase-to-course
Turn any codebase into a beautiful, interactive single-page HTML course that teaches how the code works to non-technical people. Use this skill whenever someone wants to create an interactive course, tutorial, or educational walkthrough from a codebase or project. Also trigger when users mention 'turn this into a course,' 'explain this codebase interactively,' 'teach this code,' 'interactive tutorial from code,' 'codebase walkthrough,' 'learn from this codebase,' or 'make a course from this project.' This skill produces a stunning, self-contained HTML file with scroll-based navigation, animated visualizations, embedded quizzes, and code-with-plain-English side-by-side translations.
academic-pptx
Use this skill whenever the user wants to create or improve a presentation for an academic context — conference papers, seminar talks, thesis defenses, grant briefings, lab meetings, invited lectures, or any presentation where the audience will evaluate reasoning and evidence. Triggers include: 'conference talk', 'seminar slides', 'thesis defense', 'research presentation', 'academic deck', 'academic presentation'. Also triggers when the user asks to 'make slides' in combination with academic content (e.g., 'make slides for my paper on X', 'create a presentation for my dissertation defense', 'build a deck for my grant proposal'). This skill governs CONTENT and STRUCTURE decisions. For the technical work of creating or editing the .pptx file itself, also read the pptx SKILL.md.
