Seerai
Zotero AI plugin and research assistant framework with MCP, agent capabilities, OCR, table extraction, Semantic Scholar, and Firecrawl/Tavily web search
seerai
<p align="center"> <img width="200" height="200" alt="logo" src="https://github.com/user-attachments/assets/26e6aa5b-4b70-464a-8198-6ec48544593d" /> </p>
<p align="center"> <a href="https://www.zotero.org"> <img src="https://img.shields.io/badge/Zotero-8.x-brightgreen" /> </a> <a href="https://opensource.org/licenses/MIT"> <img src="https://img.shields.io/badge/License-MIT-blue.svg" /> </a> <a href="https://github.com/dralkh/seerai/releases"> <img src="https://img.shields.io/badge/Version-1.6.0-blue" /> </a> </p>
<p align="center"> <img src="https://github.com/user-attachments/assets/90f68a0d-fec0-41e7-a988-f557c18bd150" width="600" /> </p>
<p align="center"> <img src="https://github.com/user-attachments/assets/b9e71c6b-d903-4a69-8461-c7314e52dcf5" width="600" /> </p>
<p align="center"> <img src="https://github.com/user-attachments/assets/fe543d48-eb68-47e4-b5f6-1d954e1597a2" width="600" /> </p>
<p align="center"> <a href="https://nano-gpt.com/r/RwCEN6fR"> <img src="https://ik.imagekit.io/nanogpt/og-image.png?tr=w-600" alt="Powered by NanoGPT" width="600" /> </a> </p>
<p align="center"> <b>Powered by <a href="https://nano-gpt.com/r/RwCEN6fR">NanoGPT</a></b>: Access 300+ AI models with flexible pay-as-you-go or subscription plans. </p>
<p align="center"> <a href="https://github.com/dralkh/seerai/releases/latest/download/seerai.xpi"> <img src="https://img.shields.io/badge/Install-Download%20.xpi-brightgreen?style=for-the-badge&logo=zotero" alt="Download seerai.xpi" /> </a> </p>
seerai is an intelligent research assistant plugin for Zotero 8 that integrates AI-powered chat, advanced search, and data extraction capabilities directly into your research workflow. Chat with your papers, extract structured data, and accelerate your literature review with a local-first, privacy-focused design.
Features
AI-Powered Chat Interface
- Contextual Conversations: Chat with AI about your selected papers with full context awareness.
- Smart Context Priority: Automatically prioritizes content sources:
- Zotero Notes (OCR notes and other notes; highest priority)
- Indexed PDF Text (fast and efficient, but consumes many tokens and may run into provider limits)
- OCR (Fallback for scanned documents with no indexed text)
- Multi-paper Support: Add multiple papers to a single conversation for comparative analysis.
- Streaming Responses: Real-time, token-by-token response rendering.
- Markdown & Math: Responses are formatted with syntax highlighting and LaTeX math support.
- Vision Support: Paste images directly into chat for multimodal analysis.
- Multimodal Generation: Generate images, videos, speech-to-text, and text-to-speech directly from chat.
- Attachments Upload: Add files to conversations via the context menu.
- Interactive Follow-ups: Deepen the conversation with streaming follow-up questions.
- Configurable Citations: Choose your preferred citation style for AI insights and chat.
- Smart Copy: Select and copy text with preserved formatting (Markdown) directly from chat bubbles.
- Enhanced Keybindings:
- Enter: Insert new line
- Shift+Enter: Send message
- Ctrl+Shift+S: Toggle/Focus detachable window
- Detachable Window: Pop out the seerai interface into a standalone, resizable floating window to keep chat accessible while navigating your library.
- Themed UI: Enhanced dialogs and components with full theme support for a consistent look across Zotero's Light and Dark modes.
- Responsive Layout: Chat, tables, and search tabs dynamically adapt to panel width changes.
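The Smart Context Priority order listed above (notes, then indexed PDF text, then OCR) can be sketched as a simple fallback chain. This is only an illustration: the type names, field names, and the `resolveContext` function are hypothetical and are not taken from seerai's source.

```typescript
// Hypothetical shapes for illustration; not seerai's actual API.
type ContextSource = "notes" | "indexedText" | "ocr";

interface PaperContent {
  notes: string[];            // Zotero notes, including any OCR note
  indexedText: string | null; // full text from the PDF index, if available
}

interface ResolvedContext {
  source: ContextSource;
  text: string | null; // null means OCR still has to be run
}

// Walk the priority list: notes first, then indexed PDF text, then OCR.
function resolveContext(paper: PaperContent): ResolvedContext {
  const notes = paper.notes
    .map((note) => note.trim())
    .filter((note) => note.length > 0);
  if (notes.length > 0) {
    return { source: "notes", text: notes.join("\n\n") };
  }
  if (paper.indexedText !== null && paper.indexedText.trim().length > 0) {
    return { source: "indexedText", text: paper.indexedText };
  }
  // Nothing usable yet: signal that the OCR fallback is needed.
  return { source: "ocr", text: null };
}
```

Under these assumptions, a scanned PDF with no notes and no indexed text resolves to `{ source: "ocr", text: null }`, which is the point at which an OCR service would be called.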
Semantic Search & Discovery
- RAG (Retrieval-Augmented Generation): Per-context embeddings with chunking, vector store, and semantic retrieval for large documents.
- Web Search: Integrated Firecrawl & Tavily support for finding high-quality full-text content.
- Semantic Scholar Agent: Advanced paper search with:
- Advanced Filters: Fine-tune results by Year, Venue, and Citation Count.
- AI Insights Config: Configure insight generation directly from the search panel.
- Smart Import:
- PDF Discovery: Automatically finds and attaches PDFs during import.
- Source Link: Fallback to source links if PDFs are unavailable.
- Status Indicators: Clear feedback on import status (⬇️ Importing, ✅ Imported, ⚠️ Failed).
- Global Search Scope: Searching now extends across all libraries, including personal and group collections.
- Advanced Boolean Search: Robust support for nested logic (AND/OR/NOT), implicit phrasing, and markdown-aware matching.
- Smart Regex matching: Improved search precision with intelligent handling of word boundaries and special characters.
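The RAG item above mentions chunking, a vector store, and semantic retrieval. The sketch below shows that general pattern in miniature. It is not seerai's implementation: all names are hypothetical, and a bag-of-words count vector stands in for a real embedding model so that the example runs without any network calls.

```typescript
// Illustrative only: a real setup would call an embedding model.
interface TextChunk {
  id: number;
  text: string;
  vector: Record<string, number>;
}

// Split text into word windows of `size` words that overlap by `overlap` words.
function chunkText(text: string, size: number, overlap: number): string[] {
  const words = text.split(/\s+/).filter((w) => w.length > 0);
  const step = Math.max(1, size - overlap);
  const chunks: string[] = [];
  for (let start = 0; start < words.length; start += step) {
    chunks.push(words.slice(start, start + size).join(" "));
    if (start + size >= words.length) break;
  }
  return chunks;
}

// Stand-in "embedding": lowercase token counts.
function embed(text: string): Record<string, number> {
  const vector: Record<string, number> = {};
  const tokens = text.toLowerCase().match(/[a-z0-9]+/g) || [];
  for (const token of tokens) {
    vector[token] = (vector[token] || 0) + 1;
  }
  return vector;
}

function cosine(a: Record<string, number>, b: Record<string, number>): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (const key of Object.keys(a)) {
    dot += a[key] * (b[key] || 0);
    normA += a[key] * a[key];
  }
  for (const key of Object.keys(b)) {
    normB += b[key] * b[key];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

// The "vector store" here is just an in-memory array of embedded chunks.
function buildStore(text: string, size: number, overlap: number): TextChunk[] {
  return chunkText(text, size, overlap).map((chunk, id) => ({
    id,
    text: chunk,
    vector: embed(chunk),
  }));
}

// Return the topK chunks most similar to the query.
function retrieve(store: TextChunk[], query: string, topK: number): TextChunk[] {
  const queryVector = embed(query);
  return store
    .map((chunk) => ({ chunk, score: cosine(queryVector, chunk.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((entry) => entry.chunk);
}
```

Only the retrieved chunks, rather than the whole document, would then be placed in the model's context, which is what keeps token usage bounded for large documents.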
Agentic Chat & Tool Use
- Autonomous Agents: AI can now use tools to interact with your Zotero library and the web.
- Rich Tool Suite:
- Search Tool: Search through your library with advanced filters.
- Collection Tool: Manage collections and move items.
- Note Tools: Read, create, and edit existing item notes for seamless research updates.
- Tag Tool: Automatically generate and apply relevant tags to your research.
- Read Tool: Extract text from PDFs and items for deep analysis.
- Citation Tool: Generate citations and bibliographies.
- Table Tool: Interact with and generate data for your Paper Tables.
- Web Tool: Search the web and fetch content using Firecrawl or Tavily.
- Advanced Orchestration: Improved tool calling logic and iteration tracking for more robust agent performance.
- Model-as-a-Tool: Seamless integration with LLM tool-calling capabilities.
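The tool-calling and iteration-tracking behaviour described above follows a common agent-loop pattern, sketched below. Every type and function name here is hypothetical, and the model is a plain synchronous function so that the loop can be read and tested in isolation; a real agent would call an LLM asynchronously.

```typescript
// Hypothetical types for illustration; seerai's real tool interfaces may differ.
type ToolCall = { tool: string; args: Record<string, string> };
type ModelStep =
  | { kind: "tool"; call: ToolCall }
  | { kind: "final"; answer: string };
type AgentTool = (args: Record<string, string>) => string;
type AgentModel = (transcript: string[]) => ModelStep;

interface AgentResult {
  answer: string;
  iterations: number;
}

// Ask the model for a step, run any requested tool, feed the result back,
// and stop once the model answers or the iteration budget is used up.
function runAgent(
  model: AgentModel,
  tools: Record<string, AgentTool>,
  question: string,
  maxIterations: number
): AgentResult {
  const transcript: string[] = ["user: " + question];
  for (let i = 1; i <= maxIterations; i++) {
    const step = model(transcript);
    if (step.kind === "final") {
      return { answer: step.answer, iterations: i };
    }
    const tool: AgentTool | undefined = tools[step.call.tool];
    const result =
      tool !== undefined
        ? tool(step.call.args)
        : 'error: unknown tool "' + step.call.tool + '"';
    transcript.push("tool " + step.call.tool + ": " + result);
  }
  return { answer: "Stopped: iteration limit reached.", iterations: maxIterations };
}
```

Reporting unknown tools back to the model as an error string, rather than throwing, gives the model a chance to recover on the next iteration. The iteration cap guards against a model that never produces a final answer.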
Paper Tables
- Structured Extraction: Extract specific data points from multiple papers into a comparative table.
- AI-Powered Columns: Define custom columns with AI prompts (e.g., "Methodology", "Sample Size").
- Inline Editing: Edit column titles and prompts in place.
- One-Click Generation: Generate data for individual cells or entire columns instantly.
- Bulk Actions: Regenerate content or add selected papers to collections in bulk.
- Side Strip Actions: Unified controls for adding and removing columns, triggering generation, and opening settings.
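One way to picture AI-powered columns is as a grid in which each cell is produced by combining a column's prompt with one paper's text. The sketch below builds those per-cell prompts and lists the cells that still need generating. The data shapes, the prompt wording, and the `paperKey::columnTitle` cell key are all assumptions made for this example, not seerai's actual data model.

```typescript
// Hypothetical shapes for illustration; not seerai's actual data model.
interface TableColumn {
  title: string;  // e.g. "Sample Size"
  prompt: string; // instruction sent to the model for this column
}

interface TablePaper {
  key: string;
  title: string;
  text: string;
}

interface CellJob {
  paperKey: string;
  columnTitle: string;
  prompt: string;
}

// Combine a column's prompt with one paper's text into a single request.
function buildCellPrompt(column: TableColumn, paper: TablePaper): string {
  return [
    "Paper: " + paper.title,
    "Task (" + column.title + "): " + column.prompt,
    "Answer concisely using only the text below.",
    "---",
    paper.text,
  ].join("\n");
}

// List the cells that still need generating. Keys in `filled` use the form
// "<paperKey>::<columnTitle>" (an arbitrary convention for this sketch).
function pendingCells(
  papers: TablePaper[],
  columns: TableColumn[],
  filled: Record<string, string>
): CellJob[] {
  const jobs: CellJob[] = [];
  for (const paper of papers) {
    for (const column of columns) {
      const cellKey = paper.key + "::" + column.title;
      if (filled[cellKey] === undefined) {
        jobs.push({
          paperKey: paper.key,
          columnTitle: column.title,
          prompt: buildCellPrompt(column, paper),
        });
      }
    }
  }
  return jobs;
}
```

In this picture, generating a single cell means running one job, and generating a whole column means running every pending job that shares a `columnTitle`.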
OCR & Text Extraction
- Flexible OCR Options:
- Mistral OCR: High-quality cloud OCR (Recommended).
- DataLab.to: Reliable cloud-based extraction.
- Local Marker: Run your own local OCR server for free, private processing.
- Auto-Processing: Automatically processes unindexed PDFs when needed.
Customizable AI
- Persistent API Keys: API keys are saved and persisted across all configured providers.
- Model Presets: Pre-configured settings for popular providers:
- NanoGPT (300+ models, pay-as-you-go or subscription)
- OpenAI (GPT-5, o3)
- Anthropic (Claude Sonnet 4.5)
- Google (Gemini 3 Pro)
- DeepSeek, Mistral, Groq, OpenRouter
- Local Models (OpenAI-compatible endpoint, Ollama, LM Studio)
- 12-16 GB VRAM: Qwen3-4B-Thinking-2507
- 24-32 GB VRAM: gpt-oss-20b
- 48-64 GB VRAM: QwQ-32B
- 96-128 GB VRAM: Qwen3-Next-80B-A3B-Instruct
- Smart Rate Limiting: Per-model configuration for concurrency, RPM, and TPM to prevent provider errors.
- Per-Conversation Models: Switch models dynamically based on the task complexity.
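The three per-model limits named under Smart Rate Limiting (concurrency, RPM, and TPM) can be enforced together with a sliding one-minute window. The class below is a minimal sketch of that idea. The class name, field names, and the choice of a sliding window are assumptions for illustration, and how seerai actually tracks these limits is not documented here.

```typescript
// Hypothetical limiter for illustration; not seerai's actual implementation.
interface RateLimits {
  maxConcurrent: number; // requests allowed in flight at once
  rpm: number;           // requests per minute
  tpm: number;           // tokens per minute
}

class RateLimiter {
  private limits: RateLimits;
  private inFlight = 0;
  private events: { time: number; tokens: number }[] = [];

  constructor(limits: RateLimits) {
    this.limits = limits;
  }

  // Returns true and records the request if it fits within all three limits.
  // `now` is a timestamp in milliseconds, passed in to keep the logic testable.
  tryAcquire(tokens: number, now: number): boolean {
    // Keep only the events from the last 60 seconds.
    this.events = this.events.filter((e) => now - e.time < 60000);
    const usedTokens = this.events.reduce((sum, e) => sum + e.tokens, 0);
    if (this.inFlight >= this.limits.maxConcurrent) return false;
    if (this.events.length >= this.limits.rpm) return false;
    if (usedTokens + tokens > this.limits.tpm) return false;
    this.inFlight++;
    this.events.push({ time: now, tokens });
    return true;
  }

  // Call when a request finishes, whether it succeeded or failed.
  release(): void {
    if (this.inFlight > 0) this.inFlight--;
  }
}
```

A caller would check `tryAcquire(estimatedTokens, Date.now())` before sending a request, retry later when it returns `false`, and call `release()` once the response completes.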
Installation
From GitHub (Recommended)
- Download the latest release (.xpi file) from Releases.
- In Zotero, go to Tools → Add-ons.
- Click the gear icon ⚙️ and select Install Add-on From File....
- Select the downloaded .xpi file.
- Restart Zotero.
From Source
# Clone the repository
git clone https://github.com/dralkh/seerai.git
cd seerai
# Install dependencies
npm install
# Build the plugin
npm run build
# The .xpi file will be generated in the root directory
Configuration
Go to Zotero → Settings → seerai to configure your AI providers and services.
1. AI Models
Use the Add Configuration button to set up your AI models.
- NanoGPT (Default): Pre-selected when adding a new model. Access 300+ AI models with flexible pricing — pay-as-you-go or subscription plans. Supports Apple Pay, Google Pay, credit card, and crypto. The model list is fetched automatically.
- Presets: Select from built-in presets (OpenAI, Anthropic, Ollama, etc.) for quick setup.
- Custom: Manually configure API URL, Key, and Model ID for any OpenAI-compatible provider.
- Default: Set a preferred model as your default for new conversations.
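A Custom configuration consists of an API URL, a Key, and a Model ID. For OpenAI-compatible providers, those three values are enough to form a chat completions request. The sketch below only builds the request, without sending it. The `ModelConfig` shape and `buildChatRequest` name are hypothetical, and it assumes the API URL is the provider's base URL (for example one ending in `/v1`).

```typescript
// Hypothetical config shape for illustration; field names are assumptions.
interface ModelConfig {
  apiUrl: string;  // base URL of an OpenAI-compatible API
  apiKey: string;
  modelId: string;
}

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Build (but do not send) a chat completions request for the given config.
function buildChatRequest(
  config: ModelConfig,
  messages: ChatMessage[],
  stream: boolean
): PreparedRequest {
  // Drop trailing slashes so the joined path has exactly one separator.
  const base = config.apiUrl.replace(/\/+$/, "");
  return {
    url: base + "/chat/completions",
    headers: {
      "Content-Type": "application/json",
      Authorization: "Bearer " + config.apiKey,
    },
    body: JSON.stringify({ model: config.modelId, messages, stream }),
  };
}

// Example values only: a local Ollama server exposing its OpenAI-compatible API.
const exampleRequest = buildChatRequest(
  { apiUrl: "http://localhost:11434/v1/", apiKey: "not-needed-locally", modelId: "llama3" },
  [{ role: "user", content: "Summarize this abstract in one sentence." }],
  true
);
```

The prepared request could then be sent with `fetch(exampleRequest.url, { method: "POST", headers: exampleRequest.headers, body: exampleRequest.body })`. Setting `stream` to `true` asks the provider for incremental chunks, which is what token-by-token rendering relies on.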
2. OCR Services
Choose your preferred text extraction engine:
- Mistral OCR: Requires Mistral API Key. Best for accuracy.
- Cloud (DataLab.to): Requires DataLab API Key.
- Local Marker Server: Requires running a local Python server.
- URL: http://localhost:8001 (Default)
- See Marker Project for setup.
3. Search Integrations
- Semantic Scholar: Add your API Key for higher rate limits and faster searches.
- Firecrawl: Add API Key to enable deep web search capabilities. A local instance can also be used (GitHub).
- Tavily: Add API Key for optimized search results tailored for AI agents.
4. MCP Server & API
seerai now includes a Model Context Protocol (MCP) server and a local API for external integrations.
- MCP Server: Located in mcp-server/. Allows external LLMs (like Claude Desktop) to interact with your Zotero library. See the MCP Setup Guide for configuration instructions.
