
seerai

seerai logo

seerai is an intelligent research assistant plugin for Zotero 7 that integrates AI-powered chat, advanced search, and data extraction capabilities directly into your research workflow. Chat with your papers, extract structured data, and accelerate your literature review with a local-first, privacy-focused design.


Features

AI-Powered Chat Interface

  • Contextual Conversations: Chat with AI about your selected papers with full context awareness.
  • Smart Context Priority: Automatically prioritizes content sources:
    1. Zotero Notes (OCR note and other notes; highest priority)
    2. Indexed PDF Text (fast and efficient, but consumes many tokens and can hit context limits)
    3. OCR (fallback for scanned documents with no indexed text)
  • Multi-paper Support: Add multiple papers to a single conversation for comparative analysis.
  • Streaming Responses: Real-time, token-by-token response rendering.
  • Markdown & Math: Responses are formatted with syntax highlighting and LaTeX math support.
  • Vision Support: Paste images directly into chat for multimodal analysis.
  • Enhanced Keybindings:
    • Enter: Insert new line
    • Shift+Enter: Send message
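
The context-priority cascade above can be sketched roughly as follows. All type and function names here are illustrative, not seerai's actual API:

```typescript
// Hypothetical sketch of the context-source priority cascade:
// Zotero notes first, then indexed PDF text, then OCR as a fallback.

type PaperContext = {
  notes?: string;        // Zotero notes (OCR note and others): highest priority
  indexedText?: string;  // indexed PDF full text: fast, but token-heavy
};

type ContextSource =
  | { kind: "notes"; text: string }
  | { kind: "indexed-pdf"; text: string }
  | { kind: "ocr" }; // fallback: run OCR on the attachment

function pickContextSource(paper: PaperContext): ContextSource {
  if (paper.notes && paper.notes.trim().length > 0) {
    return { kind: "notes", text: paper.notes };
  }
  if (paper.indexedText && paper.indexedText.trim().length > 0) {
    return { kind: "indexed-pdf", text: paper.indexedText };
  }
  return { kind: "ocr" };
}
```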

Semantic Search & Discovery

  • Web Search: Integrated Firecrawl support for finding full-text content.
  • Semantic Scholar Agent: Advanced paper search with filtering (Year, Venue, Citation Count).
  • Smart Import:
    • PDF Discovery: Automatically finds and attaches PDFs during import.
    • Source Link: Fallback to source links if PDFs are unavailable.
    • Status Indicators: Clear feedback on import status (⬇️ Importing, ✅ Imported, ⚠️ Failed).

Papers Tables

  • Structured Extraction: Extract specific data points from multiple papers into a comparative table.
  • AI-Powered Columns: Define custom columns with AI prompts (e.g., "Methodology", "Sample Size").
  • Inline Editing: Edit column titles and prompts directly in the table.
  • One-Click Generation: Generate data for individual cells or entire columns instantly.
  • Side Strip Actions: Unified controls for adding and removing columns, triggering generation, and settings.
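
A minimal sketch of the data model behind such a table, with hypothetical names (`TableColumn`, `askModel`, etc. are assumptions for illustration, not seerai's real schema):

```typescript
// Hypothetical papers-table data model: each column carries an AI prompt,
// and each cell is filled by running that prompt against one paper's text.

interface TableColumn {
  title: string;   // e.g. "Sample Size"
  prompt: string;  // AI prompt run against each paper's text
}

interface TableRow {
  paperKey: string;               // Zotero item key
  cells: Record<string, string>;  // column title -> extracted value
}

// Combine the column prompt with the paper text into a single model prompt.
function buildCellPrompt(column: TableColumn, paperText: string): string {
  return `${column.prompt}\n\n---\n${paperText}`;
}

// Fill one cell; `askModel` stands in for whatever LLM client is configured.
async function generateCell(
  row: TableRow,
  column: TableColumn,
  paperText: string,
  askModel: (prompt: string) => Promise<string>,
): Promise<void> {
  const answer = await askModel(buildCellPrompt(column, paperText));
  row.cells[column.title] = answer.trim();
}
```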

OCR & Text Extraction

  • Flexible OCR Options:
    • Mistral OCR: High-quality cloud OCR (Recommended).
    • DataLab.to: Reliable cloud-based extraction.
    • Local Marker: Run your own local OCR server for free, private processing.
  • Auto-Processing: Automatically processes unindexed PDFs when needed.

Customizable AI

  • Model Presets: Pre-configured settings for popular providers:
    • OpenAI (GPT-5, o3)
    • Anthropic (Claude Sonnet 4.5)
    • Google (Gemini 3 Pro)
    • DeepSeek, Mistral, Groq, OpenRouter
    • Local Models (any OpenAI-compatible endpoint, e.g. Ollama, LM Studio)
      • 12-16 GB VRAM: Qwen3-4B-Thinking-2507
      • 24-32 GB VRAM: gpt-oss-20b
      • 48-64 GB VRAM: QwQ-32B
      • 96-128 GB VRAM: Qwen3-Next-80B-A3B-Instruct
  • Per-Conversation Models: Switch models dynamically based on the task complexity.

Installation

From GitHub (Recommended)

  1. Download the latest release (.xpi file) from Releases.
  2. In Zotero, go to Tools → Add-ons.
  3. Click the gear icon ⚙️ and select Install Add-on From File....
  4. Select the downloaded .xpi file.
  5. Restart Zotero.

From Source

# Clone the repository
git clone https://github.com/dralkh/seerai.git
cd seerai

# Install dependencies
npm install

# Build the plugin
npm run build

# The .xpi file will be generated in the root directory

Configuration

Go to Zotero → Settings → seerai to configure your AI providers and services.

1. AI Models

Use the Add Configuration button to set up your AI models.

  • Presets: Select from built-in presets (OpenAI, Anthropic, Ollama, etc.) for quick setup.
  • Custom: Manually configure API URL, Key, and Model ID for any OpenAI-compatible provider.
  • Default: Set a preferred model as your default for new conversations.
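
As a sketch, a custom entry for a local OpenAI-compatible server might hold values like the following. The field names are assumptions for illustration; the actual settings dialog may differ (the Ollama base URL shown is its standard OpenAI-compatible endpoint):

```typescript
// Illustrative shape of a custom model configuration for an
// OpenAI-compatible provider; field names are assumptions, not seerai's schema.

interface ModelConfig {
  name: string;      // display name in the model picker
  apiUrl: string;    // OpenAI-compatible base URL
  apiKey: string;    // provider API key (often empty for local servers)
  modelId: string;   // model identifier sent in requests
  isDefault: boolean;
}

const localOllama: ModelConfig = {
  name: "Local Ollama",
  apiUrl: "http://localhost:11434/v1", // Ollama's OpenAI-compatible endpoint
  apiKey: "",                          // local servers often need no key
  modelId: "qwen3:4b",                 // example model tag
  isDefault: true,
};
```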

2. OCR Services

Choose your preferred text extraction engine:

  • Mistral OCR: Requires Mistral API Key. Best for accuracy.
  • Cloud (DataLab.to): Requires DataLab API Key.
  • Local Marker Server: Requires running a local Python server.

3. Search Integrations

  • Semantic Scholar: Add your API Key for higher rate limits and faster searches.
  • Firecrawl: Add an API key to enable deep web search; a self-hosted local instance is also supported (GitHub).

Usage Guide

Chatting with Papers

  1. Select a paper (or multiple) in your library.
  2. Open the SeerAI sidebar tab.
  3. (Optional) Customize context inclusions via the settings icon (Abstracts, Notes).
  4. Type your question or use templates from the Prompt Library (Book icon).

Creating Data Tables

  1. Open the Tables tab in the main view.
  2. Click + on the side strip to add a new column.
  3. Define the column header and AI prompt (e.g., "What is the sample size?").
  4. Drag and drop papers into the table.
  5. Click Generate on cells to extract data.

Prompt Library

  • Access via the Book Icon 📖 in chat.
  • Use built-in templates (Summarize, Critique, Compare).
  • Create custom templates with placeholders:
    • !: Saved Prompts
    • /: Papers
    • ^: Folders
    • ~: Tags
    • @: Authors
    • #: Topics
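
The trigger characters above could be scanned out of a template roughly like this. The triggers match the list, but the parsing details (token syntax, function names) are illustrative only:

```typescript
// Hypothetical sketch of scanning placeholder tokens such as "@Smith" or
// "#transformers" out of a prompt template. Trigger characters follow the
// list above; everything else is an assumption for illustration.

type PlaceholderKind =
  | "saved-prompt" | "paper" | "folder" | "tag" | "author" | "topic";

const TRIGGERS: Record<string, PlaceholderKind> = {
  "!": "saved-prompt",
  "/": "paper",
  "^": "folder",
  "~": "tag",
  "@": "author",
  "#": "topic",
};

function findPlaceholders(template: string): { kind: PlaceholderKind; value: string }[] {
  const out: { kind: PlaceholderKind; value: string }[] = [];
  // A trigger character followed by a word-like token.
  const re = /([!\/^~@#])([\w-]+)/g;
  for (const m of template.matchAll(re)) {
    out.push({ kind: TRIGGERS[m[1]], value: m[2] });
  }
  return out;
}
```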

Future Ideas

Several advanced features are proposed to enhance seerai's capabilities. These are currently on the idea board only.

1. Advanced Search Capabilities

Enhanced search functionality to help users find relevant literature more effectively.

  • Autocomplete: Intelligent suggestions for tags, creators, and collections as you type.
  • Complex Queries: Support for boolean logic (AND/OR) and nested search conditions (e.g., "Title contains X AND Year > 2020").
  • Field-Specific Search: Dedicated filters for titles, authors, years, and tags.
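
One way to represent and evaluate such nested boolean conditions is sketched below. Since this feature is only on the idea board, every name and type here is hypothetical:

```typescript
// Hypothetical representation of a nested boolean search condition,
// covering the example "Title contains X AND Year > 2020" from above.

type Condition =
  | { field: "title" | "author" | "year" | "tag"; op: "contains" | "gt"; value: string | number }
  | { and: Condition[] }
  | { or: Condition[] };

interface Item { title: string; author: string; year: number; tags: string[] }

function matches(item: Item, c: Condition): boolean {
  if ("and" in c) return c.and.every((sub) => matches(item, sub));
  if ("or" in c) return c.or.some((sub) => matches(item, sub));
  switch (c.field) {
    case "year":
      return c.op === "gt"
        ? item.year > Number(c.value)
        : String(item.year).includes(String(c.value));
    case "tag":
      return item.tags.some((t) => t.toLowerCase().includes(String(c.value).toLowerCase()));
    default: // title or author: case-insensitive substring match
      return item[c.field].toLowerCase().includes(String(c.value).toLowerCase());
  }
}

// "Title contains X AND Year > 2020":
const query: Condition = {
  and: [
    { field: "title", op: "contains", value: "x" },
    { field: "year", op: "gt", value: 2020 },
  ],
};
```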

2. Semantic Vector Search

  • Voice, Transcription & Embedding Integration: Support for OpenAI-compatible embedding, voice, and transcription models (e.g., text-embedding-3-small, local Ollama embeddings).
  • Contextual Retrieval: Find papers based on conceptual similarity rather than just exact text matches.
  • In-Memory Vector Store: Fast, local indexing of session-relevant papers for semantic analysis.
  • RAG Fallback: Switch to retrieval-augmented generation once context usage reaches 80% of the model's context size.
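
The in-memory store idea can be sketched with plain cosine similarity. Embeddings would come from an OpenAI-compatible embeddings endpoint (not shown); class and method names are hypothetical:

```typescript
// Minimal sketch of an in-memory vector store ranked by cosine similarity.

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

class InMemoryVectorStore {
  private entries: { key: string; vector: number[] }[] = [];

  add(key: string, vector: number[]): void {
    this.entries.push({ key, vector });
  }

  // Return the top-k paper keys most similar to the query vector.
  topK(query: number[], k: number): string[] {
    return [...this.entries]
      .sort((x, y) => cosine(query, y.vector) - cosine(query, x.vector))
      .slice(0, k)
      .map((e) => e.key);
  }
}
```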

3. Data Verification & Quality Control

  • Verifier Button: One-click verification to check all extracted data against the source text.
  • Confidence Scores: AI-generated confidence ratings for each extracted data point.
  • Source Highlighting: Click a cell to see the exact passage in the paper where the data came from.

4. Firecrawl API Integration

  • URL Discovery: Use the Firecrawl API for PDF discovery.

Others

  • Citation referencing within tables and chat during generation.
  • MCP Connectors UI revamp.


Development

Prerequisites

  • Node.js 18+
  • Zotero 7

Project Structure

The codebase follows a modular architecture:

seerai/
├── addon/                 # Zotero integration files (XUL/XHTML)
├── src/
│   ├── modules/           # Core feature modules
│   │   ├── chat/          # Chat engine & state
│   │   ├── assistant.ts   # Main assistant logic
│   │   ├── firecrawl.ts   # Firecrawl integration
│   │   ├── ocr.ts         # OCR implementation
│   │   ├── openai.ts      # LLM client implementation
│   │   ├── semanticScholar.ts # Semantic Scholar integration
│   │   └── preferenceScript.ts # Preferences logic
│   ├── utils/             # Utility functions
│   └── hooks.ts           # Zotero event listeners
└── package.json

Commands

npm start       # Start dev server with hot reload
npm run build   # Build for production
npm run lint:fix # Fix code style issues

Contributing

Contributions are welcome!

  1. Fork the repo.
  2. Create a feature branch (git checkout -b feature/MyFeature).
  3. Commit changes (git commit -m 'Add MyFeature').
  4. Push to branch (git push origin feature/MyFeature).
  5. Open a Pull Request.

License

MIT License - see LICENSE for details.

Acknowledgments