A unified Python library for interacting with multiple Large Language Model (LLM) providers.
Write once, run everywhere.
```bash
pip install abstractcore[all]
```

```python
from abstractcore import create_llm
# Works with any provider - just change the provider name
llm = create_llm("anthropic", model="claude-3-5-haiku-latest")
response = llm.generate("What is the capital of France?")
print(response.content)
```

```python
from abstractcore import create_llm
# Deterministic outputs with seed + temperature=0
llm = create_llm("openai", model="gpt-3.5-turbo", seed=42, temperature=0.0)
# These will produce identical outputs
response1 = llm.generate("Write exactly 3 words about coding")
response2 = llm.generate("Write exactly 3 words about coding")
print(f"Response 1: {response1.content}") # "Innovative, challenging, rewarding."
print(f"Response 2: {response2.content}") # "Innovative, challenging, rewarding."from abstractcore import create_llm, tool
@tool
def get_current_weather(city: str):
    """Fetch current weather for a given city."""
    return f"Weather in {city}: 72°F, Sunny"
llm = create_llm("openai", model="gpt-4o-mini")
response = llm.generate(
    "What's the weather like in San Francisco?",
    tools=[get_current_weather]
)
print(response.content)
```

Every LLM generation returns a GenerateResponse object with a consistent structure across all providers:
```python
from abstractcore import create_llm
llm = create_llm("openai", model="gpt-4o-mini")
response = llm.generate("Explain quantum computing in simple terms")
# Core response data
print(f"Content: {response.content}") # Generated text
print(f"Model: {response.model}") # Model used
print(f"Finish reason: {response.finish_reason}") # Why generation stopped
# Consistent token access across ALL providers (NEW in v2.4.7)
print(f"Input tokens: {response.input_tokens}") # Always available
print(f"Output tokens: {response.output_tokens}") # Always available
print(f"Total tokens: {response.total_tokens}") # Always available
# Generation time tracking (NEW in v2.4.7)
print(f"Generation time: {response.gen_time}ms") # Always available (rounded to 1 decimal)
# Advanced access
print(f"Tool calls: {response.tool_calls}") # Tools executed (if any)
print(f"Raw usage: {response.usage}") # Provider-specific token data
print(f"Metadata: {response.metadata}") # Additional context
# Comprehensive summary
print(f"Summary: {response.get_summary()}") # "Model: gpt-4o-mini | Tokens: 117 | Time: 1234.5ms"Token Count Sources:
- Provider APIs: OpenAI, Anthropic, LMStudio (native API token counts)
- AbstractCore Calculation: MLX, HuggingFace (using `token_utils.py`)
- Mixed Sources: Ollama (combination of provider and calculated tokens)
Backward Compatibility: Legacy `prompt_tokens` and `completion_tokens` keys remain available in the `response.usage` dictionary.
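A quick sketch of the two access styles, assuming `response.usage` behaves like a plain dictionary as the note above implies (provider and model are just examples):

```python
from abstractcore import create_llm

llm = create_llm("openai", model="gpt-4o-mini")
response = llm.generate("Hello!")

# Unified fields - same names for every provider
print(response.input_tokens, response.output_tokens, response.total_tokens)

# Legacy keys still present in the provider-specific usage dictionary
print(response.usage.get("prompt_tokens"))      # mirrors response.input_tokens
print(response.usage.get("completion_tokens"))  # mirrors response.output_tokens
```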
AbstractCore includes a comprehensive set of ready-to-use tools for common tasks:
```python
from abstractcore.tools.common_tools import fetch_url, search_files, read_file
# Intelligent web content fetching with automatic parsing
result = fetch_url("https://api.github.com/repos/python/cpython")
# Automatically detects JSON, HTML, images, PDFs, etc. and provides structured analysis
# File system operations
files = search_files("def.*fetch", ".", file_pattern="*.py") # Find function definitions
content = read_file("config.json") # Read file contents
# Use with any LLM
llm = create_llm("anthropic", model="claude-3-5-haiku-latest")
response = llm.generate(
    "Analyze this API response and summarize the key information",
    tools=[fetch_url]
)
```

Available Tools:
- `fetch_url` - Intelligent web content fetching with automatic content type detection and parsing
- `search_files` - Search for text patterns inside files using regex
- `list_files` - Find and list files by names/paths using glob patterns
- `read_file` - Read file contents with optional line range selection
- `write_file` - Write content to files with directory creation
- `edit_file` - Edit files using pattern matching and replacement
- `web_search` - Search the web using DuckDuckGo
- `execute_command` - Execute shell commands safely with security controls
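The remaining tools plug in the same way as `fetch_url` above; for example, a minimal sketch combining the file tools (the prompt and project layout are illustrative):

```python
from abstractcore import create_llm
from abstractcore.tools.common_tools import list_files, read_file

llm = create_llm("openai", model="gpt-4o-mini")
response = llm.generate(
    "Find the Python files in this project and summarize what the largest one does",
    tools=[list_files, read_file]
)
print(response.content)
```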
```python
from abstractcore import BasicSession, create_llm
# Create a persistent conversation session
llm = create_llm("openai", model="gpt-4o-mini")
session = BasicSession(llm, system_prompt="You are a helpful assistant.")
# Add messages with metadata
session.add_message('user', 'Hello!', name='alice', location='Paris')
response = session.generate('What is Python?', name='bob')
# Save complete conversation with optional analytics
session.save('conversation.json') # Basic save
session.save('analyzed.json', summary=True, assessment=True, facts=True) # With analytics
# Load and continue conversation
loaded_session = BasicSession.load('conversation.json', provider=llm)
```

AbstractCore provides unified media handling across all providers with automatic resolution optimization. Upload images, PDFs, and documents using the same simple API regardless of your provider.
```python
from abstractcore import create_llm
# Vision analysis - works with any vision model
# Images automatically processed at maximum supported resolution
llm = create_llm("openai", model="gpt-4o")
response = llm.generate(
    "What's in this image?",
    media=["photo.jpg"]  # Auto-resized to model's maximum capability
)

# Document analysis - works with any model
llm = create_llm("anthropic", model="claude-3.5-sonnet")
response = llm.generate(
    "Summarize this research paper",
    media=["research_paper.pdf"]
)

# Multiple files - mix images, PDFs, spreadsheets
response = llm.generate(
    "Analyze these business documents",
    media=["report.pdf", "chart.png", "data.xlsx"]
)

# Same code works with local models
llm = create_llm("ollama", model="qwen3-vl:8b")
response = llm.generate(
    "Describe this screenshot",
    media=["screenshot.png"]  # Auto-optimized for qwen3-vl
)
```

Key Features:
- Smart Resolution: Automatically uses maximum resolution supported by each model
- Format Support: PNG, JPEG, GIF, WEBP, BMP, TIFF images; PDF, TXT, MD, CSV, TSV, JSON documents
- Office Documents: DOCX, XLSX, PPT (with `pip install abstractcore[all]`)
- Vision Optimization: Model-specific image processing for best vision results
Provider compatibility:
- High-resolution vision: GPT-4o (up to 4096x4096), Claude 3.5 Sonnet (up to 1568x1568)
- Local models: qwen3-vl (up to 3584x3584), gemma3:4b, llama3.2-vision
- All models: Automatic text extraction for non-vision models
Learn more about Media Handling
- Provider Agnostic: Seamlessly switch between OpenAI, Anthropic, Ollama, LMStudio, MLX, HuggingFace
- Centralized Configuration: Global defaults and app-specific preferences at `~/.abstractcore/config/abstractcore.json`
- Intelligent Media Handling: Upload images, PDFs, and documents with automatic maximum resolution optimization
- Vision Model Support: Smart image processing at each model's maximum capability
- Document Processing: PDF extraction (PyMuPDF4LLM), Office documents (DOCX/XLSX/PPT), CSV/TSV analysis
- Unified Tools: Consistent tool calling across all providers
- Session Management: Persistent conversations with metadata, analytics, and complete serialization
- Native Structured Output: Server-side schema enforcement for Ollama and LMStudio (OpenAI and Anthropic also supported)
- Streaming Support: Real-time token generation for interactive experiences (see the sketch after this list)
- Consistent Token Terminology: Unified `input_tokens`, `output_tokens`, `total_tokens` across all providers
- Embeddings: Built-in support for semantic search and RAG applications
- Universal Server: Optional OpenAI-compatible API server with `/v1/responses` endpoint
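A minimal streaming sketch; note that the `stream=True` flag and per-chunk `content` field are assumptions for illustration, not confirmed AbstractCore API - see the documentation links below for the exact interface:

```python
from abstractcore import create_llm

llm = create_llm("openai", model="gpt-4o-mini")

# stream=True and chunk.content are assumed names for illustration
for chunk in llm.generate("Tell me a short story", stream=True):
    print(chunk.content, end="", flush=True)
```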
| Provider | Status | SEED Support | Setup |
|---|---|---|---|
| OpenAI | Full | Native | Get API key |
| Anthropic | Full | Warning* | Get API key |
| Ollama | Full | Native | Install guide |
| LMStudio | Full | Native | Install guide |
| MLX | Full | Native | Setup guide |
| HuggingFace | Full | Native | Setup guide |
*Anthropic doesn't support the `seed` parameter; AbstractCore issues a warning when one is provided. Use `temperature=0.0` for more consistent outputs.
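In practice (a short sketch based on the table above; the models are examples from this README):

```python
from abstractcore import create_llm

# OpenAI: native seed support for reproducible outputs
llm = create_llm("openai", model="gpt-4o-mini", seed=42, temperature=0.0)

# Anthropic: the seed triggers a warning and is not honored;
# temperature=0.0 still improves consistency
llm = create_llm("anthropic", model="claude-3-5-haiku-latest", seed=42, temperature=0.0)
```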
AbstractCore is primarily a Python library. The server is an optional component that provides OpenAI-compatible HTTP endpoints:
```bash
# Install with server support
pip install abstractcore[server]
# Start the server
uvicorn abstractcore.server.app:app --host 0.0.0.0 --port 8000
```

Use with any OpenAI-compatible client:

```python
from openai import OpenAI
client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")
response = client.chat.completions.create(
    model="anthropic/claude-3-5-haiku-latest",
    messages=[{"role": "user", "content": "Hello!"}]
)
```

Server Features:
- OpenAI-compatible REST endpoints (`/v1/chat/completions`, `/v1/embeddings`, `/v1/responses`)
- NEW in v2.5.0: OpenAI Responses API (`/v1/responses`) with native `input_file` support
- Multi-provider support through one HTTP API
- Comprehensive media processing (images, PDFs, Office documents, CSV/TSV)
- Agentic CLI integration (Codex, Crush, Gemini CLI)
- Opt-in streaming responses
- Tool call format conversion
- Enhanced debug logging with `--debug` flag
- Interactive API docs at `/docs` (Swagger UI)
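A sketch of calling the `/v1/responses` endpoint through the OpenAI SDK, assuming the server mirrors OpenAI's Responses API shape (the model name and prompt are illustrative):

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

# Assumes standard Responses API semantics on the AbstractCore server
response = client.responses.create(
    model="anthropic/claude-3-5-haiku-latest",
    input="Summarize this project in one sentence."
)
print(response.output_text)
```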
When to use the server:
- Integrating with existing OpenAI-compatible tools
- Using agentic CLIs (Codex, Crush, Gemini CLI)
- Building web applications that need HTTP API
- Multi-language access (not just Python)
AbstractCore includes a built-in CLI for interactive testing, development, and conversation management. This is an internal testing tool, distinct from external agentic CLIs.
```bash
# Start interactive CLI
python -m abstractcore.utils.cli --provider ollama --model qwen3-coder:30b
# With streaming enabled
python -m abstractcore.utils.cli --provider openai --model gpt-4o-mini --stream
# Single prompt execution
python -m abstractcore.utils.cli --provider anthropic --model claude-3-5-haiku-latest --prompt "What is Python?"
```

Key Features:
- Interactive REPL with conversation history
- Chat history compaction and management
- Fact extraction from conversations
- Conversation quality evaluation (LLM-as-a-judge)
- Intent analysis and deception detection
- Tool call testing and debugging
- System prompt management
- Multiple provider support
Popular Commands:
- `/compact` - Compress chat history while preserving context
- `/facts [file]` - Extract structured facts from conversation
- `/judge` - Evaluate conversation quality with feedback
- `/intent [participant]` - Analyze psychological intents and detect deception
- `/history [n]` - View conversation history
- `/stream` - Toggle real-time streaming
- `/system [prompt]` - Show or change system prompt
- `/status` - Show current provider, model, and capabilities
Full Documentation: AbstractCore CLI Guide
When to use the CLI:
- Interactive development and testing
- Debugging tool calls and provider behavior
- Conversation management experiments
- Quick prototyping with different models
- Learning AbstractCore capabilities
AbstractCore includes four specialized command-line applications for common LLM tasks. These are production-ready tools that can be used directly from the terminal without any Python programming.
| Application | Purpose | Direct Command |
|---|---|---|
| Summarizer | Document summarization | summarizer |
| Extractor | Entity and relationship extraction | extractor |
| Judge | Text evaluation and scoring | judge |
| Intent Analyzer | Psychological intent analysis & deception detection | intent |
```bash
# Document summarization with different styles and lengths
summarizer document.pdf --style executive --length brief
summarizer report.txt --focus "technical details" --output summary.txt
summarizer large_doc.txt --chunk-size 15000 --provider openai --model gpt-4o-mini
# Entity extraction with various formats and options
extractor research_paper.pdf --format json-ld --focus technology
extractor article.txt --entity-types person,organization,location --output entities.jsonld
extractor doc.txt --iterate 3 --mode thorough --verbose
# Text evaluation with custom criteria and contexts
judge essay.txt --criteria clarity,accuracy,coherence --context "academic writing"
judge code.py --context "code review" --format plain --verbose
judge proposal.md --custom-criteria has_examples,covers_risks --output assessment.json
# Intent analysis with psychological insights and deception detection
intent conversation.txt --focus-participant user --depth comprehensive
intent email.txt --format plain --context document --verbose
intent chat_log.json --conversation-mode --provider lmstudio --model qwen/qwen3-30b-a3b-2507
```

Apps are automatically available after installing AbstractCore:

```bash
# Install with all features
pip install abstractcore[all]
# Apps are immediately available
summarizer --help
extractor --help
judge --help
intent --help
```

```bash
# Method 1: Direct commands (recommended)
summarizer document.txt
extractor report.pdf
judge essay.md
intent conversation.txt
# Method 2: Via Python module
python -m abstractcore.apps summarizer document.txt
python -m abstractcore.apps extractor report.pdf
python -m abstractcore.apps judge essay.md
python -m abstractcore.apps intent conversation.txtCommon Parameters (all apps):
- `--provider` + `--model` - Use different LLM providers (OpenAI, Anthropic, Ollama, etc.)
- `--output` - Save results to file instead of console
- `--verbose` - Show detailed progress information
- `--timeout` - HTTP timeout for LLM requests (default: 300s)
Summarizer Parameters:
- `--style` - Summary style: `structured`, `narrative`, `objective`, `analytical`, `executive`, `conversational`
- `--length` - Summary length: `brief`, `standard`, `detailed`, `comprehensive`
- `--focus` - Specific focus area for summarization
- `--chunk-size` - Chunk size for large documents (1000-32000, default: 8000)
Extractor Parameters:
- `--format` - Output format: `json-ld`, `triples`, `json`, `yaml`
- `--entity-types` - Focus on specific entities: `person`, `organization`, `location`, `technology`, etc.
- `--mode` - Extraction mode: `fast`, `balanced`, `thorough`
- `--iterate` - Number of refinement iterations (1-10, default: 1)
- `--minified` - Output compact JSON without indentation
Judge Parameters:
- `--context` - Evaluation context (e.g., "code review", "academic writing")
- `--criteria` - Standard criteria: `clarity`, `soundness`, `effectiveness`, etc.
- `--custom-criteria` - Custom evaluation criteria
- `--format` - Output format: `json`, `plain`, `yaml`
- `--include-criteria` - Include detailed criteria explanations
- Provider Agnostic: Works with any configured LLM provider (OpenAI, Anthropic, Ollama, etc.)
- Multiple Formats: Support for PDF, TXT, MD, DOCX, and more
- Flexible Output: JSON, JSON-LD, YAML, plain text formats
- Batch Processing: Process multiple files at once
- Configurable: Custom prompts, criteria, and evaluation rubrics
- Production Ready: Robust error handling and logging
Each application has documentation with examples and usage information:
- Summarizer Guide - Document summarization with multiple strategies
- Extractor Guide - Entity and relationship extraction
- Intent Analyzer Guide - Psychological intent analysis and deception detection
- Judge Guide - Text evaluation and scoring systems
When to use the apps:
- Processing documents without writing code
- Batch text analysis workflows
- Quick prototyping of text processing pipelines
- Integration with shell scripts and automation
- Standardized text processing tasks
AbstractCore provides a centralized configuration system that manages default models, cache directories, and logging settings from a single location. This eliminates the need to repeatedly specify `--provider` and `--model` parameters.
```bash
# Check current configuration (shows how to change each setting)
abstractcore --status
# Set defaults for all applications
abstractcore --set-global-default ollama/llama3:8b
# Or configure specific applications (examples of customization)
abstractcore --set-app-default summarizer openai gpt-4o-mini
abstractcore --set-app-default extractor ollama qwen3:4b-instruct
abstractcore --set-app-default judge anthropic claude-3-5-haiku
# Configure logging (common examples)
abstractcore --set-console-log-level WARNING # Reduce console output
abstractcore --set-console-log-level NONE # Disable console logging
abstractcore --enable-file-logging # Save logs to files
abstractcore --enable-debug-logging # Full debug mode
# Configure vision for image analysis with text-only models
abstractcore --set-vision-provider ollama qwen2.5vl:7b
abstractcore --set-vision-provider lmstudio qwen/qwen3-vl-4b
# Set API keys as needed
abstractcore --set-api-key openai sk-your-key-here
abstractcore --set-api-key anthropic your-anthropic-key
# Verify configuration (includes change commands for each setting)
abstractcore --status
```

AbstractCore uses a clear priority system where explicit parameters always override defaults:
1. Explicit parameters (highest priority): `summarizer doc.txt --provider openai --model gpt-4o-mini`
2. App-specific config: `abstractcore --set-app-default summarizer openai gpt-4o-mini`
3. Global config: `abstractcore --set-global-default openai/gpt-4o-mini`
4. Built-in defaults (lowest priority): `huggingface/unsloth/Qwen3-4B-Instruct-2507-GGUF`
Once configured, apps use your defaults automatically:
```bash
# Before configuration (requires explicit parameters)
summarizer document.pdf --provider openai --model gpt-4o-mini
# After configuration (uses configured defaults)
summarizer document.pdf
# Explicit parameters still override when needed
summarizer document.pdf --provider anthropic --model claude-3-5-sonnet
```

The configuration system also manages:

- Application defaults: Different optimal models for each app
- Cache directories: Configurable cache locations for models and data
- Logging control: Package-wide logging levels and debug mode
- API key management: Centralized API key storage
- Interactive setup: `abstractcore --configure` for guided configuration
Complete guide: Centralized Configuration
π Complete Documentation: docs/ - Full documentation index and navigation guide
- Prerequisites & Setup - Install and configure providers (OpenAI, Anthropic, Ollama, etc.)
- Getting Started Guide - 5-minute quick start with core concepts
- Troubleshooting - Common issues and solutions
- Python API Reference - Complete Python API documentation
- Media Handling System - Images, PDFs, and document processing across all providers
- Session Management - Persistent conversations, serialization, and analytics
- Embeddings Guide - Semantic search, RAG, and vector embeddings
- Code Examples - Working examples for all features
- Capabilities - What AbstractCore can and cannot do
- Server Documentation - Complete server setup, API reference, and deployment
- Architecture - System design and architecture overview
- Tool Calling - Universal tool system and format conversion
```python
# Same code works with any provider
models = {
    "openai": "gpt-4o-mini",
    "anthropic": "claude-3-5-haiku-latest",
    "ollama": "qwen3:4b-instruct-2507-q4_K_M",
}
for provider, model in models.items():
    llm = create_llm(provider, model=model)  # Pick a model suited to each provider
    response = llm.generate("Hello!")
```

```python
# Same image analysis works with any vision model
image_files = ["product_photo.jpg", "user_feedback.png"]
prompt = "Analyze these product images and suggest improvements"
# OpenAI GPT-4o
openai_llm = create_llm("openai", model="gpt-4o")
openai_analysis = openai_llm.generate(prompt, media=image_files)
# Anthropic Claude
claude_llm = create_llm("anthropic", model="claude-3.5-sonnet")
claude_analysis = claude_llm.generate(prompt, media=image_files)
# Local model (free)
local_llm = create_llm("ollama", model="qwen3-vl:8b")
local_analysis = local_llm.generate(prompt, media=image_files)
```

```python
# Universal document analysis
documents = ["contract.pdf", "financial_data.xlsx", "presentation.ppt"]
analysis_prompt = "Extract key information and identify potential risks"
# Works with any provider
llm = create_llm("anthropic", model="claude-3.5-sonnet")
response = llm.generate(analysis_prompt, media=documents)
# Automatic format handling:
# - PDF: Text extraction with PyMuPDF4LLM
# - Excel: Table parsing with pandas
# - PowerPoint: Slide content extraction with unstructured
```

```python
# Development (free, local)
llm_dev = create_llm("ollama", model="qwen3:4b-instruct-2507-q4_K_M")
# Production (high quality, cloud)
llm_prod = create_llm("openai", model="gpt-4o-mini")
```

```python
from abstractcore.embeddings import EmbeddingManager
# Create embeddings for semantic search
embedder = EmbeddingManager()
docs_embeddings = embedder.embed_batch([
    "Python is great for data science",
    "JavaScript powers the web",
    "Rust ensures memory safety"
])
# Find most similar document
query_embedding = embedder.embed("Tell me about web development")
similarity = embedder.compute_similarity(query_embedding, docs_embeddings[0])
```

```python
from pydantic import BaseModel
from abstractcore import create_llm
class MovieReview(BaseModel):
    title: str
    rating: int  # 1-5
    summary: str
llm = create_llm("openai", model="gpt-4o-mini")
review = llm.generate(
    "Review the movie Inception",
    response_model=MovieReview
)
print(f"{review.title}: {review.rating}/5")Learn more about Structured Output
```bash
# Start server once
uvicorn abstractcore.server.app:app --port 8000
# Use with any OpenAI client
curl -X POST http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "ollama/qwen3-coder:30b",
    "messages": [{"role": "user", "content": "Write a Python function"}]
  }'
```

- Unified Interface: One API for all LLM providers
- Multimodal Support: Upload images, PDFs, and documents across all providers
- Vision Models: Seamless integration with GPT-4o, Claude Vision, qwen3-vl, and more
- Production Ready: Robust error handling, retries, timeouts
- Type Safe: Full Pydantic integration for structured outputs
- Local & Cloud: Run models locally or use cloud APIs
- Tool Calling: Consistent function calling across providers
- Streaming: Real-time responses for interactive applications
- Embeddings: Built-in vector embeddings for RAG
- Server Mode: Optional OpenAI-compatible API server
- Well Documented: Comprehensive guides and examples
```bash
# Minimal core
pip install abstractcore
# With media handling (images, PDFs, documents)
pip install abstractcore[media]
# With specific providers
pip install abstractcore[openai]
pip install abstractcore[anthropic]
pip install abstractcore[ollama]
# With server support
pip install abstractcore[server]
# With embeddings
pip install abstractcore[embeddings]
# Everything (recommended)
pip install abstractcore[all]
```

Media processing extras:

```bash
# For PDF processing
pip install pymupdf4llm
# For Office documents (DOCX, XLSX, PPT)
pip install unstructured
# For image optimization
pip install pillow
# For data processing (CSV, Excel)
pip install pandas
```

All tests passing as of October 12th, 2025.
Test Environment:
- Hardware: MacBook Pro (14-inch, Nov 2024)
- Chip: Apple M4 Max
- Memory: 128 GB
- Python: 3.12.2
- π Documentation Index - Complete documentation navigation guide
- Getting Started - 5-minute quick start
- βοΈ Prerequisites - Provider setup (OpenAI, Anthropic, Ollama, etc.)
- π Python API - Complete Python API reference
- π Server Guide - HTTP API server setup
- π§ Troubleshooting - Fix common issues
- π» Examples - Working code examples
- π Issues - Report bugs
- π¬ Discussions - Get help
Maintainer: Laurent-Philippe Albou
π§ Email: [email protected]
We welcome contributions! See CONTRIBUTING.md for guidelines.
MIT License - see LICENSE file for details.
AbstractCore - One interface, all LLM providers. Focus on building, not managing API differences.
Migration Note: This project was previously known as "AbstractLLM" and has been completely rebranded to "AbstractCore" as of version 2.4.0. See CHANGELOG.md for migration details.