tests

ROMA-DSPy Test Suite

This directory contains the test suite for ROMA-DSPy, organized by test type and categorized with pytest markers for flexible test execution.

Test Organization

tests/
├── unit/              # Fast, isolated unit tests
├── integration/       # Integration tests with external services
├── tools/             # Toolkit-specific tests
├── validation/        # Validation and verification tests
├── performance/       # Performance benchmarks (future)
└── fixtures/          # Shared test fixtures

Test Markers

Tests are categorized using pytest markers. Use markers to run specific subsets of tests:

Primary Categories

unit - Fast unit tests with no external dependencies
integration - Integration tests requiring external services
e2e - End-to-end system tests

Requirement Markers

requires_db - Requires PostgreSQL database
requires_llm - Requires LLM API keys (OpenAI, etc.)
requires_e2b - Requires E2B sandbox environment

Feature Markers

checkpoint - Checkpoint/recovery functionality tests
error_handling - Error propagation tests
tools - Toolkit integration tests
performance - Performance benchmarks
slow - Long-running tests

Running Tests

Run all tests

pytest

Run only unit tests (fast)

pytest -m unit

Run integration tests (requires services)

pytest -m integration

Run tests that require PostgreSQL

# Start Postgres first
docker-compose up -d postgres

# Run database tests
pytest -m requires_db

# Cleanup
docker-compose down

Run tests that require LLM APIs

# Set API keys
export OPENAI_API_KEY=your_key_here

# Run LLM tests
pytest -m requires_llm

Run specific test categories

# Only checkpoint tests
pytest -m checkpoint

# Only toolkit tests
pytest -m tools

# Integration tests that don't need DB
pytest -m "integration and not requires_db"

# E2E tests with all requirements
pytest -m "e2e and requires_db and requires_llm"

Run tests by directory

# All unit tests
pytest tests/unit/

# Specific test file
pytest tests/unit/test_dag_serialization.py

# Specific test function
pytest tests/unit/test_dag_serialization.py::test_serialize_task_node

Test Coverage

# Run with coverage report
pytest --cov=src/roma_dspy --cov-report=html

# Open coverage report
open htmlcov/index.html

Setting Up Test Environment

1. Install Development Dependencies

pip install -e ".[dev]"

2. Start PostgreSQL (for DB tests)

docker-compose up -d postgres

# Verify it's running
docker-compose ps

# Check logs
docker-compose logs postgres

3. Set Environment Variables

# Required for LLM tests
export OPENAI_API_KEY=sk-...
export FIREWORKS_API_KEY=...

# Required for DB tests (docker-compose defaults)
export DATABASE_URL=postgresql+asyncpg://postgres:postgres@localhost/roma_dspy_test

# Optional: E2B sandbox
export E2B_API_KEY=...

4. Run Database Migrations (first time)

# Apply migrations to test database
uv run alembic upgrade head

Writing Tests

Test Structure

import pytest

@pytest.mark.unit
def test_my_unit_test():
    """Test description."""
    # Fast test with no external dependencies
    assert True

@pytest.mark.integration
@pytest.mark.requires_db
async def test_my_integration_test(postgres_storage):
    """Test description."""
    # Integration test using fixtures
    result = await postgres_storage.get_execution("exec_123")
    assert result is not None

Using Markers

# Single marker
@pytest.mark.unit

# Multiple markers
@pytest.mark.integration
@pytest.mark.slow
@pytest.mark.requires_db

# Skip with condition
@pytest.mark.skipif(
    not os.getenv("OPENAI_API_KEY"),
    reason="Requires OPENAI_API_KEY environment variable"
)

Fixtures

Common fixtures are available in tests/conftest.py and tests/fixtures/:

postgres_storage - Initialized PostgresStorage instance
postgres_config - PostgresConfig for testing
temp_checkpoint_dir - Temporary directory for checkpoint tests
Mock fixtures for LMs and external services

Continuous Integration

Tests run automatically on:

Pull requests (unit + integration without external deps)
Main branch commits (full suite with services)

See .github/workflows/ci.yml for CI configuration.

Troubleshooting

Tests Timing Out

# Increase timeout for slow tests
pytest --timeout=300

Database Connection Errors

# Check Postgres is running
docker-compose ps

# Reset database
docker-compose down -v
docker-compose up -d postgres

Import Errors

# Reinstall in editable mode
pip install -e .

Skipped Tests

# See why tests were skipped
pytest -v -rs

# Force run skipped tests (dangerous!)
pytest --runxfail

Test Best Practices

Keep unit tests fast - No I/O, no network, no external services
Use appropriate markers - Tag tests accurately for selective running
Mock external dependencies - Use mocks for LLMs in unit tests
Clean up resources - Use fixtures for setup/teardown
Test edge cases - Invalid inputs, error conditions, boundary values
Document test purpose - Clear docstrings explaining what's being tested

Performance Testing

Performance tests are planned for future development:

# Run performance benchmarks (future)
pytest -m performance --benchmark-only

Test Data

Test data and fixtures are in:

tests/fixtures/ - Reusable test data
Individual test files - Test-specific data

Avoid committing sensitive data (API keys, credentials) to test files.

Name		Name	Last commit message	Last commit date
parent directory ..
fixtures		fixtures
integration		integration
roma_dspy/tui		roma_dspy/tui
tools		tools
unit		unit
validation		validation
README.md		README.md
conftest.py		conftest.py
test_cli_integration.py		test_cli_integration.py
test_cli_minimal_install.py		test_cli_minimal_install.py
test_config.py		test_config.py
test_engine.py		test_engine.py
test_enhanced_config_validation.py		test_enhanced_config_validation.py
test_minimal_e2e_real_install.py		test_minimal_e2e_real_install.py
test_minimal_install.py		test_minimal_install.py
test_modules.py		test_modules.py
test_package_build.py		test_package_build.py
test_parallel.py		test_parallel.py
test_sdk_usage.py		test_sdk_usage.py
test_toolkit_injection_bugs.py		test_toolkit_injection_bugs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

ROMA-DSPy Test Suite

Test Organization

Test Markers

Primary Categories

Requirement Markers

Feature Markers

Running Tests

Run all tests

Run only unit tests (fast)

Run integration tests (requires services)

Run tests that require PostgreSQL

Run tests that require LLM APIs

Run specific test categories

Run tests by directory

Test Coverage

Setting Up Test Environment

1. Install Development Dependencies

2. Start PostgreSQL (for DB tests)

3. Set Environment Variables

4. Run Database Migrations (first time)

Writing Tests

Test Structure

Using Markers

Fixtures

Continuous Integration

Troubleshooting

Tests Timing Out

Database Connection Errors

Import Errors

Skipped Tests

Test Best Practices

Performance Testing

Test Data

FilesExpand file tree

tests

Directory actions

More options

Directory actions

More options

Latest commit

History

tests

Folders and files

parent directory

README.md

ROMA-DSPy Test Suite

Test Organization

Test Markers

Primary Categories

Requirement Markers

Feature Markers

Running Tests

Run all tests

Run only unit tests (fast)

Run integration tests (requires services)

Run tests that require PostgreSQL

Run tests that require LLM APIs

Run specific test categories

Run tests by directory

Test Coverage

Setting Up Test Environment

1. Install Development Dependencies

2. Start PostgreSQL (for DB tests)

3. Set Environment Variables

4. Run Database Migrations (first time)

Writing Tests

Test Structure

Using Markers

Fixtures

Continuous Integration

Troubleshooting

Tests Timing Out

Database Connection Errors

Import Errors

Skipped Tests

Test Best Practices

Performance Testing

Test Data