# Testing Guide

## Quick Start
```bash
cd backend

# Run all unit tests (fast, no external deps)
pytest -m unit

# Run a specific test file
pytest tests/test_mode_router.py

# Run a single test
pytest tests/test_mode_router.py::TestModeRouter::test_respond_wins_on_warm_question

# Verbose output
pytest -m unit -v
```
## Conventions

### Naming

Tests are named `test_{behavior}_when_{condition}`. Omit `_when_` for obvious cases:

```python
def test_empty_query_returns_empty(self): ...                 # obvious
def test_site_goes_down(self): ...                            # obvious
def test_accept_task_enforces_max_active_limit(self): ...     # with condition
```
### Structure (AAA)

Every test follows Arrange / Act / Assert:

```python
def test_get_returns_value_when_found(self, mock_db):
    # Arrange
    db, _, result = mock_db
    result.fetchone.return_value = ('my-value',)
    service = SettingsService(db)

    # Act
    value = service.get('some_key')

    # Assert
    assert value == 'my-value'
```
### Markers

```python
@pytest.mark.unit          # No external deps (Redis, Postgres, LLM)
@pytest.mark.integration   # Requires running services
```
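A minimal sketch of marking and selecting; the test class below is a placeholder, not taken from the codebase:

```python
import pytest

@pytest.mark.unit              # pure logic: no Redis, Postgres, or LLM
class TestSlugify:
    def test_spaces_become_dashes(self):
        assert 'take medicine'.replace(' ', '-') == 'take-medicine'
```

Select one category from the command line with `pytest -m unit` or `pytest -m integration`.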
### Banned Patterns

- `assert True`: always passes, tests nothing
- `assert isinstance(x, dict)` as the sole assertion: verify content, not type
- `assert x is not None` without a content check: verify the actual value
- Tests with zero assertions: every test must assert specific behavior
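For contrast, a sketch of a banned assertion rewritten to check real content (the payload is illustrative):

```python
def test_timezone_trait_is_returned(self):
    trait = {'trait_key': 'timezone', 'trait_value': 'CET'}
    # Banned: passes for any non-None value, proves nothing about content
    # assert trait is not None
    # Good: verifies the actual value
    assert trait['trait_value'] == 'CET'
```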
## Fixture Catalog

All fixtures live in `tests/conftest.py`.

### mock_redis

In-memory Redis via `fakeredis.FakeRedis(decode_responses=True)`. Patches `RedisClientService.create_connection`. Flushes on teardown.
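A usage sketch, assuming only what the description above states (the key name is illustrative):

```python
def test_value_round_trips_through_redis(self, mock_redis):
    # fakeredis with decode_responses=True returns str, not bytes
    mock_redis.set('scheduler:last_run', '2024-01-01')
    assert mock_redis.get('scheduler:last_run') == '2024-01-01'
```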
### mock_config

Patches `ConfigService.get_agent_config`, `get_agent_prompt`, and `connections`. Provides realistic agent configs for `memory-chunker`, `mode-router`, `fact-store`, and `frontal-cortex`.

### mock_ollama

Returns `LLMResponse(text='{"gists": [], "scope": "test"}', model='test-model', provider='ollama')`. Also mocks `generate_embedding` → `[0.0] * 256`.

### mock_db

Basic DB mock. The context manager yields the cursor directly. Good for simple services.
### mock_db_rows

Extended DB mock with programmable cursor returns. Supports both patterns:

- `db.connection()` → conn → `conn.cursor()` → cursor
- `db.get_session()` → session → `session.execute()` → result

Returns a `(db, cursor)` tuple.
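A sketch of programming the cursor, assuming the fixture wires `db.connection()` to the returned cursor as in the Connection Pattern section below:

```python
def test_cursor_returns_programmed_rows(self, mock_db_rows):
    db, cursor = mock_db_rows
    cursor.fetchall.return_value = [('row-1',), ('row-2',)]
    # Any code that goes through db.connection() sees the programmed rows
    with db.connection() as conn:
        assert conn.cursor().fetchall() == [('row-1',), ('row-2',)]
```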
### mock_llm

Configurable LLM mock. Set `mock_llm.response_text` before calling:

```python
def test_something(self, mock_llm):
    mock_llm.response_text = '{"verdict": "good"}'
    # Services calling create_llm_service().send_message() get that text
```
### authed_client

Full Flask app with all blueprints, auth bypassed, DB/Redis mocked:

```python
def test_endpoint(self, authed_client):
    client, mock_db, mock_redis = authed_client
    response = client.get('/system/health')
    assert response.status_code == 200
```
## Test Data Factories

Located in `tests/helpers.py`. They return tuples matching the actual DB column orders:

```python
from tests.helpers import (
    make_task_row,
    make_scheduled_item,
    make_trait_row,
    make_episode_row,
    make_provider_row,
)

# 18-element tuple matching the persistent_tasks SELECT
row = make_task_row(status='accepted', goal='Monitor weather')

# 11-element tuple matching the scheduled_items SELECT
row = make_scheduled_item(message='Take medicine', recurrence='daily')

# 4-element tuple matching the user_traits SELECT
row = make_trait_row(trait_key='timezone', trait_value='CET')

# Dict matching the episodic retrieval service output
episode = make_episode_row(gist='Discussed morning routine')

# 9-element tuple matching the providers SELECT
provider = make_provider_row(platform='ollama', model='qwen3:4b')
```
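Factories pair naturally with the DB mocks: program the cursor with factory rows and the service under test sees realistically shaped data. A sketch, with a hypothetical service name:

```python
from tests.helpers import make_task_row

def test_accepted_tasks_are_listed(self, mock_db_rows):
    db, cursor = mock_db_rows
    cursor.fetchall.return_value = [
        make_task_row(status='accepted', goal='Monitor weather'),
        make_task_row(status='accepted', goal='Water plants'),
    ]
    service = PersistentTaskService(db)      # hypothetical service, for illustration only
    tasks = service.list_by_status('accepted')
    assert len(tasks) == 2
```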
## Mock Strategies

### LLM Mocking

Never call real LLMs in unit tests. Two approaches:

Via the `mock_llm` fixture (patches `create_llm_service`):

```python
def test_critic(self, mock_llm):
    mock_llm.response_text = '{"safe": true, "verdict": "ok"}'
    result = critic.evaluate(action_result)
```

Via the `mock_ollama` fixture (patches `OllamaService`):

```python
def test_chunker(self, mock_ollama, mock_config):
    mock_ollama.send_message.return_value = LLMResponse(
        text='{"gists": [{"content": "test"}]}',
        model='test', provider='ollama'
    )
```
### DB Mocking — Connection Pattern

For services using `db.connection()` → cursor (most services):

```python
@pytest.fixture
def mock_db(self):
    db = MagicMock()
    cursor = MagicMock()
    conn = MagicMock()
    conn.cursor.return_value = cursor
    ctx = MagicMock()
    ctx.__enter__ = MagicMock(return_value=conn)
    ctx.__exit__ = MagicMock(return_value=False)
    db.connection.return_value = ctx
    return db, cursor
```
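A test consuming this fixture might look like the following sketch; the service, table name, and SQL are hypothetical:

```python
def test_add_executes_insert(self, mock_db):
    db, cursor = mock_db
    service = NoteService(db)                    # hypothetical service built on db.connection()
    service.add('remember milk')
    sql = cursor.execute.call_args[0][0]         # first positional arg of the last execute() call
    assert 'insert into notes' in sql.lower()    # table name is illustrative
```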
### DB Mocking — Session Pattern

For services using `db.get_session()` → session (`SettingsService`, `ProviderDbService`, auth):

```python
@pytest.fixture
def mock_db(self):
    db = MagicMock()
    session = MagicMock()
    result = MagicMock()
    session.execute.return_value = result
    ctx = MagicMock()
    ctx.__enter__ = MagicMock(return_value=session)
    ctx.__exit__ = MagicMock(return_value=False)
    db.get_session.return_value = ctx
    return db, session, result
```
### Auth Bypass for API Tests

The `@require_session` decorator checks `validate_session(request)` at runtime:

```python
@pytest.fixture(autouse=True)
def bypass_auth(self):
    with patch('services.auth_session_service.validate_session', return_value=True):
        yield
```
### Optional Dependencies

Use `pytest.importorskip` for tools with optional deps:

```python
feedparser = pytest.importorskip('feedparser', reason='feedparser not installed')
from tools.reddit_digest.handler import execute
```
### Module Injection for Missing Packages

When a dependency is lazily imported inside a function and not installed:

```python
MockWebPushException = type('WebPushException', (Exception,), {})
mock_pywebpush = MagicMock()
mock_pywebpush.webpush = MagicMock()
mock_pywebpush.WebPushException = MockWebPushException

with patch.dict('sys.modules', {'pywebpush': mock_pywebpush}):
    send_push_to_all("Test", "Body")
```
## Adding Tests for a New Service

- Create `tests/test_my_service.py`
- Import the service: `from services.my_service import MyService`
- Add `@pytest.mark.unit` to the test class
- Mock external deps (DB, Redis, LLM) in fixtures
- Write tests following AAA structure (a skeleton is sketched below)
- Run: `pytest tests/test_my_service.py -v`
- Verify: `pytest -m unit` still passes
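A minimal skeleton following these steps; the service API shown is hypothetical:

```python
import pytest
from unittest.mock import MagicMock

from services.my_service import MyService       # as in step 2

@pytest.mark.unit
class TestMyService:
    @pytest.fixture
    def mock_db(self):
        return MagicMock()

    def test_get_returns_default_when_missing(self, mock_db):
        # Arrange
        service = MyService(mock_db)
        # Act
        value = service.get('missing-key', default='fallback')   # hypothetical method
        # Assert
        assert value == 'fallback'
```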
## Adding Tests for a New API Blueprint

- Create `tests/test_api_my_endpoint.py`
- Register only the blueprint you’re testing:

  ```python
  @pytest.fixture
  def client(self):
      app = Flask(__name__)
      app.register_blueprint(my_bp)
      app.config['TESTING'] = True
      return app.test_client()
  ```

- Add a `bypass_auth` autouse fixture
- Patch service constructors at their lazy-import points
- Test HTTP status codes, JSON response shape, and key field values (see the sketch after this list)
- Test validation (400), not-found (404), and error (500) paths
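Putting those steps together, a sketch of one endpoint test; the blueprint module path, route, and service name are placeholders:

```python
from unittest.mock import patch

def test_get_item_returns_404_when_missing(self, client):
    # bypass_auth is autouse, so no real session is required
    with patch('api.my_endpoint.MyService') as MockService:      # placeholder lazy-import point
        MockService.return_value.get_item.return_value = None
        response = client.get('/my-endpoint/unknown-id')
    assert response.status_code == 404
```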
## Adding Tests for a New Tool Handler

- Create `tests/test_tool_my_tool.py`
- Import: `from tools.my_tool.handler import execute`
- Mock external HTTP calls via `patch('requests.get')`
- Test the state round-trip: `result['_state']` → feed back as `params['_state']` (sketched below)
- Test error cases: missing config, API failures, invalid responses