Test agent

FixedResponseModel

Bases: Model

A Mock LLM Implementation for deterministic testing.

Instead of calling OpenAI, this class returns pre-defined response_parts. This allows us to simulate specific LLM behaviors (e.g., calling a tool, generating text, or even hallucinating a malicious command) to test how the Agent loop responds.

Attributes:

response_parts (list[Any]): The content the "LLM" should return in the first turn.

Source code in api/tests/integration/test_agent.py
class FixedResponseModel(Model):
    """
    A Mock LLM Implementation for deterministic testing.

    Instead of calling OpenAI, this class returns pre-defined `response_parts`.
    This allows us to simulate specific LLM behaviors (e.g., calling a tool,
    generating text, or even hallucinating a malicious command) to test how
    the Agent loop responds.

    Attributes:
        response_parts (list[Any]): The content the "LLM" should return in the first turn.
    """

    def __init__(self, response_parts: list[Any]):
        self.response_parts = response_parts
        self._call_count = 0

    async def request(
        self,
        messages: list[Any],
        model_settings: ModelSettings | None,
        model_request_parameters: Any | None,
    ) -> ModelResponse:
        if self._call_count == 0:
            self._call_count += 1
            return ModelResponse(parts=self.response_parts)
        else:
            return ModelResponse(parts=[TextPart("Testing complete.")])

    @property
    def model_name(self) -> str:
        return "fixed-response-mock"

    @property
    def system(self) -> str | None:
        return "mock-system-prompt"

mock_deps()

Creates mocked Agent Dependencies (Database).

Simulates a DuckDB connection to avoid needing a real database file on disk.

Source code in api/tests/integration/test_agent.py
@pytest.fixture
def mock_deps():
    """
    Creates mocked Agent Dependencies (Database).

    Simulates a DuckDB connection to avoid needing a real database file on disk.
    """
    deps = MagicMock(spec=AgentDeps)
    deps.db_path = ":memory:"

    mock_conn = MagicMock()
    mock_conn.execute.return_value.df.return_value.empty = False

    deps.get_db_connection.return_value = mock_conn
    return deps
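
The chained return_value assignment is what makes the mock behave like a populated database: any execute(...).df() call yields an object whose empty attribute is False, so tools believe data exists. A small sketch of that wiring outside pytest (illustrative only, not part of the test file):

from unittest.mock import MagicMock

conn = MagicMock()
conn.execute.return_value.df.return_value.empty = False

# Whatever SQL the tool sends, the mocked "DataFrame" reports it is non-empty.
df = conn.execute("SELECT count(*) FROM srag_analytics").df()
assert df.empty is False

# The query text is still captured, so tests can assert on it afterwards.
assert "count(*)" in conn.execute.call_args[0][0]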

orchestrator()

Initializes the Orchestrator with Test settings.

Source code in api/tests/integration/test_agent.py
@pytest.fixture
def orchestrator():
    """Initializes the Orchestrator with Test settings."""
    settings = get_settings()
    return SRAGAgentOrchestrator(settings)

test_agent_sql_guardrail_interception(orchestrator, mock_deps) async

Integration Test: Ensures the Guardrail actively stops a compromised Agent.

This test forces the Mock LLM to emit a malicious DROP TABLE tool call. We assert that pydantic-ai-guardrails intercepts this before it reaches the tool execution layer.

Flow:

  1. Set up FixedResponseModel to return a DROP SQL query.
  2. Run Agent.
  3. Assert: An OutputGuardrailViolation exception is raised.
Source code in api/tests/integration/test_agent.py
@pytest.mark.asyncio
async def test_agent_sql_guardrail_interception(orchestrator, mock_deps):
    """
    Integration Test: Ensures the Guardrail actively stops a compromised Agent.

    This test forces the Mock LLM to emit a malicious `DROP TABLE` tool call.
    We assert that `pydantic-ai-guardrails` intercepts this *before* it reaches
    the tool execution layer.

    **Flow:**

    1.  Set up `FixedResponseModel` to return a `DROP` SQL query.
    2.  Run Agent.
    3.  **Assert:** An `OutputGuardrailViolation` exception is raised.

    """
    malicious_response = [
        ToolCallPart(
            tool_name="stats_tool",
            args={"sql_query": "DROP TABLE srag_analytics; --"},
            tool_call_id="call_rogue_1",
        )
    ]

    orchestrator.base_agent.model = FixedResponseModel(malicious_response)

    with pytest.raises(OutputGuardrailViolation) as excinfo:
        await orchestrator.agent.run("Please count the records", deps=mock_deps)

    error_msg = str(excinfo.value)
    assert "validate_tool_parameters" in error_msg
    assert "Security Violation" in error_msg
    assert "DROP" in error_msg

test_agent_tool_routing_success(orchestrator, mock_deps) async

Integration Test: Verifies successful tool routing for valid queries.

Ensures that when the LLM emits a valid SELECT call, the arguments are passed correctly to the underlying database connection.

Source code in api/tests/integration/test_agent.py
@pytest.mark.asyncio
async def test_agent_tool_routing_success(orchestrator, mock_deps):
    """
    Integration Test: Verifies successful tool routing for valid queries.

    Ensures that when the LLM emits a valid `SELECT` call, the arguments are
    passed correctly to the underlying database connection.
    """
    valid_response = [
        ToolCallPart(
            tool_name="stats_tool",
            args={"sql_query": "SELECT count(*) FROM srag"},
            tool_call_id="call_valid_1",
        )
    ]

    orchestrator.base_agent.model = FixedResponseModel(valid_response)

    await orchestrator.agent.run("Count cases", deps=mock_deps)

    mock_conn = mock_deps.get_db_connection.return_value
    mock_conn.execute.assert_called()

    called_query = mock_conn.execute.call_args[0][0]
    assert "SELECT count(*)" in called_query

test_full_report_flow_integration(client)

End-to-End API Test: Checks the /report endpoint wiring.

This test mocks the internal orchestrator.run() method to isolate the FastAPI layer from the AI logic. It verifies that inputs (focus_area) are passed down and outputs (markdown) are returned up correctly.

Checks:

  1. Status Code 200.
  2. JSON response structure matches AgentResponse.
  3. Prompt injection logic (Focus Area appending) works as expected.
Source code in api/tests/integration/test_agent.py
def test_full_report_flow_integration(client):
    """
    End-to-End API Test: Checks the `/report` endpoint wiring.

    This test mocks the internal `orchestrator.run()` method to isolate the
    FastAPI layer from the AI logic. It verifies that inputs (focus_area)
    are passed down and outputs (markdown) are returned up correctly.

    **Checks:**

    1.  Status Code 200.
    2.  JSON response structure matches `AgentResponse`.
    3.  Prompt injection logic (Focus Area appending) works as expected.
    """
    mock_orchestrator = MagicMock()

    mock_result = MagicMock()
    mock_result.output = "Report generated with **bold text**."
    mock_result.all_messages.return_value = []

    mock_orchestrator.run = AsyncMock(return_value=mock_result)

    app.dependency_overrides[get_orchestrator] = lambda: mock_orchestrator

    try:
        payload = {"focus_area": "Children under 5"}
        response = client.post("/api/v1/agent/report", json=payload)

        assert response.status_code == 200, f"API failed: {response.text}"

        data = response.json()
        assert data["response"] == "Report generated with **bold text**."
        assert "execution_time" in data

        called_prompt = mock_orchestrator.run.call_args[0][0]
        assert "Children under 5" in called_prompt
        assert "STRICTLY FOLLOW" in called_prompt

    finally:
        app.dependency_overrides = {}
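
For reference, the wire contract this test exercises, written out as plain Python data (illustrative: the execution_time value is a placeholder, and in production the response field carries the full generated markdown report rather than this mocked string):

# POST /api/v1/agent/report
request_payload = {"focus_area": "Children under 5"}

# Shape asserted on above; additional fields may be present in the real payload.
expected_response_shape = {
    "response": "Report generated with **bold text**.",  # markdown from the (mocked) orchestrator
    "execution_time": 0.0,                                # present in the response; value varies per run
}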