Skip to content

Service test planning#478

Open
hanna-paasivirta wants to merge 4 commits intotesting-architecturefrom
testing-architecture-service-tests
Open

Service test planning#478
hanna-paasivirta wants to merge 4 commits intotesting-architecturefrom
testing-architecture-service-tests

Conversation

@hanna-paasivirta
Copy link
Copy Markdown
Contributor

Short Description

Updated section 2) service tests in agent-team-architecture-plan/2-service-tests.md

AI Usage

Please disclose how you've used AI in this work (it's cool, we just want to know!):

  • Code generation (copilot but not intellisense)
  • Learning or fact checking
  • Strategy / design
  • Optimisation / refactoring
  • Translation / spellchecking / doc gen
  • Other
  • I have not used AI

You can read more details in our Responsible AI Policy

mc.last_request.headers["x-api-key"]
mock = MockAnthropic()
mock.set_response(r"haiku", "sure, here's a haiku")
mock.set_response(r"create workflow", tool_use("call_workflow_agent", {...}))
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure I understand this bit

Copy link
Copy Markdown
Collaborator

@josephjclark josephjclark Apr 28, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh, is it changing the response structure to be a like a strucuted tool output, rather than free text?

So the regular response is like { content: "here's a haiku" }, but the tool response is like { type: "tool_use", content: { rag_results: [] } } or whatever

It might be nicer to have functions like mock.set_tool_response(type, input, output) , which give us more control when simulating these particular requests.

But we can resolve that in code, maybe we don't need to interfere with the spec yet

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

By set_tool_response(type, input, output), do you mean one call that handles the whole round trip like "when the model decides to call tool x with input y, the tool returns z"? Now they're separate as set_response + tool_use() for the model request and tool_stubs for what the tool returns

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah one call for the round trip is what I was thinking of

This approach with the tool_use call just feels weird to me.

But maybe we should just generate with this approach and take a close look at it, then maybe experiment with other approaches. Sometimes it's easier to make these decisions against a diff.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants