-
Notifications
You must be signed in to change notification settings - Fork 424
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ability to end agent runs as a result of a tool call #142
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
overall LGTM 👍
@@ -45,6 +51,11 @@ class RunContext(Generic[AgentDeps]): | |||
tool_name: str | None | |||
"""Name of the tool being called.""" | |||
|
|||
def end_run(self, result: ResultData) -> NoReturn: | |||
"""End the call to `agent.run` as soon as possible, using the provided value as the result.""" | |||
# NOTE: this means we ignore any other tools called concurrently |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
mention in docs what happens about cancelling other tools. Do the return values of functions that finish get added to messages?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
mention in docs what happens about cancelling other tools.
I added this comment to match the comments near the result schema things:
elif model_response.role == 'model-structured-response':
if self._result_schema is not None:
# if there's a result schema, and any of the calls match one of its tools, return the result
# NOTE: this means we ignore any other tools called here
if match := self._result_schema.find_tool(model_response):
Is there a discussion of this in the docs? I didn't see one in a quick search, but if so I can adapt that; if not, I guess we could add a note about that too?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would propose that we wait to update the docs about this until after we merge Jeremiah's PR that adds the "eager" vs. "correct" mode, and support reflect those modes here too
Deploying pydantic-ai with Cloudflare Pages
|
messages.extend(task_results) | ||
except _exceptions.StopAgentRun as e: | ||
span.set_attribute('stop_agent_run', e.tool_name) | ||
return _MarkFinalResult(data=e.result), [] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is not correct, and basically should be a type error, but the problem is that e.result
has type Any
(it is the value passed to ctx.stop_run
). However, I am not sure what's going on here so not sure how to tweak things to fix this. Maybe we can discuss tomorrow.
ToolReturn(tool_name='final_result', content=('abcdef', 777), timestamp=IsNow(tz=timezone.utc)), | ||
] | ||
) | ||
assert await result.get_data() == snapshot(('abcdef', 777)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm hitting:
E pydantic_ai.exceptions.UnexpectedModelBehavior: Invalid message, unable to find tool. Called tool names: ['maybe_stop_run_tool']. Expected tool names: ['final_result']
And I can't for the life of me figure out what is happening or why. Could use some help understanding how streaming is supposed to work
Hello, Also if there are any temp workarounds available to accomplish this, would be great. |
Quality Gate failedFailed conditions |
Any news on this at all? |
This will be my top priority after merging #468. |
A funny workaround 😉 Just let the model know that it should call final-result after the terminal tool.
|
Any progress here? |
Would it be better to just provide the |
|
||
messages = [r for r in task_results if not isinstance(r, BaseException)] | ||
stop_runs = [r for r in task_results if isinstance(r, _exceptions.StopAgentRun)] | ||
except _exceptions.StopAgentRun as e: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When will this happen? Shouldn't the line above capture them?
|
||
first_tool_return: _messages.ToolReturn | None = None | ||
for stop_run in stop_runs: | ||
tool_return = _messages.ToolReturn( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would it perhaps be better to have some attribute on ToolReturn
to indicate that it stopped the run, instead of returning first_tool_return
? So this can be inspected later by the user or elsewhere
Just confirming that the way this works is that when a tool stops a run, a And if there are multiple run-stopping tool executions then there will be multiple |
I'm going to pick this up shortly, but will move over to a different PR given that this one is relatively stale. |
@dyllamt does that mean it calls |
With this change, you can have tool calls end the full run early by calling
ctx.stop_run(result)
inside the tool call — this immediately ends tool execution (and type-checkers can even tell that subsequent code is unreachable).I've also added the ability to disable the standard result tools through a new keyword argument, so you can ensure that the only way for the agent to exit is to call a tool that exits by callingI think we should add this in a separate PR that exposes more control over the auto-generated result tools.ctx.stop_run
.I still need to add documentation and examples.
NOTE: I no longer intend for this to resolve #127, as this is a fairly convoluted way to implement a "result tool", and I think we should just add an API for that directly. But @samuelcolvin has noted that there are other possible use cases for this feature, such as providing ways to explicitly interrupt agent execution during tool calls, etc., so I think it still makes sense to merge this.