feat(3/n): agent resume_turn #1194
Conversation
    request.session_id, request.turn_id
)
tool_execution_step = ToolExecutionStep(
    step_id=(in_progress_tool_call_step.step_id if in_progress_tool_call_step else str(uuid.uuid4())),
shouldn't you error out if there is no step found?
Agree it's a bit confusing (I had to play around with the react_agent app). Let me add a comment here.
We do not error out here because, in the case of ReActAgent (with a custom tool parser), the server does not output a tool_execution step_start event, so there is no in-progress step to find. However, we should still allow the turn to be resumed with the ToolCallResponse in this case, because the flow is: server outputs a message (no ToolCall) --> parser parses it into a ToolCall --> client executes the ToolCall --> client resumes the turn.
@yanxi0830 hm yeah this is confusing and broke my mental model.
in that case, the server wouldn't have sent a step_start event either
yeah, in this case (of custom tool parsers) the server wouldn't have sent a step_start event, but we would still like to log the ToolExecutionStep in resume, as a client tool call did indeed happen.
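For concreteness, here is a minimal sketch of the client-side flow being described. The parser interface, the `execute_tool_call` helper, and the resume call shape are assumptions for illustration, not the actual llama-stack-client API:

```python
# Sketch of the ReActAgent resume flow with a custom tool parser.
# The server streams a plain assistant message (no ToolCall, hence no
# tool_execution step_start); the client-side parser recovers the tool
# call, executes it locally, and resumes the same turn.

def handle_turn(agent, session_id, turn):
    # Custom parser interface (assumed): extract tool calls from the
    # assistant message the server returned.
    tool_calls = agent.tool_parser.get_tool_calls(turn.output_message)
    if not tool_calls:
        return turn  # no client tool call needed; the turn is complete

    # Execute each tool call on the client side (assumed helper).
    tool_responses = [agent.execute_tool_call(tc) for tc in tool_calls]

    # Resume the same turn_id with the responses; the server logs a
    # ToolExecutionStep even though it never emitted a step_start for it.
    return agent.client.agents.turn.resume(
        agent_id=agent.agent_id,
        session_id=session_id,
        turn_id=turn.turn_id,
        tool_responses=tool_responses,
    )
```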
llama_stack/providers/inline/agents/meta_reference/agent_instance.py (outdated; resolved)
# get the steps from the turn id
steps = []
if len(turns) > 0:
shouldn't this be true always?
yes, but just in case the kvstore gets destroyed.
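For illustration, the defensive read looks roughly like the method below; the `get_session_turns` helper name and return shape are assumptions:

```python
async def _get_turn_steps(self, session_id: str, turn_id: str) -> list:
    # Load prior turns from the kvstore; the emptiness guard covers the
    # unlikely case where the kvstore was wiped between the original
    # turn and the resume request.
    turns = await self.storage.get_session_turns(session_id)
    steps = []
    if len(turns) > 0:
        current_turn = next((t for t in turns if t.turn_id == turn_id), None)
        if current_turn is not None:
            steps = current_turn.steps
    return steps
```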
    tool_responses=[],
    started_at=datetime.now(),
    ),
)
I think we also need to save the n_iter of inference so that we respect self.agent_config.max_infer_iters, which, by the way, we don't currently respect when a custom tool is used.
meta-llama/llama-stack-client-python#158: max_infer_iters is still used here to track the number of times we call resume.
We should make it so that the total number of inference calls doesn't exceed max_infer_iters. Currently we could have max_infer_iters^2 inference calls in the worst case (each resume could itself run up to max_infer_iters inference calls).
Ah yeah, this will be max_infer_iters^2. I think that's also the current behaviour if you create a second turn instead of resuming.
Will need to think a bit about how we can keep track of the same max_infer_iters across the client & server boundary in a follow-up. E.g., we'd need to introduce extra params to communicate the number of iters between the two, and whether this max_infer_iters should be enforced on the client side.
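To make the worst case concrete: with max_infer_iters = 5, the initial turn may run up to 5 inference calls, and each of up to 5 resumes may run 5 more, i.e. up to 25 in total. One hypothetical way to share a single budget across the boundary is sketched below; the `n_iters_consumed` request field is invented for illustration and is not part of the current API:

```python
# Hypothetical: the client echoes back how many inference iterations the
# turn has already consumed, so the server enforces one shared budget
# instead of resetting it on every resume.
async def resume_turn(self, request):
    n_used = getattr(request, "n_iters_consumed", 0)  # invented field
    remaining = self.agent_config.max_infer_iters - n_used
    if remaining <= 0:
        raise ValueError("max_infer_iters exhausted for this turn")
    # Run with the reduced budget rather than the full max_infer_iters.
    async for chunk in self.run(request, max_iters=remaining):
        yield chunk
```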
Sounds good, thanks for following up on this!
Looks good, but I left a couple of comments (they could be follow-ups).
llama_stack/providers/inline/agents/meta_reference/agent_instance.py (outdated; resolved)
)

output_message = None
async for chunk in self.run(
This logic seems to overlap significantly with the impl of create_and_execute_turn, right?
Maybe a follow-up, but it would be good to consolidate the code so that the logic of running a turn lives in one place.
Yes, doing that for 0.1.5 (so that the existing create_and_execute_turn is fully verified and we can deprecate the allow_resume_turn flag): #1212
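For reference, a rough shape of that consolidation could look like the sketch below; the helper names `_run_turn` and `_load_turn_context` are illustrative, not the actual implementation:

```python
# Both entry points delegate to a single helper that owns the
# turn-running logic.
async def create_and_execute_turn(self, request):
    async for chunk in self._run_turn(request):
        yield chunk

async def resume_turn(self, request):
    # Reconstruct the in-progress turn from persisted steps, then hand
    # off to the same shared logic.
    await self._load_turn_context(request.session_id, request.turn_id)
    async for chunk in self._run_turn(request):
        yield chunk
```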
# What does this PR do?
Client changes:
- #157
- Server change: meta-llama/llama-stack#1194

## Test Plan
- See meta-llama/llama-stack#1194
- llama-stack-apps
- Output Before: we have 2 turn_id with 2 turns.
- Output After: we have 1 turn_id; the final turn has all 3 steps.



Telemetry:

