fix: refactor handle_response_model #1032

Merged: 9 commits into main from refactor-process-response on Oct 4, 2024

Conversation

@jxnl (Collaborator) commented on Sep 30, 2024

Important

Refactor handle_response_model in process_response.py into multiple mode-specific functions and update retry.py for optional response_model.

  • Refactoring:
    • Split handle_response_model into multiple mode-specific functions in process_response.py.
    • Introduced prepare_response_model for common response model preparations.
  • Functions Added:
    • handle_parallel_tools, handle_functions, handle_tools_strict, handle_tools, handle_mistral_tools, handle_json_o1, handle_json_modes, handle_anthropic_tools, handle_anthropic_json, handle_cohere_modes, handle_gemini_json, handle_gemini_tools, handle_vertexai_tools, handle_vertexai_json, handle_cohere_json_schema, handle_cohere_tools.
  • Behavior:
    • No change in core functionality, only internal restructuring for clarity and maintainability.
  • Misc:
    • Updated retry_sync and retry_async in retry.py to handle optional response_model.
    • Fixed a typo in parallel.py.

This description was created by Ellipsis for fd150be. It will automatically update as commits are pushed.
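
To illustrate the dispatch pattern described above, here is a minimal, self-contained sketch: one handler per mode, looked up from a mode_handlers dict. The Mode values and handler bodies are simplified stand-ins for illustration only, not instructor's actual implementation.

from __future__ import annotations

from enum import Enum, auto
from typing import Any, Callable


class Mode(Enum):  # stand-in for instructor.Mode
    TOOLS = auto()
    JSON = auto()
    MD_JSON = auto()


def handle_tools(response_model: type, new_kwargs: dict[str, Any]) -> tuple[type, dict[str, Any]]:
    # Attach a tool definition derived from the response model (simplified).
    new_kwargs["tools"] = [
        {"type": "function", "function": {"name": response_model.__name__}}
    ]
    new_kwargs["tool_choice"] = "auto"
    return response_model, new_kwargs


def handle_json_modes(response_model: type, new_kwargs: dict[str, Any], mode: Mode) -> tuple[type, dict[str, Any]]:
    # One handler can serve several JSON-style modes; the mode is passed in.
    new_kwargs["response_format"] = {"type": "json_object"}
    return response_model, new_kwargs


def handle_response_model(response_model: type | None, mode: Mode, **kwargs: Any):
    if response_model is None:
        return None, kwargs
    mode_handlers: dict[Mode, Callable[..., tuple[type, dict[str, Any]]]] = {
        Mode.TOOLS: handle_tools,
        # JSON-style modes share one handler, bound to the mode via a lambda.
        Mode.JSON: lambda rm, kw: handle_json_modes(rm, kw, Mode.JSON),
        Mode.MD_JSON: lambda rm, kw: handle_json_modes(rm, kw, Mode.MD_JSON),
    }
    handler = mode_handlers.get(mode)
    if handler is None:
        raise ValueError(f"Unsupported mode: {mode}")
    return handler(response_model, dict(kwargs))

With this shape, supporting a new provider mode means writing one small handler and registering it in the dict, rather than extending a single large if/elif chain, which matches the PR's stated goal of clarity and maintainability.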

@ellipsis-dev (bot) left a comment

👍 Looks good to me! Reviewed everything up to 67e0cdf in 13 seconds

More details
  • Looked at 817 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 3 drafted comments based on config settings.
1. instructor/process_response.py:497
  • Draft comment:
    The function is_typed_dict is defined twice in the file. Consider removing the duplicate definition to avoid redundancy.
  • Reason this comment was not posted:
    Confidence changes required: 80%
    The function is_typed_dict is defined twice in the file, which is redundant and can lead to confusion. It should be defined only once.
2. instructor/process_response.py:548
  • Draft comment:
    The lambda functions for JSON modes in mode_handlers are unnecessary and add complexity. Consider directly referencing the handle_json_modes function with the mode as an argument instead of using lambdas.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The refactoring of handle_response_model has introduced a potential issue with the mode_handlers dictionary. The lambda functions used for JSON modes are not necessary and add complexity.
3. instructor/process_response.py:528
  • Draft comment:
    Ensure that tests and documentation are updated to reflect changes in handle_response_model. This is crucial for maintaining library integrity.
  • Reason this comment was not posted:
    Comment did not seem useful.
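
As a side note on draft comment 2 above: functools.partial is one common way to bind the mode argument without a lambda per entry. A tiny self-contained illustration with placeholder names (not instructor's code):

from __future__ import annotations

from enum import Enum, auto
from functools import partial
from typing import Any


class Mode(Enum):  # stand-in for instructor.Mode
    JSON = auto()
    MD_JSON = auto()


def handle_json_modes(response_model: type, new_kwargs: dict[str, Any], mode: Mode):
    new_kwargs["response_format"] = {"type": "json_object"}  # simplified
    return response_model, new_kwargs


# partial() pre-binds the mode, so the dict entries stay plain callables.
mode_handlers = {
    Mode.JSON: partial(handle_json_modes, mode=Mode.JSON),
    Mode.MD_JSON: partial(handle_json_modes, mode=Mode.MD_JSON),
}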

Workflow ID: wflow_YaQ3OblEfHhIXOYL




cloudflare-workers-and-pages bot commented Sep 30, 2024

Deploying instructor with Cloudflare Pages

Latest commit: 7315225
Status: ✅  Deploy successful!
Preview URL: https://fab4174a.instructor.pages.dev
Branch Preview URL: https://refactor-process-response.instructor.pages.dev


@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on 34a09d4 in 10 seconds

More details
  • Looked at 27 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 2 drafted comments based on config settings.
1. instructor/process_response.py:531
  • Draft comment:
    The docstring should mention the new functions added for handling different modes, as it currently does not reflect the refactoring changes.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The function handle_response_model has a docstring that is not updated to reflect the refactoring changes. It should mention the new functions added for handling different modes.
2. instructor/process_response.py:528
  • Draft comment:
    The function names in the mode_handlers dictionary should follow a consistent naming pattern. For example, handle_json_o1 and handle_json_modes could be renamed to handle_json_o1_mode and handle_json_mode respectively to maintain consistency with other handler names like handle_parallel_tools. This issue is also present in other handler functions like handle_functions, handle_tools_strict, etc.
  • Reason this comment was not posted:
    Confidence changes required: 80%
    The function handle_response_model has been refactored, but the docstring is overly verbose and could be more concise. Additionally, the function names in the mode_handlers dictionary should follow a consistent naming pattern, which they currently do not.

Workflow ID: wflow_qm7zhjqo0ekpBBgt



@jxnl requested a review from ivanleomk on September 30, 2024 at 03:18
@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on 50a19ac in 14 seconds

More details
  • Looked at 98 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 3 drafted comments based on config settings.
1. instructor/process_response.py:537
  • Draft comment:
    The check for response_model is None is redundant here as it is already handled in handle_response_model. Consider removing it.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The function prepare_response_model has a redundant check for response_model being None at the beginning, which is already handled in the calling function handle_response_model.
2. instructor/process_response.py:40
  • Draft comment:
    The check for response_model is None in prepare_response_model is redundant as it is already handled in handle_response_model. Consider removing it.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The function prepare_response_model has a redundant check for response_model being None at the beginning, which is already handled in the calling function handle_response_model.
3. instructor/process_response.py:40
  • Draft comment:
    When library code changes, ensure that both documentation and tests are updated accordingly. This refactoring of handle_response_model should be reflected in the documentation and tests.
  • Reason this comment was not posted:
    Comment did not seem useful.

Workflow ID: wflow_ZOiOulxi6mRjtJyI



@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on 2b858fa in 9 seconds

More details
  • Looked at 13 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 2 drafted comments based on config settings.
1. instructor/retry.py:157
  • Draft comment:
    The change in the type hint for response_model from type[T_Model] to type[T_Model] | None is appropriate and aligns with the refactoring changes, allowing response_model to be None.
  • Reason this comment was not posted:
    Confidence changes required: 0%
    The change in the type hint for response_model from type[T_Model] to type[T_Model] | None is appropriate given the context of the refactoring. It allows for the possibility of response_model being None, which is a valid scenario in the refactored code.
2. instructor/retry.py:155
  • Draft comment:
    Function names should follow a consistent pattern. Consider renaming retry_sync to match the pattern used in the refactored functions, such as handle_retry_sync. This applies to retry_async as well.
  • Reason this comment was not posted:
    Confidence changes required: 80%
    The function names in the refactored code should follow a consistent pattern. The function retry_sync and retry_async should have consistent naming with the new functions added in the refactor.

Workflow ID: wflow_9eHY4Xs5DqWTwhws



@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on e56a84a in 22 seconds

More details
  • Looked at 37 lines of code in 1 file
  • Skipped 0 files when reviewing.
  • Skipped posting 2 drafted comments based on config settings.
1. instructor/process_response.py:505
  • Draft comment:
    new_kwargs["tools"] is not set in handle_cohere_tools. Ensure tools is set if expected.
  • Reason this comment was not posted:
    Comment did not seem useful.
2. instructor/process_response.py:497
  • Draft comment:
    Consider using a more descriptive variable name instead of new_kwargs to improve readability and maintainability. This applies to other functions as well.
  • Reason this comment was not posted:
    Confidence changes required: 70%
    The variable new_kwargs is used in multiple functions but is not descriptive enough. A more descriptive name would improve readability and maintainability.

Workflow ID: wflow_yg3G3SY76r0DG3O3



@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on 7c78eca in 19 seconds

More details
  • Looked at 73 lines of code in 3 files
  • Skipped 0 files when reviewing.
  • Skipped posting 4 drafted comments based on config settings.
1. tests/llm/test_openai/test_parallel.py:41
  • Draft comment:
    Ensure that the model name gpt-4o-mini is used consistently across all test cases to maintain uniformity in testing conditions.
  • Reason this comment was not posted:
    Confidence changes required: 30%
    The test cases in test_parallel.py have been updated to use a different model name gpt-4o-mini instead of gpt-4-turbo-preview. This change should be consistent across all test cases to ensure they are testing the same conditions.
2. tests/llm/test_openai/test_parallel.py:90
  • Draft comment:
    Consider moving the import statement for AsyncOpenAI to the top of the file with other imports for better readability and efficiency.
  • Reason this comment was not posted:
    Confidence changes required: 30%
    The import statement for AsyncOpenAI in test_async_parallel_tools_one is placed inside the function. It would be more efficient to move it to the top of the file with other imports to avoid repeated imports and improve readability.
3. instructor/process_response.py:594
  • Draft comment:
    Assertions should have a formatted error message. Please update the assertion in handle_parallel_tools. This also applies to other assertions in the newly added functions.
  • Reason this comment was not posted:
    Confidence changes required: 80%
    The function handle_parallel_tools is added in this PR and its assertion lacks a formatted error message.
4. tests/llm/test_openai/test_parallel.py:38
  • Draft comment:
    If library code changes, ensure that documentation is updated. Please check if the documentation reflects the changes made in this test file.
  • Reason this comment was not posted:
    Confidence changes required: 70%
    The test file test_parallel.py has been modified, but there is no indication that the documentation has been updated accordingly.
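
On draft comment 3 above: a formatted assertion message would look roughly like the following sketch; the helper name and arguments are placeholders, not the PR's code.

def assert_stream_disabled(new_kwargs: dict, mode: str) -> None:
    # An f-string message so the offending mode shows up in the AssertionError.
    assert new_kwargs.get("stream", False) is False, (
        f"stream=True is not supported when using {mode} mode"
    )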

Workflow ID: wflow_FW27WwvzXUFdDrs2



@ivanleomk (Collaborator) commented

Gemini Tests are passing


tests/llm/test_gemini/test_format.py ....                                          [  6%]
tests/llm/test_gemini/test_list_content.py .                                       [  7%]
tests/llm/test_gemini/test_modes.py ....                                           [ 14%]
tests/llm/test_gemini/test_multimodal_content.py ..                                [ 17%]
tests/llm/test_gemini/test_patch.py ....                                           [ 23%]
tests/llm/test_gemini/test_retries.py ....                                         [ 30%]
tests/llm/test_gemini/test_roles.py .                                              [ 31%]
tests/llm/test_gemini/test_simple_types.py ...                                     [ 36%]
tests/llm/test_gemini/test_stream.py ......                                        [ 46%]
tests/llm/test_gemini/evals/test_classification_enums.py ..........                [ 61%]
tests/llm/test_gemini/evals/test_classification_literals.py ..........             [ 77%]
tests/llm/test_gemini/evals/test_entities.py ..                                    [ 80%]
tests/llm/test_gemini/evals/test_extract_users.py ......                           [ 90%]
tests/llm/test_gemini/evals/test_sentiment_analysis.py ......                      [100%]

==================================== warnings summary ====================================
.venv/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py:273: 11 warnings
  /Users/ivanleo/Documents/coding/instructor/.venv/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py:273: PydanticDeprecatedSince20: `json_encoders` is deprecated. See https://docs.pydantic.dev/2.8/concepts/serialization/#custom-serializers for alternatives. Deprecated in Pydantic V2.0 to be removed in V3.0. See Pydantic V2 Migration Guide at https://errors.pydantic.dev/2.8/migration/
    warnings.warn(

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
============================ 63 passed, 11 warnings in 59.25s ============================

Vertex AI Tests are also passing

tests/llm/test_vertexai/test_format.py ....          [ 14%]
tests/llm/test_vertexai/test_message_parser.py ....  [ 28%]
tests/llm/test_vertexai/test_modes.py ......         [ 50%]
tests/llm/test_vertexai/test_retries.py ....         [ 64%]
tests/llm/test_vertexai/test_simple_types.py ......  [ 85%]
tests/llm/test_vertexai/test_stream.py ....          [100%]

@ivanleomk (Collaborator) left a comment

Looks good to me

-async def test_async_parallel_tools_one(aclient):
-    client = instructor.patch(aclient, mode=instructor.Mode.PARALLEL_TOOLS)
+async def test_async_parallel_tools_one():
+    from openai import AsyncOpenAI
@ivanleomk (Collaborator) commented on Oct 4, 2024

@jxnl, I added this import here because we were getting an event loop error when using the aclient.
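
For context, a minimal sketch of that workaround: constructing AsyncOpenAI inside the test binds the client to the test's own event loop. The test body below is a placeholder, not the actual test from this PR.

import instructor
from openai import AsyncOpenAI


async def test_async_parallel_tools_one():
    # Creating the client inside the test ties it to the test's own event loop.
    client = instructor.patch(AsyncOpenAI(), mode=instructor.Mode.PARALLEL_TOOLS)
    ...  # issue the parallel-tools request against `client` here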

@@ -154,7 +154,7 @@ def reask_messages(response: ChatCompletion, mode: Mode, exception: Exception):

 def retry_sync(
     func: Callable[T_ParamSpec, T_Retval],
-    response_model: type[T_Model],
+    response_model: type[T_Model] | None,
@ivanleomk (Collaborator) commented on Oct 4, 2024

This makes it consistent with retry_async below @jxnl

async def retry_async(
    func: Callable[T_ParamSpec, T_Retval],
    response_model: type[T] | None,
    context: dict[str, Any] | None,
    args: Any,
    kwargs: Any,
    max_retries: int | AsyncRetrying = 1,
    strict: bool | None = None,
    mode: Mode = Mode.TOOLS,
) -> T:
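
For context on why the signature now allows None: a minimal, illustrative sketch (not instructor's actual retry code) of how a call wrapper can skip validation when no response_model is supplied.

from __future__ import annotations

from typing import Any, Callable, TypeVar

from pydantic import BaseModel

T_Model = TypeVar("T_Model", bound=BaseModel)


def call_once(
    func: Callable[..., Any],
    response_model: type[T_Model] | None,
    **kwargs: Any,
) -> Any:
    response = func(**kwargs)
    if response_model is None:
        # Nothing to validate against, so hand back the raw provider response.
        return response
    # Otherwise validate; real code would first extract the JSON or tool-call payload.
    return response_model.model_validate_json(response)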

Diff context (process_response.py):

        return handle_cohere_modes(new_kwargs)
    return None, new_kwargs

    if mode in {Mode.PARALLEL_TOOLS}:

In the original mode-handling code, we handled parallel tools differently too, so I added it as a special case.

        if is_simple_type(response_model):
            response_model = ModelAdapter[response_model]

        if is_typed_dict(response_model):
            response_model: BaseModel = create_model(
                response_model.__name__,
                **{k: (v, ...) for k, v in response_model.__annotations__.items()},
            )

        # This is a special case for parallel tools
        if mode == Mode.PARALLEL_TOOLS:
            assert (
                new_kwargs.get("stream", False) is False
            ), "stream=True is not supported when using PARALLEL_TOOLS mode"
            new_kwargs["tools"] = handle_parallel_model(response_model)
            new_kwargs["tool_choice"] = "auto"

            # This is a special case for parallel models
            response_model = ParallelModel(typehint=response_model)
            return response_model, new_kwargs

@jxnl

"allowed_function_names": [response_model.__name__],
},
}
new_kwargs["chat_history"] = [

Reverted to using normal JSON mode; I wasn't able to get tool calling working with Cohere and will fix it in a separate PR down the line.

@ellipsis-dev (bot) left a comment

👍 Looks good to me! Incremental review on fd150be in 53 seconds

More details
  • Looked at 902 lines of code in 24 files
  • Skipped 1 file when reviewing.
  • Skipped posting 4 drafted comments based on config settings.
1. pyproject.toml:79
  • Draft comment:
    The pytest-examples dependency version has been updated. Ensure that any related documentation and tests are also updated to reflect this change.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The PR includes changes to the pyproject.toml file, specifically updating the pytest-examples dependency version. This is a library code change, so I need to check if the documentation and tests are updated accordingly.
2. docs/examples/batch_job_oai.md:63
  • Draft comment:
    The example output has been updated. Ensure that any related documentation and tests are also updated to reflect this change.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The PR includes changes to the docs/examples/batch_job_oai.md file, specifically updating the example output. This is a library code change, so I need to check if the documentation and tests are updated accordingly.
3. docs/examples/multi_modal_gemini.md:178
  • Draft comment:
    The example output has been updated. Ensure that any related documentation and tests are also updated to reflect this change.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The PR includes changes to the docs/examples/multi_modal_gemini.md file, specifically updating the example output. This is a library code change, so I need to check if the documentation and tests are updated accordingly.
4. docs/examples/multi_modal_gemini.md:179
  • Draft comment:
    The example output has been updated. Ensure that any related documentation and tests are also updated to reflect this change.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The PR includes changes to the docs/examples/multi_modal_gemini.md file, specifically updating the example output. This is a library code change, so I need to check if the documentation and tests are updated accordingly.

Workflow ID: wflow_FhyHRvCFxpQAMdsG



@jxnl merged commit 1d0ab9f into main on Oct 4, 2024
13 of 15 checks passed
@jxnl deleted the refactor-process-response branch on October 4, 2024 at 18:24