-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix broken sanity tests + Add skip wait option in test teardown in model serving #1157
Fix broken sanity tests + Add skip wait option in test teardown in model serving #1157
Conversation
|
||
Verify User Can Autoscale Using Concurrency | ||
[Documentation] Checks if model successfully scale up based on concurrency metrics (KPA) | ||
[Tags] Sanity Tier1 ODS-2377 | ||
[Setup] Set Project And Runtime namespace=autoscale-con | ||
${test_namespace}= Set Variable autoscale-con | ||
${flan_model_name}= Set Variable flan-t5-small-caikit | ||
${model_name}= Create List ${flan_model_name} | ||
${models_names}= Create List ${flan_model_name} |
Check notice
Code scanning / Robocop
{{ create_keyword }} can be replaced with VAR Note test
@@ -292,6 +292,7 @@ | |||
[Documentation] Group together the test steps for preparing, deploying | |||
... and querying a model | |||
[Arguments] ${model_storage_uri} ${model_name} ${isvc_name}=${model_name} | |||
... ${runtime}=caikit-tgis-runtime ${protocol}=grpc ${inference_type}=all-tokens |
Check notice
Code scanning / Robocop
There is too many arguments per continuation line ({{ arguments_count }} / {{ max_arguments_count }}) Note test
PR validation:
Dry-run failures are not due to changes in this PR '* partial failure to investigate:
|
3f4de28
to
dec4caa
Compare
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Fixed
Show resolved
Hide resolved
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Fixed
Show resolved
Hide resolved
16373e9
to
c50f4f6
Compare
@@ -580,5 +579,9 @@ Clean Up Test Project | |||
... namespace=${test_ns} | |||
${rc} ${out}= Run And Return Rc And Output oc delete project ${test_ns} | |||
Should Be Equal As Integers ${rc} ${0} | |||
${rc} ${out}= Run And Return Rc And Output oc wait --for=delete namespace ${test_ns} --timeout=300s | |||
Should Be Equal As Integers ${rc} ${0} | |||
IF ${wait_prj_deletion} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There are a lot of keywords deleting projects, for example Delete Data Science Project From CLI
in ods_ci/tests/Resources/Page/ODH/ODHDashboard/ODHDataScienceProject/Projects.resource
Could you consider enhancing the existing one for your purposes in another PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I thought about that while reviewing the code. I think the main difference is that the first one you mention handle DS Projects, while in this case we're handling basic OCP projects.
I didn't apply enhancements for now, need to thinki a bit more about how to implement it, but I agree
...ci/tests/Tests/400__ods_dashboard/420__model_serving/423__model_serving_customruntimes.robot
Fixed
Show resolved
Hide resolved
...ci/tests/Tests/400__ods_dashboard/420__model_serving/423__model_serving_customruntimes.robot
Fixed
Show resolved
Hide resolved
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Fixed
Show fixed
Hide fixed
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Fixed
Show resolved
Hide resolved
@@ -276,6 +274,24 @@ | |||
Fail msg=comparison between expected and actual failed, ${list} | |||
END | |||
|
|||
Verify Model Inference With Retries |
Check warning
Code scanning / Robocop
Keyword '{{ keyword_name }}' has too many arguments ({{ arguments_count }}/{{ max_allowed_count }}) Warning test
... timing: model not ready to reply yet, despite the pod is up and running and the | ||
... endpoint exposed. | ||
... This is a temporary mitigation meanwhile we find a better way to check the model | ||
[Arguments] ${model_name} ${inference_input} ${expected_inference_output} |
Check notice
Code scanning / Robocop
There is too many arguments per continuation line ({{ arguments_count }} / {{ max_arguments_count }}) Note test
Robot Results
|
... endpoint exposed. | ||
... This is a temporary mitigation meanwhile we find a better way to check the model | ||
[Arguments] ${model_name} ${inference_input} ${expected_inference_output} | ||
... ${token_auth}=${FALSE} ${project_title}=${NONE} ${retries}=${5} |
Check notice
Code scanning / Robocop
There is too many arguments per continuation line ({{ arguments_count }} / {{ max_arguments_count }}) Note test
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/424_model_serving_bias_metrics.robot
Show resolved
Hide resolved
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, there are some small things to be fixed. Otherwise LGTM from my restricted knowledge point of view.
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/420__model_serving.robot
Fixed
Show fixed
Hide fixed
ods_ci/tests/Tests/400__ods_dashboard/420__model_serving/LLMs/422__model_serving_llm.robot
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One more comment, please let me know if you want to touch it (note it needs a change few lines above too). Otherwise I'll approve.
|
||
Verify User Can Validate Scale To Zero | ||
[Documentation] Checks if model successfully scale down to 0 if there's no traffic | ||
[Tags] Sanity Tier1 ODS-2379 | ||
[Setup] Set Project And Runtime namespace=autoscale-zero | ||
${flan_model_name}= Set Variable flan-t5-small-caikit | ||
${model_name}= Create List ${flan_model_name} | ||
${models_names}= Create List ${flan_model_name} |
Check notice
Code scanning / Robocop
{{ create_keyword }} can be replaced with VAR Note test
|
Fixing:
In addition, the PR is adding the option to skip waiting for project to be deleted in test teardown. The reason is that the project deletion takes very much time in model serving test (OCP takes from 3 to 6 minutes to delete it)