
[TESTS] Make a common entry point for conformance tests #3265

Open
kshpv wants to merge 18 commits into develop
Conversation

@kshpv (Collaborator) commented Feb 10, 2025

Changes

  1. Make a common entry point for all tests inside test/post_training
  2. Move the failing/xfailing test logic into a common part

Reason for changes

  1. Reduce code complexity
  2. Reduce potential number of bugs

Related tickets

Tests

WC run - https://github.com/openvinotoolkit/nncf/actions/runs/13239414168 - passed
PTQ run - job/manual/job/post_training_quantization/608/ - in progress
WC CI - job/manual/job/post_training_weight_compression/317 - passed

@github-actions bot added the NNCF PTQ label Feb 10, 2025
@github-actions bot added the documentation label Feb 10, 2025
@kshpv kshpv changed the title from "[TESTS] Make more common logic in conformance" to "[TESTS] Make a common entry point for conformance tests" Feb 10, 2025
@kshpv kshpv marked this pull request as ready for review February 10, 2025 18:19
@kshpv kshpv requested a review from a team as a code owner February 10, 2025 18:19
@@ -293,50 +390,49 @@ def test_ptq_quantization(
capsys: pytest.CaptureFixture,
extra_columns: bool,
memory_monitor: bool,
use_avx2: Optional[bool] = None,
Collaborator

use_avx2 is used only as a fixture to set an environment variable; there is no need to pass it to the pipeline.
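
A minimal sketch of the suggested approach, assuming only an environment variable needs setting; the option name, variable name, and fixture body are illustrative, not the PR's actual code:

```python
import pytest


@pytest.fixture
def use_avx2(request, monkeypatch):
    # Read a hypothetical CLI flag; when set, expose it only via an env variable
    # so the pipeline object never needs to receive the flag directly.
    enabled = request.config.getoption("--use-avx2", default=None)
    if enabled is not None:
        monkeypatch.setenv("USE_AVX2", "1" if enabled else "0")  # assumed variable name
    return enabled
```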

@@ -18,7 +18,7 @@ hf/hf-internal-testing/tiny-random-GPTNeoXForCausalLM_statefull_backend_OPTIMUM:
metric_value: null
hf/hf-internal-testing/tiny-random-GPTNeoXForCausalLM_stateless_backend_OPTIMUM:
metric_value: null
xfail_reason: "Issue-161969"
exception_xfail_reason: "Issue-161969"
Collaborator

For an exception we need to know the exception type and message, so the xfail is applied only for the expected error, e.g.:

exception_xfail_reason:
  msg: "Issue-123"
  class: RuntimeError
  errmsg: "some error"
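
A rough sketch of how such an entry could be applied, xfailing only when both the exception class and the message match; the keys mirror the suggestion above, while the helper itself is an assumption rather than code from this PR:

```python
import pytest


def maybe_xfail(exc: Exception, entry: dict) -> None:
    # Compare the raised exception against the reference entry; unexpected
    # errors are re-raised so they still fail the test.
    expected_class = entry.get("class", "Exception")
    expected_msg = entry.get("errmsg", "")
    if type(exc).__name__ == expected_class and expected_msg in str(exc):
        # pytest.xfail() raises internally, so nothing after it executes.
        pytest.xfail(reason=entry.get("msg", "expected failure"))
    raise exc
```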

@@ -174,6 +178,9 @@ def fixture_ptq_report_data(output_dir, run_benchmark_app, pytestconfig):
if not run_benchmark_app:
df = df.drop(columns=["FPS"])

df = df.drop(columns=["Num sparse activations"])
Collaborator

Looks like report_data could use one shared function to generate the CSV, something like save_results(df, dropped_columns).
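
A sketch of what such a helper might look like; the save_results name comes from the comment above, while the output_dir parameter and the body are assumptions:

```python
from pathlib import Path

import pandas as pd


def save_results(df: pd.DataFrame, output_dir: Path, dropped_columns: list) -> None:
    # Drop only the columns that are actually present, so any pipeline can reuse it.
    df = df.drop(columns=[c for c in dropped_columns if c in df.columns])
    output_dir.mkdir(parents=True, exist_ok=True)
    df.to_csv(output_dir / "results.csv", index=False)
```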

num_fq_nodes: int = 0
num_int8: int = 0
num_int4: int = 0
num_sparse_activations: int = 0
Collaborator

Does it have to be defined in base.py? I believe a test pipeline should define its own fields here.

Collaborator Author

I think so, it keeps the code common. If a test does not need some field, that's fine; only the values in the reference data matter.
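
For illustration, the shared-fields idea amounts to something like the sketch below; the field set matches the diff above, and the zero defaults mean a pipeline that never touches a counter simply reports 0:

```python
from dataclasses import dataclass


@dataclass
class NumCompressNodes:
    # Counters shared by all conformance pipelines; unused ones stay at 0.
    num_fq_nodes: int = 0
    num_int8: int = 0
    num_int4: int = 0
    num_sparse_activations: int = 0
```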

@@ -196,6 +197,7 @@ def get_result_dict(self):
"Num FQ": self.num_compress_nodes.num_fq_nodes,
"Num int4": self.num_compress_nodes.num_int4,
"Num int8": self.num_compress_nodes.num_int8,
"Num sparse activations": self.num_compress_nodes.num_sparse_activations,
Collaborator

Same comment as for NumCompressNodes.num_sparse_activations

Comment on lines 171 to 172
columns_to_drop = ["Num sparse activations", "Num int4"]
yield from create_fixture_report_data(output_dir, run_benchmark_app, pytestconfig, columns_to_drop)
Collaborator

It seems that if we add another conformance job similar to the activation sparsity one, we will need to update the PTQ and WC report data fixtures to omit the newly added fields. Am I getting this right? If so, I believe this should not be the case; this logic should be independent of other possible conformance pipelines.

Collaborator Author

You are right. But I do not see any problem with omitting some columns (it is already done on develop in the same manner), and it is much easier to implement this way than to maintain several different classes with different logic.
What do you think?
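
A self-contained sketch of the shared generator that the per-pipeline fixtures delegate to; the name matches the diff above, but the body and the results.csv file name are assumptions:

```python
from pathlib import Path

import pandas as pd


def create_fixture_report_data(output_dir, run_benchmark_app, pytestconfig, columns_to_drop):
    # pytestconfig is kept to mirror the call site in the diff; unused in this sketch.
    rows = []
    # Tests append their result rows here while the fixture is alive.
    yield rows
    df = pd.DataFrame(rows)
    if not run_benchmark_app:
        columns_to_drop = [*columns_to_drop, "FPS"]
    df = df.drop(columns=[c for c in columns_to_drop if c in df.columns])
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    df.to_csv(out / "results.csv", index=False)
```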
