Improve CI speed and resolve issues of run_quantization_check
#1682
I have analyzed the time cost in `run_quantization_check`.

Test command: `CUDA_VISIBLE_DEVICES= KERAS_BACKEND=jax pytest keras_nlp/src/ -k backbone_basics`

The configurations compared:

- `run_quantization_check=False`
- `run_quantization_check=True` without calling `quantize`
- `run_quantization_check=True` with calling `quantize`
- `run_quantization_check=True` with calling `quantize` + saving

Obviously, the bottleneck is the underlying computation when calling `Model.quantize`.

Here, I propose an improvement: pre-configure a `DTypePolicyMap` and use it to instantiate the model, which avoids the quantization-related computation. This should improve the speed of CI.
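The idea can be sketched with a simplified stand-in (plain Python, illustrative only, not the real `keras.dtype_policies.DTypePolicyMap` API): the per-layer quantization policies are decided up front, and each layer looks its policy up at construction time, so the model never builds full-precision weights that then have to be re-quantized.

```python
import re

# Simplified stand-in for a dtype-policy map (illustrative only, not the
# real Keras API): layer paths are matched against stored patterns, with
# a float32 fallback for layers that stay unquantized.
class SimplePolicyMap:
    def __init__(self, default_policy="float32"):
        self.default_policy = default_policy
        self._policies = {}

    def __setitem__(self, path_pattern, policy):
        self._policies[path_pattern] = policy

    def __getitem__(self, layer_path):
        for pattern, policy in self._policies.items():
            if re.fullmatch(pattern, layer_path):
                return policy
        return self.default_policy

# Pre-configure the map once (hypothetical layer paths)...
policy_map = SimplePolicyMap()
policy_map[r".*ffw_.*"] = "int8_from_float32"

# ...then each layer picks up its policy at construction time, instead of
# being built in float32 and converted afterwards by Model.quantize().
print(policy_map["decoder_layer_0/ffw_gate"])  # int8_from_float32
print(policy_map["token_embedding"])           # float32
```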
Some minor bugs have been spotted and resolved too:

- `Backbone.get_config()` should consider that `self.dtype_policy` may already be a `DTypePolicyMap`.
- `self.embeddings_layer_norm` in `BloomBackbone` has been renamed to avoid a duplicated layer name that breaks `DTypePolicyMap`.
- `OPTBackbone.get_config` missed the `super()` call.
- `XLNetBackbone` fails to pass `run_quantization_check`. (Will try to fix it in the future.)

EDITED: The CI is much faster now. (JAX: ~27 mins -> ~18 mins)
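The `get_config` fixes above follow a standard serialization pattern. A minimal plain-Python sketch (hypothetical classes, not the actual keras_nlp ones) of why skipping `super().get_config()` is a bug:

```python
# Hypothetical classes illustrating the missing-super() bug: forgetting
# super().get_config() silently drops the base config keys (such as the
# dtype policy), so the model cannot be rebuilt faithfully from its
# serialized config.
class Base:
    def get_config(self):
        return {"name": "backbone", "dtype": "float32"}

class BrokenChild(Base):
    def get_config(self):
        # Bug: the base keys ("name", "dtype") are lost.
        return {"units": 64}

class FixedChild(Base):
    def get_config(self):
        config = super().get_config()  # keep the base keys
        config.update({"units": 64})
        return config

print(sorted(BrokenChild().get_config()))  # ['units']
print(sorted(FixedChild().get_config()))   # ['dtype', 'name', 'units']
```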