feat(Accelerator): Introduce AI runtime configuration scheme #514

nikos-livathinos · 2024-12-04T16:53:29Z

feat(Accelerator): Introduce options to control the number of threads and device from API, envvars, CLI.

Introduce the AcceleratorOptions, AcceleratorDevice and use them to set the device where the models run.
Introduce the accelerator_utils with function to decide the device and resolve the AUTO setting.
Refactor the way how the docling-ibm-models are called to match the new signature of models constructor.
Translate the accelerator options to the specific inputs for third-party models.
Extend the docling CLI with parameters to set the num_threads and device.
Add new unit tests.
Write new example how to use the accelerator options.

This PR implements the points from the Discussion #373

Issues addressed:
#308
#133

Checklist:

Documentation has been updated, if necessary.
Examples have been added, if necessary.
Tests have been added, if necessary.

…evice from API, envvars, CLI. - Introduce the AcceleratorOptions, AcceleratorDevice and use them to set the device where the models run. - Introduce the accelerator_utils with function to decide the device and resolve the AUTO setting. - Refactor the way how the docling-ibm-models are called to match the new init signature of models. - Translate the accelerator options to the specific inputs for third-party models. - Extend the docling CLI with parameters to set the num_threads and device. - Add new unit tests. - Write new example how to use the accelerator options.

Signed-off-by: Christoph Auer <[email protected]>

dolfim-ibm · 2024-12-06T11:23:01Z

docling/datamodel/pipeline_options.py

+        input_num_threads = data.get("num_threads")
+
+        # Check if to set the num_threads from the alternative envvar
+        if input_num_threads is None:
+            docling_num_threads = os.getenv("DOCLING_NUM_THREADS")
+            omp_num_threads = os.getenv("OMP_NUM_THREADS")
+            if docling_num_threads is None and omp_num_threads is not None:
+                try:
+                    data["num_threads"] = int(omp_num_threads)
+                except ValueError:
+                    _log.error(
+                        "Ignoring misformatted envvar OMP_NUM_THREADS '%s'",
+                        omp_num_threads,
+                    )


This should be done in the field validator not the __init__, which should already have the standard ENV resolved.

Please check my comments. Pydantic raises an exception for an "extra" field if you let it do the standard resolution (under the conditions I describe in my comments). Therefore we cannot use a field validator and any custom code must run before the resolution and have access to the full model input.
As an alternative approach I can implement the same logic inside a model validator.

Assuming this is really needed, it cannot be in the __init__, because now this is breaking all the argument resolution and validation (see example IDE).

Let's try the model validator.

dolfim-ibm · 2024-12-06T11:24:55Z

docling/datamodel/pipeline_options.py

@@ -78,7 +132,16 @@ class EasyOcrOptions(OcrOptions):

    kind: Literal["easyocr"] = "easyocr"
    lang: List[str] = ["fr", "de", "es", "en"]
-    use_gpu: bool = True  # same default as easyocr.Reader
+    use_gpu: bool = Field(


here we should use the newer Annotated[] way of defining the fields

dolfim-ibm · 2024-12-06T11:26:08Z

docling/models/easyocr_model.py

 from docling.datamodel.settings import settings
 from docling.models.base_ocr_model import BaseOcrModel
+from docling.utils import accelerator_utils as au


we are using only decide_device, let's import that function instead of using aliases for our own modules

docling/models/rapid_ocr_model.py

dolfim-ibm · 2024-12-06T11:28:54Z

docling/pipeline/standard_pdf_pipeline.py

        download_path = snapshot_download(
            repo_id="ds4sd/docling-models",
            force_download=force,
            local_dir=local_dir,
-            revision="v2.0.1",
+            revision="refs/pr/2",


TODO: update with tag before merge

dolfim-ibm · 2024-12-06T11:33:52Z

tests/test_options.py

+
+    # Use envvars (regular + alternative) and default values
+    os.environ["OMP_NUM_THREADS"] = "1"
+    ao.__init__()


do not test the __init__() we should check the complete object initialization. (also see comment above, we should not have a custom __init__)

It is not a check for the __init__() but an in-place reloading that allows to re-evaluate the new values of the envvars.

Signed-off-by: Nikos Livathinos <[email protected]>

…structure for fast/accurate model Signed-off-by: Nikos Livathinos <[email protected]>

Signed-off-by: Christoph Auer <[email protected]>

Signed-off-by: Nikos Livathinos <[email protected]>

cau-git · 2024-12-10T14:53:35Z

Merging it to release_v3 for further refinement. We will take care of the tests there.

Signed-off-by: Nikos Livathinos <[email protected]>

feat(Accelerator): Introduce AI runtime configuration scheme Signed-off-by: Christoph Auer <[email protected]>

nikos-livathinos requested review from cau-git and dolfim-ibm December 4, 2024 16:53

nikos-livathinos self-assigned this Dec 4, 2024

nikos-livathinos marked this pull request as draft December 5, 2024 08:44

Rebase from release_v3

6f0b912

Signed-off-by: Christoph Auer <[email protected]>

dolfim-ibm requested changes Dec 6, 2024

View reviewed changes

nikos-livathinos and others added 5 commits December 6, 2024 14:56

fix: Improve the pydantic objects in the pipeline_options and imports.

975fe07

Signed-off-by: Nikos Livathinos <[email protected]>

fix: TableStructureModel: Refactor the artifacts path to use the new …

5d5d14d

…structure for fast/accurate model Signed-off-by: Nikos Livathinos <[email protected]>

Updated test ground-truth

03f8690

Signed-off-by: Christoph Auer <[email protected]>

Updated test ground-truth (again), bugfix for empty layout

46ae215

Signed-off-by: Christoph Auer <[email protected]>

Merge branch 'release_v3' into nli/performance

bb1774d

cau-git marked this pull request as ready for review December 10, 2024 14:34

fix: Do proper check to set the device in EasyOCR, RapidOCR.

accb7b4

Signed-off-by: Nikos Livathinos <[email protected]>

nikos-livathinos added 2 commits December 10, 2024 15:07

fix: Correct the way to set GPU for EasyOCR, RapidOCR

94caee3

Signed-off-by: Nikos Livathinos <[email protected]>

fix: Ocr AccleratorDevice

f46fd9c

Signed-off-by: Nikos Livathinos <[email protected]>

cau-git merged commit e282bfd into release_v3 Dec 10, 2024
3 of 9 checks passed

cau-git deleted the nli/performance branch December 10, 2024 15:26

cau-git restored the nli/performance branch December 13, 2024 11:41

cau-git deleted the nli/performance branch December 16, 2024 09:31

cau-git added a commit that referenced this pull request Dec 17, 2024

Merge pull request #514 from DS4SD/nli/performance

184eed4

feat(Accelerator): Introduce AI runtime configuration scheme Signed-off-by: Christoph Auer <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(Accelerator): Introduce AI runtime configuration scheme #514

feat(Accelerator): Introduce AI runtime configuration scheme #514

nikos-livathinos commented Dec 4, 2024

dolfim-ibm Dec 6, 2024

nikos-livathinos Dec 6, 2024

dolfim-ibm Dec 6, 2024

dolfim-ibm Dec 6, 2024

dolfim-ibm Dec 6, 2024

dolfim-ibm Dec 6, 2024

dolfim-ibm Dec 6, 2024

nikos-livathinos Dec 6, 2024

cau-git commented Dec 10, 2024

feat(Accelerator): Introduce AI runtime configuration scheme #514

feat(Accelerator): Introduce AI runtime configuration scheme #514

Conversation

nikos-livathinos commented Dec 4, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cau-git commented Dec 10, 2024