
create-uber-principal fixes and improvements #2941

Merged: 34 commits merged into main from fix/improve-create-uber-principal on Oct 14, 2024

Conversation

@JCZuurmond (Member) commented on Oct 11, 2024

Changes

create-uber-principal fixes and improvements:

  • Add a `set_workspace_warehouse_config_wrapper` to handle what is missing from the Databricks warehouses API (databricks/databricks-sdk-py#305); a hedged sketch of such a wrapper follows below this list.
  • This fixes updating the configuration on the warehouse.
  • Ask for the uber principal name only when one is going to be created.
  • Avoid an unnecessary storage account crawl.
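
For reference, a minimal sketch of what such a wrapper could look like. This is illustrative only, not the exact ucx implementation; the function name, endpoint path, and payload shape are assumptions based on the SDK's warehouses API.

from databricks.sdk import WorkspaceClient


def set_workspace_warehouse_config_wrapper(ws: WorkspaceClient, data_access_config: list[dict]) -> None:
    """Update the workspace-level warehouse config with a direct REST call.

    Sketch only: bypasses `warehouses.set_workspace_warehouse_config()`, which
    has a known issue (databricks/databricks-sdk-py#305), and PUTs the payload
    to the endpoint that SDK method targets.
    """
    ws.api_client.do(
        "PUT",
        "/api/2.0/sql/config/warehouses",
        body={"data_access_config": data_access_config},  # [{"key": ..., "value": ...}]
    )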

Linked issues

Resolves #2764
Resolves #2771
Progresses #2949
Reference databricks/databricks-sdk-py#305

Functionality

  • modified existing command: databricks labs ucx create-uber-principal

Tests

  • manually tested
  • modified unit tests
  • added integration tests

@JCZuurmond added the bug, cloud/azure (issues related to Azure), and feat/cli (CLI commands) labels on Oct 11, 2024
@JCZuurmond self-assigned this on Oct 11, 2024
@@ -13,29 +13,31 @@
     StorageAccount,
 )

-from . import azure_api_client, get_az_api_mapping
+from . import azure_api_client as create_azure_api_client, get_az_api_mapping
@JCZuurmond (Member, Author) commented on Oct 14, 2024:
I needed to update the API client in test_access.py. I was on my way to making it a fixture, but decided to leave the remainder for #2949 as it became too many changes.

@pytest.fixture
def clean_warehouse_config(env_or_skip, az_cli_ctx) -> Generator[None, None, None]:
"""Clean workspace warehouse configuration."""
env_or_skip("IDE_PROJECT_ROOTS") # Only run from editor
@JCZuurmond (Member, Author) commented on Oct 14, 2024:
What do we think about only running this in the editor and not in CI?

  • Pro: The failing test_create_global_spn went unnoticed because it is not run in CI. To be more precise: I cannot run that integration test as I lack the permissions to change Azure resources, so I cannot confirm it fails, but I confirmed the logic fails when running the UCX CLI. This put me on the wrong track because I assumed the test was passing, leading me to think the issue solved by this PR was caused by a context different from our testing environment. Running the integration test more often will make issues visible more quickly.
  • Con: We are changing a global config that will interfere with other tests when they run in parallel, especially now that the test uses dummy resources, which blocks the warehouse from starting for a moment while the config is invalid. (A save/restore sketch for this follows below.)
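
For illustration, a save/restore fixture along these lines could limit that interference. This is a sketch only; it assumes a `ws` WorkspaceClient fixture is available and that the SDK's get/set warehouse-config calls behave, and the fixture name is hypothetical.

import pytest
from collections.abc import Generator
from databricks.sdk import WorkspaceClient


@pytest.fixture
def restore_warehouse_config(ws: WorkspaceClient) -> Generator[None, None, None]:
    # Snapshot the workspace-level warehouse config before the test ...
    original = ws.warehouses.get_workspace_warehouse_config()
    yield
    # ... and put (part of) it back afterwards so parallel tests are not left
    # with the dummy resources this test configures.
    ws.warehouses.set_workspace_warehouse_config(
        data_access_config=original.data_access_config,
        sql_configuration_parameters=original.sql_configuration_parameters,
    )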

@JCZuurmond marked this pull request as ready for review on October 14, 2024 08:37
@JCZuurmond requested a review from a team as a code owner on October 14, 2024 08:37
@JCZuurmond force-pushed the fix/improve-create-uber-principal branch from 4ede2e7 to 5d65f9b on October 14, 2024 08:38
@JCZuurmond requested a review from nfx on October 14, 2024 08:38

✅ 6/6 passed, 14 skipped, 1m13s total

Running from acceptance #6689

@nfx (Collaborator) left a comment:
lgtm

@nfx merged commit 69a0cf8 into main on Oct 14, 2024
6 of 7 checks passed
@nfx deleted the fix/improve-create-uber-principal branch on October 14, 2024 12:47
nfx added a commit that referenced this pull request Oct 14, 2024
* Added `imbalanced-learn` to known list ([#2943](#2943)). A new open-source library, "imbalanced-learn," has been added to the project's known list of libraries, providing various functionalities for handling imbalanced datasets. The addition includes modules such as "imblearn", "imblearn._config", "imblearn._min_dependencies", "imblearn._version", "imblearn.base", and many others, enabling features such as over-sampling, under-sampling, combining sampling techniques, and creating ensembles. This change partially resolves issue [#1931](#1931), which may have been related to the handling of imbalanced datasets, thereby enhancing the project's ability to manage such datasets.
* Added `importlib_resources` to known list ([#2944](#2944)). In this update, we've added the `importlib_resources` package to the known list in the `known.json` file. This package offers a consistent and straightforward interface for accessing resources such as data files and directories in Python packages. It includes several modules, including `importlib_resources`, `importlib_resources._adapters`, `importlib_resources._common`, `importlib_resources._functional`, `importlib_resources._itertools`, `importlib_resources.abc`, `importlib_resources.compat`, `importlib_resources.compat.py38`, `importlib_resources.compat.py39`, `importlib_resources.future`, `importlib_resources.future.adapters`, `importlib_resources.readers`, and `importlib_resources.simple`. These modules provide various functionalities for handling resources within a Python package. By adding this package to the known list, we enable its usage and integration with the project's codebase. This change partially addresses issue [#1931](#1931), improving the management and accessibility of resources within our Python packages.
* Dependency update: ensure we install with at least version 0.9.1 of `databricks-labs-blueprint` ([#2950](#2950)). In the updated `pyproject.toml` file, the version constraint for the `databricks-labs-blueprint` dependency has been revised to range between 0.9.1 and 0.10, specifically targeting 0.9.1 or higher. This modification ensures the incorporation of a fixed upstream issue (databrickslabs/blueprint[#157](#157)), which was integrated in the 0.9.1 release. This adjustment was triggered by a preceding change ([#2920](#2920)) that standardized notebook paths, thereby addressing issue [#2882](#2882), which was dependent on this upstream correction. By embracing this upgrade, users can engage the most recent dependency version, thereby ensuring the remediation of the aforementioned issue.
* Fixed an issue with source table deleted after migration ([#2927](#2927)). In this release, we have addressed an issue where a source table was marked as migrated even after it was deleted following migration. An exception handling mechanism has been added to the `is_migrated` method to return `True` and log a warning message if the source table does not exist, indicating that it has been migrated. A new test function, `test_migration_index_deleted_source`, has also been included to verify the migration index behavior when the source table no longer exists. This function creates a source and destination table, sets the destination table's `upgraded_from` property to the source table, drops the source table, and checks if the migration index contains the source table and if an error message was recorded, indicating that the source table no longer exists. The `get_seen_tables` method remains unchanged in this diff.
* Improve robustness of `sqlglot` failure handling ([#2952](#2952)). This PR improves the robustness of how failures in the `sqlglot` library are handled, specifically targeting issues with inadequate parsing quality. The `collect_table_infos` method has been updated and renamed to `collect_used_tables` to accurately gather information about tables used in a SQL expression. The `lint_expression` and `collect_tables` methods have also been updated to use the new `collect_used_tables` method for better accuracy. Additionally, methods such as `find_all`, `walk_expressions`, and the test suite for the SQL parser have been enhanced to handle potential failures and unsupported SQL syntax more gracefully, by returning empty lists or logging warning messages instead of raising errors. These changes improve the reliability of the SQL linting code when `sqlglot` encounters unexpected input. A minimal sketch of this failure-handling pattern follows after this list.
* Log warnings when mounts are discovered on incorrect cluster type ([#2929](#2929)). The `migrate-tables` command in the ucx project's CLI now includes a verification step to ensure the successful completion of a prerequisite assessment workflow before execution. If this workflow has not been completed, a warning message is logged and the command is not executed. A new exception handling mechanism has been implemented for the `dbutils.fs.mounts()` method, which logs a warning and skips mount point discovery if an exception is raised. A new unit test has been added to verify that a warning is logged when attempting to discover mounts on an incompatible cluster type. The diff also includes a new method `VerifyProgressTracking` for verifying progress tracking and updates to existing test methods to include verification of successful runs and error handling before assessment. These changes improve the handling of edge cases in the mount point discovery process, add warnings for mounts on incorrect cluster types, and increase test coverage with progress tracking verification.
* `create-uber-principal` fixes and improvements ([#2941](#2941)). This change introduces fixes and improvements to the `create-uber-principal` functionality within the `ucx` project, specifically targeting the Azure access module. The main enhancements include addressing an issue with the Databricks warehouses API by adding the `set_workspace_warehouse_config_wrapper` function, modifying the command to request the uber principal name only when necessary, improving storage account crawl logic, and introducing new methods to manage workspace-level configurations. Error handling mechanisms have been fortified through added and modified try-except blocks. Additionally, several unit and integration tests have been implemented and verified to ensure the functionality is correct and running smoothly. These changes improve the overall robustness and versatility of the `create-uber-principal` command, directly addressing issues [#2764](#2764), [#2771](#2771), and progressing on [#2949](#2949).
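
To illustrate the `sqlglot` failure-handling pattern mentioned in the list above, here is a minimal sketch (not the actual ucx code; the helper name and dialect are assumptions):

import logging

import sqlglot
from sqlglot import exp
from sqlglot.errors import ParseError

logger = logging.getLogger(__name__)


def collect_used_tables(sql: str) -> list[str]:
    # Log a warning and return an empty list instead of raising when the
    # statement cannot be parsed.
    try:
        expression = sqlglot.parse_one(sql, read="databricks")
    except ParseError as e:
        logger.warning(f"Failed to parse SQL: {sql}", exc_info=e)
        return []
    return [table.name for table in expression.find_all(exp.Table)]
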
@nfx mentioned this pull request on Oct 14, 2024
nfx added a commit that referenced this pull request Oct 14, 2024