
feat(fs): add azure blob storage #8415

Open · wants to merge 20 commits into master

Conversation

NadineYasser1

Description


  • Introduced Azure Blob Storage integration
  • Added Azurite installation and usage documentation

Checklist

  • Commit
    • Title follows commit conventions
    • Reference the relevant issue (Fixes #007, See xoa-support#42, See https://...)
    • If bug fix, add Introduced by
  • Changelog
    • If visible by XOA users, add changelog entry
    • Update "Packages to release" in CHANGELOG.unreleased.md
  • PR
    • If UI changes, add screenshots
    • If not finished or not tested, open as Draft

Review process

This two-pass review process aims to:

  • develop skills of junior reviewers
  • limit the workload for senior reviewers
  • limit the number of unnecessary changes by the author
  1. The author creates a PR.
  2. Review process:
    1. The author assigns the junior reviewer.
    2. The junior reviewer conducts their review:
      • Resolves their comments if they are addressed.
      • Adds comments if necessary or approves the PR.
    3. The junior reviewer assigns the senior reviewer.
    4. The senior reviewer conducts their review:
      • If there are no unresolved comments on the PR → merge.
      • Otherwise, we continue with 3.
  3. The author responds to comments and/or makes corrections, and we go back to 2.

Notes:

  1. The author can request a review at any time, even if the PR is still a Draft.
  2. In theory, there should not be more than one reviewer at a time.
  3. The author should not make any changes:
    • When a reviewer is assigned.
    • Between the junior and senior reviews.

@NadineYasser1 requested a review from fbeauchamp on March 6, 2025 at 09:09
this.#container = parts.shift()
this.#dir = join(...parts)
this.#containerClient = this.#blobServiceClient.getContainerClient(this.#container)
this.#createContainer()
Collaborator:

You can't call an async method without an await or a .then(), and you can't call an async method in a constructor. Please move this into _sync().
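
A minimal sketch of that move, assuming the handler overrides the async _sync() hook quoted just below and that #createContainer() wraps the container-creation call (names as in the PR):

async _sync() {
  await super._sync()
  // container creation is async, so it belongs here rather than in the constructor
  await this.#createContainer()
}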

Comment on lines 64 to 65
await super._sync()
await this.#containerClient.createIfNotExists()
Collaborator:

Can you test if this works without retry:

Suggested change:
- await super._sync()
- await this.#containerClient.createIfNotExists()
+ await this.#containerClient.createIfNotExists()
+ await super._sync()

const prefix = path === '/' ? '' : path + '/'
const result = []
for await (const item of this.#containerClient.listBlobsByHierarchy('/', { prefix })) {
const strippedName = item.name.startsWith(`${path}/`) ? item.name.replace(`${path}/`, '') : item.name
Collaborator:

I think item.name will always start with prefix, no?
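
If that holds, the conditional collapses to a plain slice. A sketch, reusing the prefix variable from the snippet above:

// prefix is '' for the root, so slice(prefix.length) is a no-op there
const strippedName = item.name.slice(prefix.length)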

}

const blobClient = this.#containerClient.getBlockBlobClient(file)
const blockCount = Math.ceil(data.length / MAX_BLOCK_SIZE)
Collaborator:

The stream doesn't always have a length property. Please reuse the parameters of s3._outputStream: async _outputStream(path, input, { streamLength, maxStreamLength = streamLength, validator })
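
For reference, a sketch of that signature on this handler, copied from the reviewer's quote of the s3 handler; the body is left to the loop sketched after the next comment:

async _outputStream(path, input, { streamLength, maxStreamLength = streamLength, validator }) {
  // input is a stream: there is no length property to derive a block count from,
  // so the blocks have to be read off the stream one at a time
}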


const start = i * MAX_BLOCK_SIZE
const end = Math.min(start + MAX_BLOCK_SIZE, data.length)
const chunk = data.slice(start, end)
Collaborator:

data is a stream, so you can't use data.slice here. You can use readChunkStrict from the @vates/read-chunk package to get a part of a stream as a buffer.
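
A sketch of that streaming loop, assuming the stageBlock/commitBlockList flow from @azure/storage-blob that the PR appears to use; readChunk is readChunkStrict's sibling from the same package and tolerates a shorter final block:

import { readChunk } from '@vates/read-chunk'

// inside _outputStream(path, input, ...):
const blobClient = this.#containerClient.getBlockBlobClient(path)
const blockIds = []
let blockIndex = 0
let chunk
while ((chunk = await readChunk(input, MAX_BLOCK_SIZE)) !== null) {
  // Azure block ids must be base64 strings of identical length within a blob
  const blockId = Buffer.from(String(blockIndex++).padStart(10, '0')).toString('base64')
  await blobClient.stageBlock(blockId, chunk, chunk.length)
  blockIds.push(blockId)
}
await blobClient.commitBlockList(blockIds)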

@@ -94,6 +111,12 @@ export const format = ({ type, host, path, port, username, password, domain, pro
string = protocol === 'https' ? 's3://' : 's3+http://'
string += `${encodeURIComponent(username)}:${encodeURIComponent(password)}@${host}`
}
if (type === 'azure') {
// used a double slash to separate the path because the password might contain slashes
Collaborator:

Slashes in the password should be encoded by encodeURIComponent.
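
In other words, the azure branch can mirror the s3 branch above. A sketch; the 'azure://' scheme string and the trailing path handling are assumptions, the rest reuses format()'s parameters:

if (type === 'azure') {
  // '/' becomes '%2F' under encodeURIComponent, so an encoded password can
  // never clash with the path separator and the double slash is unnecessary
  string = 'azure://'
  string += `${encodeURIComponent(username)}:${encodeURIComponent(password)}@${host}`
  string += path
}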


// list blobs in container
async _list(path) {
const prefix = path === '/' ? '' : path + '/'
Collaborator:

You can use makePrefix here if you prefer.
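
Sketched, assuming makePrefix is the prefix-normalizing helper the reviewer refers to (one that maps '/' to '' and anything else to path + '/', matching the line it replaces):

async _list(path) {
  const prefix = makePrefix(path)
  // ...rest unchanged
}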

}

async _rmtree(path) {
const iter = this.#containerClient.listBlobsFlat({ prefix: path?.endsWith('/') ? path : `${path}/` })
Collaborator:

You can use makePrefix here too.
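
Same helper here, sketched:

async _rmtree(path) {
  const iter = this.#containerClient.listBlobsFlat({ prefix: makePrefix(path) })
  // ...rest unchanged
}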
