[LoRA] Adds support for bias in LoRA #5733
Conversation
Could we add an argument to the engine, enable_lora_bias, and avoid initializing the bias tensors if it's false? If the user knows none of their LoRAs will have bias, we can save memory.
@Yard1 Thanks for reviewing the PR. I have added the enable_lora_bias flag (default set to false), which prevents the allocation of LoRA bias tensors when false.
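For reference, a minimal sketch of how the flag might be passed at the engine level, assuming the keyword is named enable_lora_bias as discussed above (the model name is only a placeholder):

from vllm import EngineArgs

# Sketch only: enable_lora_bias is the argument discussed above; it is assumed
# to default to False so no bias tensors are allocated unless requested.
engine_args = EngineArgs(
    model="meta-llama/Llama-2-7b-hf",   # placeholder model
    enable_lora=True,
    enable_lora_bias=True,              # opt in to LoRA adapters that carry bias
)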
Related: #5930
Looks good, can we also add an e2e test?
@Yard1 Thanks for reviewing. I've added an e2e test for the lora_bias support.
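For context, an e2e-style test along these lines might look roughly like the sketch below; the model name and adapter path are placeholders, and the enable_lora_bias keyword is assumed from the discussion above, so this is not the PR's actual test code:

from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

def test_lora_bias_generates():
    # Placeholder model and adapter path; the adapter is assumed to contain bias tensors.
    llm = LLM(model="meta-llama/Llama-2-7b-hf",
              enable_lora=True,
              enable_lora_bias=True)
    outputs = llm.generate(
        ["Hello, my name is"],
        SamplingParams(max_tokens=16),
        lora_request=LoRARequest("bias-adapter", 1, "/path/to/bias-adapter"))
    assert outputs and outputs[0].outputs[0].text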
@followumesh you need to run …
@njhill I have addressed your comments above. Can you please review this again? Thanks.
Thanks @followumesh and sorry for the delay.
There's one remaining small but important thing to fix (and tests are failing because of it).
vllm/lora/models.py (outdated)
if not self.lora_config.bias_enabled:
    module_lora.bias = None
    raise ValueError(
        f"Adapter bias cannot be used for {module_name}"
        " without --enable-lora-bias.")
This doesn't look right and is causing blanket lora failures. I think it should be:
Suggested change:

- if not self.lora_config.bias_enabled:
-     module_lora.bias = None
-     raise ValueError(
-         f"Adapter bias cannot be used for {module_name}"
-         " without --enable-lora-bias.")
+ if module_lora.bias is not None and not self.lora_config.bias_enabled:
+     raise ValueError(
+         f"Adapter bias cannot be used for {module_name}"
+         " without --enable-lora-bias.")
Incorporated the comment.
):
    self.reset_lora(index)

    if self.tp_size > 1:
        lora_a = self.slice_lora_a(lora_a)
        lora_b = self.slice_lora_b(lora_b)
        if bias is not None:
            bias = self.slice_bias(bias)
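As a rough illustration of what slicing the bias for tensor parallelism involves, here is a sketch under the assumption that the bias is split along the output dimension, mirroring how lora_b is sharded; the function signature is hypothetical, not the PR's exact code:

import torch

def slice_bias(bias: torch.Tensor, tp_rank: int, shard_size: int) -> torch.Tensor:
    # Keep only this tensor-parallel rank's shard of the LoRA bias.
    start = tp_rank * shard_size
    return bias[start:start + shard_size]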
Hmm OK fair enough, I guess the typing errors are preexisting.
Suggesting a small change here to cover the case where bias is a tensor rather than a list (from a typing point of view, the lora_module could be a LoRALayerWeights rather than a PackedLoRALayerWeights ... not sure whether that will ever be the case in practice, but no harm in having the check here cover it). Also suggesting a small change to the comment.
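To illustrate the typing point, a hedged sketch of a check that covers both shapes of the field (the helper name and signature are hypothetical, not the PR's code):

import torch
from typing import Optional, Sequence, Union

def adapter_has_bias(
        bias: Union[torch.Tensor, Sequence[Optional[torch.Tensor]], None]) -> bool:
    # A PackedLoRALayerWeights-style module carries a list of per-sublayer biases,
    # while a plain LoRALayerWeights may carry a single tensor (or None).
    if bias is None:
        return False
    if isinstance(bias, (list, tuple)):
        return any(b is not None for b in bias)
    return True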
Thanks @followumesh!
@followumesh there are a few failures in the existing LoRA tests which look related.
@njhill All LoRA tests are successful now.
Thanks for completing this feature. I have two questions about this feature: …
Motivation
PEFT (https://github.com/foundation-model-stack/fms-hf-tuning) includes support for tuning LoRA bias. This PR enables bias for LoRA, so adapters with bias will work with vLLM.
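As an illustration of how such adapters can come about (one possible setup, not necessarily what fms-hf-tuning does internally), PEFT's LoraConfig can be asked to train and save bias terms alongside the LoRA matrices:

from peft import LoraConfig

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],   # example target modules
    bias="lora_only",                      # also train/save bias for the adapted modules
)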
Changes Included