Implement LoRA patching via the Loader #211

LLukas22 · 2023-05-11T12:32:16Z

This pull request adds the ability to apply a LoRA adapter while loading the model. The adapter can be defined via the optional lora_adapter field in ModelParameters.

The patches are then lazily applied while the tensors are loaded via the MmapCompatibleLoader.

This currently produces garbled outputs. Probably because the weights aren't copied correctly from the patched_tensor to the base_tensor.

Currently the weights are moved by setting the tensor data directly, this probably doesn't work as i expected it to.

unsafe {
    tensor.set_data(target_tensor.data());
}

To use a LoRA adapter via the cli simply add a --lora-path flag.

Example: llama infer --model-path "path\to\model" --lora-path "path\to\adapter" -p "The meaning of life is"

LLukas22 · 2023-05-11T16:16:54Z

Alright the output is now correct (at least not gibberish anymore). I just moved the memory regions correctly and its working now.

philpax · 2023-05-12T00:58:42Z

Will look at soon! Can you mention what you tested with, and make Clippy happy?

LLukas22 · 2023-05-12T07:40:07Z

Sure i used this Alpaca adapter and this LLama base model.

LLukas22 · 2023-05-12T08:01:22Z

Maybe the vicuna-chat should be removed from the examples 🤔

philpax · 2023-05-12T08:07:24Z

That'd be a question for @danforbes 🤔 What would be your reasoning for removing it?

LLukas22 · 2023-05-12T08:10:00Z

Its fine, just had a brain-lag. Thought i was another inference example but its an actual chat implementation.😅

LLukas22 added 6 commits May 10, 2023 16:59

Made loader compatible with ggla format

4cad34f

Added lora_adapter and scaling factor

56035f9

Extracted tensor loading

60f315a

Finished LoRA patching

83ecb00

Removed example

85d4548

Fixed garbled output

1c710aa

Formatting

7b459ee

LLukas22 marked this pull request as ready for review May 11, 2023 16:20

LLukas22 added 3 commits May 12, 2023 09:51

Give clippy a cookie

d843e7e

Merge branch 'rustformers:main' into feat/lora

e6fb66b

Update vicuna-chat.rs

da3e390

philpax added 4 commits May 14, 2023 03:33

Merge branch 'main' into feat/lora

c8a2471

refactor: cleanup

77834c6

refactor: more cleanup

1e13a7f

ra -> RA

ec6e855

philpax merged commit f523cbf into rustformers:main May 14, 2023

philpax mentioned this pull request May 14, 2023

Support for LoRA adapters? #196

Closed

LLukas22 deleted the feat/lora branch May 16, 2023 13:28

hhamud mentioned this pull request Aug 7, 2023

Write a 0.2 changelog #244

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement LoRA patching via the Loader #211

Implement LoRA patching via the Loader #211

LLukas22 commented May 11, 2023

LLukas22 commented May 11, 2023

philpax commented May 12, 2023

LLukas22 commented May 12, 2023

LLukas22 commented May 12, 2023

philpax commented May 12, 2023

LLukas22 commented May 12, 2023

Implement LoRA patching via the Loader #211

Implement LoRA patching via the Loader #211

Conversation

LLukas22 commented May 11, 2023

LLukas22 commented May 11, 2023

philpax commented May 12, 2023

LLukas22 commented May 12, 2023

LLukas22 commented May 12, 2023

philpax commented May 12, 2023

LLukas22 commented May 12, 2023