This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Custom RoPE Scaling #389

Merged: 3 commits into rustformers:main on Jul 28, 2023

Conversation

LLukas22 (Contributor)

Closes #378.

- Adds custom context scaling to llama, falcon, gpt-j, and gpt-neox.
- Adds an Option<ggml::CustomRoPEArguments> parameter to ModelParameters (see the sketch below).
- Adds the optional --rope-base and --rope-scaling CLI parameters.
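For illustration, wiring the new parameter up might look like the following sketch. The `rope_overrides` field and the argument field names are assumptions inferred from the CLI flags above, not copied from the diff:

```rust
use llm::{ggml, ModelParameters};

fn example_params() -> ModelParameters {
    // Hypothetical: request an 8k context from LLaMA 2 by halving RoPE
    // positions. `rope_overrides` and the argument field names are guesses
    // based on the --rope-base / --rope-scaling flags, not the merged code.
    ModelParameters {
        context_size: 8192,
        rope_overrides: Some(ggml::CustomRoPEArguments {
            base: 10_000, // RoPE's conventional default base frequency
            scaling: 0.5, // positions scaled by 0.5 -> double the usable context
        }),
        ..Default::default()
    }
}
```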

@philpax (Collaborator) left a comment

Code looks good. What's the easiest way to test it?

@LLukas22 (Contributor, Author)

1. Sample command for an 8k context with LLaMA 2:

       cargo run --release --features cublas -- infer -a llama -m "C:\Users\lkreu\Downloads\llama-2-13b-chat.ggmlv3.q5_K_M.bin" -p "A llama riding a crab" --use-gpu --rope-scaling 0.5 --num-ctx-tokens 8192 --ignore-eos --stats

2. Sit back and get some coffee ☕ (8192 tokens is a lot of text to generate).

A 16k context is also possible by setting --rope-scaling to 0.25, but then I don't have enough VRAM to run inference on my GPU.
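For context on those numbers: SuperHOT-style linear interpolation multiplies each token position by the scale factor before the rotary angles are computed, so with scale 0.5 a model trained on 4096 positions never sees an effective position above 4096 even at 8192 tokens, and 0.25 stretches that to 16384. A minimal illustrative sketch of the idea (not the crate's actual implementation; names are made up):

```rust
/// Illustrative linear RoPE position interpolation (the SuperHOT-style
/// scaling this PR exposes). Returns the rotation angle for each
/// dimension pair of one attention head at a given token position.
fn rope_angles(position: usize, head_dim: usize, base: f32, scale: f32) -> Vec<f32> {
    (0..head_dim / 2)
        .map(|i| {
            // Standard RoPE frequency for the i-th dimension pair.
            let theta = base.powf(-2.0 * i as f32 / head_dim as f32);
            // The only change vs. vanilla RoPE: rotate by the *scaled*
            // position, so position 8191 with scale 0.5 behaves like ~4095.
            position as f32 * scale * theta
        })
        .collect()
}
```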

@LLukas22 (Contributor, Author)

The generated text gets repetitive after some time, but I guess that's a sampler/settings issue.
lama_story.txt

@philpax (Collaborator) commented on Jul 28, 2023

Great work! I just tested it with LLongMa-2; it's a bit finicky, but that shouldn't be a problem on our end. I've revised the names a little to match llama.cpp / refer to frequency, but the rest is the same. Will merge once CI passes 🚀
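For reference, llama.cpp's corresponding flags are --rope-freq-base and --rope-freq-scale. A hypothetical sketch of what a frequency-oriented override type might look like after that rename; the names are assumed, not read from the merged commit:

```rust
/// Hypothetical frequency-based override mirroring llama.cpp's
/// --rope-freq-base / --rope-freq-scale naming. Field names are assumed.
#[derive(Clone, Debug, PartialEq)]
pub struct RoPEOverrides {
    /// Base frequency; standard RoPE uses 10_000.
    pub frequency_base: usize,
    /// Linear position scale; 1.0 leaves RoPE unchanged.
    pub frequency_scale: f32,
}

impl Default for RoPEOverrides {
    fn default() -> Self {
        Self {
            frequency_base: 10_000,
            frequency_scale: 1.0,
        }
    }
}
```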

philpax merged commit 9fe9f19 into rustformers:main on Jul 28, 2023.
hhamud mentioned this pull request on Aug 7, 2023.

Successfully merging this pull request may close these issues:

Implement SuperHOT/interpolated RoPE support (#378)