
Fix perplexity calculation #251

Merged · 7 commits · May 20, 2023

Conversation

tanmaysachan (Contributor)

There were some errors (one quite big) in #248 for larger files, which caused it to panic with a large context window.

I believe this fixes most of those cases. I'm limited in my ability to test, so if someone else could run perplexity checks on a few models, that would help.

philpax (Collaborator) commented May 20, 2023

Apologies - I got carried away while testing this and ended up rewriting it in the process 😅. While doing so, I discovered a few things:

  1. Perplexity should be treated as a separate CLI command, not run after inference - otherwise, the previous state will interfere with the calculated perplexity. I've made this change.
  2. The calculated perplexity doesn't match llama.cpp's, so I ported llama.cpp's implementation over; the results are closer now, but still not identical. I suspect this is because of #210 (Update to latest upstream LLaMA implementation).
  3. The llama.cpp implementation runs multiple rounds of the perplexity calculation and reports each one to the user. I've implemented this through a callback (see the sketch after this list).
  4. The implementation still segfaults with long enough text (both your implementation and mine). I tested a context length of 512 with ~10,000 characters, which was enough to exceed the context length.
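
For context, here's a minimal sketch of that llama.cpp-style calculation with a per-chunk callback for reporting. The `evaluate` closure and all other names are illustrative assumptions, not the crate's actual API:

```rust
/// Sketch of llama.cpp-style perplexity, assuming a hypothetical `evaluate`
/// closure that returns logits for every position in a chunk (flattened as
/// [pos * n_vocab + token]). Names here are illustrative only.
fn perplexity(
    tokens: &[u32],                                // full tokenized text
    n_ctx: usize,                                  // context length, e.g. 512
    n_vocab: usize,                                // vocabulary size
    mut evaluate: impl FnMut(&[u32]) -> Vec<f32>,  // model forward pass (assumed)
    mut on_chunk: impl FnMut(usize, f64),          // callback: (chunk index, running ppl)
) -> f64 {
    let mut nll = 0.0f64; // accumulated negative log-likelihood
    let mut count = 0usize;

    for (i, chunk) in tokens.chunks(n_ctx).enumerate() {
        if chunk.len() < n_ctx {
            break; // only score full chunks
        }
        let logits = evaluate(chunk);

        // Score predictions for the second half of the chunk only, so every
        // target token has a reasonable amount of preceding context.
        for pos in (chunk.len() / 2)..(chunk.len() - 1) {
            let row = &logits[pos * n_vocab..(pos + 1) * n_vocab];
            let target = chunk[pos + 1] as usize;

            // Log-softmax of the target token, computed stably.
            let max = row.iter().copied().fold(f32::NEG_INFINITY, f32::max);
            let sum_exp: f64 = row.iter().map(|&l| ((l - max) as f64).exp()).sum();
            let log_prob = (row[target] - max) as f64 - sum_exp.ln();

            nll -= log_prob;
            count += 1;
        }

        // Report the running perplexity after each chunk, as llama.cpp does.
        on_chunk(i, (nll / count as f64).exp());
    }

    (nll / count as f64).exp()
}
```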

I suspect both 2) and 4) will be resolved by updating the LLaMA implementation. I've made a few interface changes, so I'm inclined to merge this and make an issue to improve its behaviour over time.

Thanks again for the PR, and my sincere apologies about what I did to it - I was trying to get to the bottom of things and realised it was deeper than I expected 😬

tanmaysachan (Contributor, Author)

@philpax Thanks for the deep dive! I was a little confused about why the perplexity wasn't matching llama.cpp's, which is why I tried implementing it from their source. I'll also have a look at the LLaMA changes to see if I can help fix this. Thanks for such a great repo!

@philpax philpax merged commit c4b2ca8 into rustformers:main May 20, 2023
@hhamud hhamud mentioned this pull request Aug 7, 2023