Lower level API? #267

spion · 2023-05-22T12:56:51Z

I was wondering if any of the crates exposes a simple, low-level API

Something along the lines of

(sessionState, tokenIds) -> (nextSessionState, next_logits)

This separates the network itself from the other fiddly bits, allowing you to implement any strategy for handling the logits (such as increasing/decreasing probabilities based on external criteria before sampling with top_p / top_k, restricting to a subset with logits using context-free grammars, or any number of other strategies i.e. #235 stuff).

It also removes the need to pass a rng, have a lot of parameters tied to a session, lets one use a different tokenizer (e.g. the huggingface one) and removes the need to set up progress tracking callbacks.

The text was updated successfully, but these errors were encountered:

philpax · 2023-05-22T14:11:24Z

The closest thing is infer_next_token. You're right that there should be a lower-level API that evaluates the transformer and recalculates the logits, but leaves sampling up to you.

philpax · 2023-05-31T19:54:46Z

Hi there! I've just merged #280 which should address this by letting you define your own sampling strategy, which you can combine with infer_next_token. Can you let me know if that solves your problem?

The reason for not supporting arbitrary inference is that the previous tokens are required for continued sampling, so the session needs to know which token was chosen after inference.

spion · 2023-05-31T20:09:35Z

The Sampler trait looks perfect at first glance, I'm going to give it a try in my project. edit: Thank you!

philpax added the issue:enhancement New feature or request label May 22, 2023

tanmaysachan mentioned this issue May 25, 2023

Add interface to allow changing of logits and sampling before inference #276

Closed

philpax closed this as completed Jun 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lower level API? #267

Lower level API? #267

spion commented May 22, 2023 •

edited

Loading

philpax commented May 22, 2023

philpax commented May 31, 2023

spion commented May 31, 2023 •

edited

Loading

Lower level API? #267

Lower level API? #267

Comments

spion commented May 22, 2023 • edited Loading

philpax commented May 22, 2023

philpax commented May 31, 2023

spion commented May 31, 2023 • edited Loading

spion commented May 22, 2023 •

edited

Loading

spion commented May 31, 2023 •

edited

Loading