Cached / multimodal tokens are not passed through to Langfuse #8515

hassiebp · 2025-02-13T10:51:46Z

What happened?

Context

Langfuse has shipped V3 cost tracking, which allows tracking usage details and cost details by arbitrary usage keys beyond just input, output, and total. If OpenAI returns prompt_tokens_details.cached_tokens, Langfuse can infer costs when the cached tokens are provided in usage_details.input_cached_tokens for the Generation. See docs here

Issue
LiteLLM is currently using the soon-to-be-deprecated usage key and not the new usage_details and cost_details map. See here.

Desired behavior
Langfuse accepts the OpenAI schema for usage_details and cost_details, both server side and via its SDK interface. Langfuse flattens the prompt_tokens_details and completion_tokens_details objects provided by OpenAI by prefixing their keys with input_ and output_ respectively. Please pass the OpenAI usage object as generation.usage_details, and populate cost_details likewise.
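To illustrate the flattening described above, here is a minimal sketch in plain Python. It is not Langfuse's or LiteLLM's actual code. The function name `flatten_openai_usage` is hypothetical, and copying the top-level counts through unchanged is an assumption, since this issue only describes how the nested detail objects are prefixed.

```python
from typing import Any, Dict

# Prefixes as described in this issue: details under prompt_tokens_details
# get "input_", details under completion_tokens_details get "output_".
DETAIL_PREFIXES = {
    "prompt_tokens_details": "input_",
    "completion_tokens_details": "output_",
}


def flatten_openai_usage(usage: Dict[str, Any]) -> Dict[str, int]:
    """Hypothetical helper: flatten an OpenAI-style usage dict into flat keys."""
    flat: Dict[str, int] = {}
    for key, value in usage.items():
        if key in DETAIL_PREFIXES:
            # Nested detail object (may be None): prefix each inner key.
            for detail_key, detail_value in (value or {}).items():
                if detail_value is not None:
                    flat[DETAIL_PREFIXES[key] + detail_key] = detail_value
        elif isinstance(value, int):
            # Assumption: top-level counts are copied through unchanged.
            flat[key] = value
    return flat


example_usage = {
    "prompt_tokens": 100,
    "completion_tokens": 20,
    "total_tokens": 120,
    "prompt_tokens_details": {"cached_tokens": 64, "audio_tokens": 0},
    "completion_tokens_details": {"reasoning_tokens": 8},
}
flat_usage = flatten_openai_usage(example_usage)
```

With this input, `flat_usage` contains `input_cached_tokens` and `output_reasoning_tokens` alongside the top-level counts, which is the key shape the Context section says Langfuse uses to infer cached-token costs.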

Here's the Pydantic schema enforced by Langfuse for the OpenAI Usage schema

Relevant log output

Are you a ML Ops Team?

No

What LiteLLM version are you on ?

v1.60.4

Twitter / LinkedIn details

https://www.linkedin.com/in/hassieb/


hassiebp added the bug Something isn't working label Feb 13, 2025

ishaan-jaff added feb 2025 langfuse labels Feb 18, 2025
