-
I was trying to use NeMo with a DeepSeek-distilled Llama model, and these models always start their responses with an opening `<think>` tag. After some debugging, I noticed that nemo-guardrails makes calls to the LLM with `max_tokens=3`, which captures only this opening think tag, and the rail blocks any further flow. Question is: is there a way to customize this, maybe a shim to parse the LLM response, or something to extend the token size, etc.?
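For illustration, the failure mode looks roughly like the sketch below. `fake_llm_call` mocks a reasoning model whose every completion opens with a `<think>` trace; it is a hypothetical stand-in, not a nemo-guardrails API.

```python
# Hypothetical sketch of the failure mode: a DeepSeek-distilled model
# spends its first tokens on the reasoning tag, so a 3-token cap
# returns only the opening trace. Not a real nemo-guardrails call.
def fake_llm_call(prompt: str, max_tokens: int) -> str:
    tokens = ["<think>", "\n", "The", " user", " asks", "...", "</think>", "\n", "Yes"]
    return "".join(tokens[:max_tokens])

# The rail expects a short answer, so it caps the call at 3 tokens...
print(fake_llm_call("Should the bot answer? Yes or No.", max_tokens=3))
# -> "<think>\nThe" : only the opening trace arrives, and the rail blocks.
```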
Replies: 2 comments
-
Hi @YaphetKG, yes, it is possible to set `max_tokens` per task. For example, have a look at the content safety prompts.yml. If your issue persists, please let me know; it could be that the task you are using does not support a `max_tokens` override. |
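For reference, a per-task override can look roughly like this in `prompts.yml` (a sketch modeled on the content safety example; the task name, prompt text, and value are illustrative, so check the shipped prompts.yml for the exact keys your tasks support):

```yaml
prompts:
  - task: content_safety_check_input $model=content_safety
    content: |
      Is the following user message safe? Answer "safe" or "unsafe".
      User message: "{{ user_input }}"
    # Raised from the default so the completion is not cut off
    # inside a reasoning trace.
    max_tokens: 200
```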
-
Hi @YaphetKG, while changing the `max_tokens` was needed, using reasoning models that emit reasoning traces (`deepseek-r1` or distilled models) in the output required a bit of extra work from the user. We now have an MR to make the process painless, see #996. This is still in review, but will be merged soon into `develop`. If you can test the changes and provide feedback, it would be helpful. Thanks, Traian |
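Until that lands, a user-side workaround might look like the sketch below: a small shim that strips the reasoning trace before the rails see the text. This is only an illustration of the "extra work" mentioned above, not the approach #996 takes.

```python
import re

# Matches a complete DeepSeek-style reasoning block.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)

def strip_reasoning_trace(text: str) -> str:
    """Drop the <think>...</think> trace so rails only see the answer.

    If the completion was truncated mid-trace, everything from the
    opening tag onward is removed.
    """
    cleaned = THINK_BLOCK.sub("", text)
    if "<think>" in cleaned:  # trace was cut off before </think>
        cleaned = cleaned.split("<think>", 1)[0]
    return cleaned.strip()

print(strip_reasoning_trace("<think>\nreasoning...\n</think>\nYes, this is safe."))
# -> "Yes, this is safe."
```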