Chris-Alexiuk commited on
Commit
75dfd78
·
verified ·
1 Parent(s): fdb0e3e

Add clarification for empty <think> tags

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -107,6 +107,7 @@ Llama-3.1-Nemotron-Ultra-253B-v1 is a general purpose reasoning and chat model i
107
  3. We recommend using greedy decoding (temperature 0\) for Reasoning OFF mode
108
  4. We do not recommend to add additional system prompts besides the control prompt, all instructions should be put into user query
109
  5. We have provided a list of prompts to use for evaluation for each benchmark where a specific template is required
 
110
 
111
  You can try this model out through the preview API, using this link: [Llama-3\_1-Nemotron-Ultra-253B-v1](https://build.nvidia.com/nvidia/llama-3\_1-nemotron-ultra-253b-v1).
112
 
 
107
  3. We recommend using greedy decoding (temperature 0\) for Reasoning OFF mode
108
  4. We do not recommend to add additional system prompts besides the control prompt, all instructions should be put into user query
109
  5. We have provided a list of prompts to use for evaluation for each benchmark where a specific template is required
110
+ 6. The model will include `<think></think>` if no reasoning was necessary in Reasoning ON model, this is expected behaviour
111
 
112
  You can try this model out through the preview API, using this link: [Llama-3\_1-Nemotron-Ultra-253B-v1](https://build.nvidia.com/nvidia/llama-3\_1-nemotron-ultra-253b-v1).
113