Research-EAI commited on
Commit
f2c7da2
·
verified ·
1 Parent(s): 75d870a

Changelog for update to Rnj-1.

Browse files
Files changed (1) hide show
  1. README.md +18 -2
README.md CHANGED
@@ -1,6 +1,8 @@
1
  ---
2
  license: apache-2.0
3
  library_name: transformers
 
 
4
  ---
5
  # Rnj-1
6
 
@@ -66,6 +68,16 @@ library_name: transformers
66
 
67
  Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models. These models perform well across a range of programming languages and boast strong agentic capabilities (e.g., inside agentic frameworks like mini-SWE-agent), while also excelling at tool-calling. They additionally exhibit strong capabilities in math and science. Herein, `rnj-1` refers to the base model, while `rnj-1-instruct` refers to the post-trained instruction tuned model.
68
 
 
 
 
 
 
 
 
 
 
 
69
  # Capabilities
70
 
71
  We evaluate Rnj-1 models against models of comparable size. In addition to accuracy, we also show the FLOPs used in pre-training for each model.
@@ -159,9 +171,13 @@ The global batch sizes used were:
159
 
160
  # Recommendations
161
 
162
- ### Temperature
 
 
 
 
163
 
164
- We recommend using temperatures in the range [0, 0.6] for `rnj-1-instruct`.
165
 
166
  ### Propensity to write code
167
 
 
1
  ---
2
  license: apache-2.0
3
  library_name: transformers
4
+ base_model:
5
+ - EssentialAI/rnj-1
6
  ---
7
  # Rnj-1
8
 
 
68
 
69
  Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models. These models perform well across a range of programming languages and boast strong agentic capabilities (e.g., inside agentic frameworks like mini-SWE-agent), while also excelling at tool-calling. They additionally exhibit strong capabilities in math and science. Herein, `rnj-1` refers to the base model, while `rnj-1-instruct` refers to the post-trained instruction tuned model.
70
 
71
+ # Changelog
72
+
73
+ * Update December 18, 2025:
74
+ - System prompt and temperature recommendations: Resolve premature truncations and mitigate unprompted code outputs.
75
+ - Updates to default chat template.
76
+ - Updated STEM and comparables tables.
77
+ - Links to model generations for evals.
78
+
79
+ * Initial version: December 8, 2025
80
+
81
  # Capabilities
82
 
83
  We evaluate Rnj-1 models against models of comparable size. In addition to accuracy, we also show the FLOPs used in pre-training for each model.
 
171
 
172
  # Recommendations
173
 
174
+ ### System Prompt & Temperature
175
+
176
+ We recommend _always_ adding a system prompt. `You are a helpful assistant.` is a good default prompt to use.
177
+
178
+ We recommend using temperatures in the range [0, 0.2] for `rnj-1-instruct`.
179
 
180
+ Failure to follow these recommendations can result in a) truncated outputs, b) code outputs even for non-code prompts.
181
 
182
  ### Propensity to write code
183