nvidia
/

AceReason-Nemotron-14B

@@ -1,26 +1,26 @@
 ---
 library_name: transformers
 license: other
 license_name: nvidia-open-model-license
-license_link: >-
-  https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
 pipeline_tag: text-generation
-language:
-  - en
 tags:
-  - nvidia
-  - reasoning
-  - math
-  - code
-  - reinforcement learning
-  - pytorch
 ---
 # AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning
 <p align="center">
 [![Technical Report](https://img.shields.io/badge/2505.16400-Technical_Report-blue)](https://arxiv.org/abs/2505.16400)
@@ -111,15 +111,33 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 question = "" # code question
 starter_code = "" # starter code function header
-code_instruction_nostartercode = """Write Python code to solve the problem. Please place the solution code in the following format:\n```python\n# Your solution code here\n```"""
-code_instruction_hasstartercode = """Please place the solution code in the following format:\n```python\n# Your solution code here\n```"""
 if starter_code != "":
-    question += "\n\n" + "Solve the problem starting with the provided function header.\n\nFunction header:\n" + "```\n" + starter_code + "\n```"
-    question += "\n\n" + code_instruction_hasstartercode
 else:
-    question += "\n\n" + code_instruction_nostartercode
-final_prompt = "<｜User｜>" + question + "<｜Assistant｜><think>\n"
 ```
 4. Our inference engine for evaluation is **vLLM==0.7.3** using top-p=0.95, temperature=0.6, max_tokens=32768.
@@ -143,4 +161,4 @@ Your use of this model is governed by the [NVIDIA Open Model License](https://ww
   journal={arXiv preprint arXiv:2505.16400},
   year={2025}
 }
-```

 ---
+language:
+- en
 library_name: transformers
 license: other
 license_name: nvidia-open-model-license
+license_link: https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
 pipeline_tag: text-generation
 tags:
+- nvidia
+- reasoning
+- math
+- code
+- reinforcement learning
+- pytorch
 ---
 # AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning
+This repository contains the AceReason-Nemotron-1.1 7B model presented in [AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy](https://huggingface.co/papers/2506.13284).
+Project page: https://huggingface.co/nvidia/AceReason-Nemotron-1.1-7B
 <p align="center">
 [![Technical Report](https://img.shields.io/badge/2505.16400-Technical_Report-blue)](https://arxiv.org/abs/2505.16400)
 question = "" # code question
 starter_code = "" # starter code function header
+code_instruction_nostartercode = """Write Python code to solve the problem. Please place the solution code in the following format:
+```python
+# Your solution code here
+```"""
+code_instruction_hasstartercode = """Please place the solution code in the following format:
+```python
+# Your solution code here
+```"""
 if starter_code != "":
+    question += "
+" + "Solve the problem starting with the provided function header.
+Function header:
+" + "```
+" + starter_code + "
+```"
+    question += "
+" + code_instruction_hasstartercode
 else:
+    question += "
+" + code_instruction_nostartercode
+final_prompt = "<｜User｜>" + question + "<｜Assistant｜><think>
+"
 ```
 4. Our inference engine for evaluation is **vLLM==0.7.3** using top-p=0.95, temperature=0.6, max_tokens=32768.
   journal={arXiv preprint arXiv:2505.16400},
   year={2025}
 }
+```