nvidia
/

Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct

Text Generation

text-generation-inference

Model card Files Files and versions Community

xp1992slz commited on 27 days ago

Commit

fa3a10b

·

verified ·

1 Parent(s): 27d64ec

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -57,7 +57,7 @@ print(outputs[0]["generated_text"][-1])
 ## Evaluation Results
-We evaluate Llama-3.1-Nemotron-8B-UltraLong-1M-Instruct on a diverse set of benchmarks, including long-context tasks (e.g., RULER, LV-Eval, and InfiniteBench) and standard tasks (e.g., MMLU, MATH, GSM-8K, and HumanEval). UltraLong-8B achieves superior performance on ultra-long context tasks while maintaining competitive results on standard benchmarks.
 ### Needle in a Haystack

 ## Evaluation Results
+We evaluate Nemotron-UltraLong-8B on a diverse set of benchmarks, including long-context tasks (e.g., RULER, LV-Eval, and InfiniteBench) and standard tasks (e.g., MMLU, MATH, GSM-8K, and HumanEval). UltraLong-8B achieves superior performance on ultra-long context tasks while maintaining competitive results on standard benchmarks.
 ### Needle in a Haystack