# Reflection 70B

**Reflection 70B is (currently) the world's top open-source LLM, trained with a new technique called Reflection-Tuning that teaches a LLM to detect mistakes in its reasoning and correct course.**

The model was trained on synthetic data generated by [Glaive](https://glaive.ai). If you're training a model, Glaive is incredible — use them.

what is 2+2?<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```
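The closing lines above come from a chat-formatted prompt built with the standard Llama 3.1 special tokens. As a minimal sketch (the helper name, the system text, and the exact newline placement are assumptions here, not an official API), the prompt can be assembled like this:

```python
# Minimal sketch: assemble a Llama 3.1-style chat prompt by hand.
# The function name and newline placement are illustrative assumptions.

def build_prompt(system: str, user: str) -> str:
    """Wrap a system prompt and a user message in Llama 3.1 chat
    tokens, ending at the assistant header so the model completes
    the assistant turn."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|><|start_header_id|>assistant<|end_header_id|>"
    )

prompt = build_prompt(
    "You are a world-class AI system, capable of complex reasoning and reflection.",
    "what is 2+2?",
)
```

In practice, `transformers`' `tokenizer.apply_chat_template` can build this string from a message list, so hand-assembly is only needed when working outside that ecosystem.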

## Tips for Performance

- We are initially recommending a `temperature` of `.7` and a `top_p` of `.95`.
- For increased accuracy, append `Think carefully.` at the end of your messages.
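As a sketch of how these tips map onto a typical Hugging Face-style `generate` call (the constant and helper names below are illustrative, not part of the model's API):

```python
# Sketch: the recommended sampling settings and the accuracy tip as code.
# RECOMMENDED_SAMPLING and with_think_carefully are illustrative names.

RECOMMENDED_SAMPLING = {
    "temperature": 0.7,
    "top_p": 0.95,
    "do_sample": True,  # temperature/top_p only apply when sampling is enabled
}

def with_think_carefully(message: str) -> str:
    """Append 'Think carefully.' to the end of a user message."""
    return f"{message.rstrip()} Think carefully."

# e.g. model.generate(**inputs, **RECOMMENDED_SAMPLING), with the user
# message passed through with_think_carefully() before templating.
```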

## Dataset / Report

Both the dataset and a brief report detailing how we trained this model will be released next week, alongside our Reflection 405B model that we expect will be the top-performing LLM in the world, including closed-source models.

---

Thanks to Jason Kuperberg and Josh Bickett from the [HyperWrite](https://hyperwriteai.com) team for reviewing drafts of the report we'll be releasing next week.

---
license: llama3.1
base_model: meta-llama/Meta-Llama-3.1-70B-Instruct
---