openchat
/

openchat-3.5-1210

@@ -1,3 +1,4 @@
 <div align="center">
   <img src="https://raw.githubusercontent.com/imoneoi/openchat/master/assets/logo_new.png" style="width: 65%">
   <h1>Advancing Open-source Language Models with Mixed-Quality Data</h1>
@@ -23,31 +24,48 @@
 </p>
 <hr>
-<div style="background-color: white; padding: 0.7em; border-radius: 0.5em; color: black; display: flex; flex-direction: column; justify-content: center; text-align: center; ont-size: 0.5em;">
   <a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: black;">
-  <span style="font-size: 0.7em;  font-family: 'Helvetica'; color:  white; vertical-align: top;  background-color:white;  border-radius: 6em; padding: 0.04em 0.4em; letter-spacing: 0.1em; font-weight: bold">3.51210</span>
-    <span style="font-size: 1.7em; font-family: 'Helvetica'; letter-spacing: 0.1em; font-weight: bold; color: black;">OPENCHAT</span><span style="font-size: 1.8em; font-family: 'Helvetica'; color: #3c72db; ">3.5</span>
-        <span style="font-size: 0.7em;  font-family: 'Helvetica'; color:  white; vertical-align: top;  background-color:red;  border-radius: 6em; padding: 0.066em 0.4em; letter-spacing: 0.1em; font-weight: bold;">1210</span>
-    <span style="font-size: 1em; font-family: 'Helvetica'; color: black;">
-      <br> 🏆 The Overall Best Performing Open Source 7B Model 🏆
-    <br> 🤖 Outperforms <span style="font-weight: bold;">ChatGPT</span> (March) and <span style="font-weight: bold;">Grok-1</span>  on most benchmarks 🤖
-      <br> 🚀<span style="font-size: 1em; font-family: 'Helvetica'; color: black; font-weight: bold;">15</span>-point improvement in Coding Performance over <span style="font-size: 0.9em;
-      font-family: 'Helvetica'; color: black; font-weight: bold;">OpenChat-3.5🚀</span>
-      <br><span style="font-size: 1em; font-family: 'Helvetica'; color: #3c72db; font-weight: bold;">New Features</span>
-      <br> 💡 2 Modes: Coding + Generalist, Mathematical Reasoning 💡
-      <br> 🧑‍⚖️ Experimental support for Evaluator and Feedback capabilities 🧑‍⚖️
     </span>
   </a>
 </div>
 <div style="display: flex; justify-content: center; align-items: center">
-  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/1210bench.png?raw=true" style="width: 100%; border-radius: 1em">
 </div>
 <div>
 <h3> Table of Contents</h3>
 </div>
 1. [Usage](#usage)
 2. [Benchmarks](#benchmarks)
 3. [Limitations](#limitations)
@@ -174,7 +192,6 @@ Score 5: {orig_score5_description}
 | OpenOrca Mistral   | 7B       | 52.7     | 6.86         | 38.4            | 49.4     | 42.9     | 45.9          | 59.3         | 59.1         | 58.1        |
 | Zephyr-β^          | 7B       | 34.6     | 7.34         | 22.0            | 40.6     | 39.0     | 40.8          | 39.8         | 5.1          | 16.0        |
 | Mistral            | 7B       | -        | 6.84         | 30.5            | 39.0     | 38.0     | -             | 60.1         | 52.2         | -           |
 <details>
   <summary>Evaluation Details(click to expand)</summary>
 *: ChatGPT (March) results are from [GPT-4 Technical Report](https://arxiv.org/abs/2303.08774), [Chain-of-Thought Hub](https://github.com/FranxYao/chain-of-thought-hub), and our evaluation. Please note that ChatGPT is not a fixed baseline and evolves rapidly over time.
@@ -189,6 +206,7 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 <h3>HumanEval+</h3>
 </div>
 | Model                       | Size     | HumanEval+ pass@1 |
 |-----------------------------|----------|------------|
 | ChatGPT (December 12, 2023) | -        | 64.6       |
@@ -209,6 +227,12 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 *: Grok results are reported by [X.AI](https://x.ai/).
 <div align="center">
 <h2> Limitations </h2>
 </div>
@@ -226,6 +250,7 @@ OpenChat may sometimes generate information that does not exist or is not accura
 **Safety**
 OpenChat may sometimes generate harmful, hate speech, biased responses, or answer unsafe questions. It's crucial to apply additional AI safety measures in use cases that require safe and moderated responses.
 <div align="center">
 <h2> License </h2>
 </div>
@@ -251,7 +276,6 @@ OpenChat 3.5 was trained with C-RLFT on a collection of publicly available high-
 <div align="center">
 <h2> Citation </h2>
 </div>
 ```
 @article{wang2023openchat,
   title={OpenChat: Advancing Open-source Language Models with Mixed-Quality Data},

 <div align="center">
   <img src="https://raw.githubusercontent.com/imoneoi/openchat/master/assets/logo_new.png" style="width: 65%">
   <h1>Advancing Open-source Language Models with Mixed-Quality Data</h1>
 </p>
 <hr>
+<div style="background-color: white; padding: 0.7em; border-radius: 0.5em; color: black; display: flex; flex-direction: column; justify-content: center; text-align: center;">
   <a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: black;">
+    <span style="font-size: 0.7em; font-family: 'Helvetica'; color: white; background-color:white; border-radius: 6em; padding: 0.04em 0.4em; letter-spacing: 0.1em; font-weight: bold">3.51210</span>
+    <span style="font-size: 1.7em; font-family: 'Helvetica'; letter-spacing: 0.1em; font-weight: bold; color: black;">OPENCHAT</span><span style="font-size: 1.8em; font-family: 'Helvetica'; color: #3c72db;">3.5</span>
+    <span style="font-size: 0.7em; font-family: 'Helvetica'; color: white; background-color:red; border-radius: 6em; padding: 0.066em 0.4em; letter-spacing: 0.1em; font-weight: bold; vertical-align: top;">1210</span><br>
+    <span style="font-size: 2vw; font-family: 'Helvetica'; color: black; white-space: nowrap;">
+      🏆 The Overall Best Performing Open Source 7B Model 🏆
+    </span>
+    <br> <span style="font-size: 2vw; font-family: 'Helvetica'; color: black; white-space: nowrap;">🤖 Outperforms <span style="font-weight: bold;">ChatGPT</span> (March) and <span style="font-weight: bold;">Grok-1</span>  on most benchmarks 🤖</span>
+      <br> <span style="font-size: 2vw; font-family: 'Helvetica'; color: black; white-space: nowrap;">🚀 <span style="font-size: 1em; font-family: 'Helvetica'; color: black; font-weight: bold;">15</span>-point improvement in Coding Performance over <span style="font-size: 0.9em;
+      font-family: 'Helvetica'; color: black; font-weight: bold;">OpenChat-3.5 🚀</span></span>
+      <br><span style="font-size: 2vw; font-family: 'Helvetica'; color: #3c72db; font-weight: bold; white-space: nowrap;">New Features</span>
+      <br> <span style="font-size: 2vw; font-family: 'Helvetica'; color: black; white-space: nowrap;">💡 2 Modes: Coding + Generalist, Mathematical Reasoning 💡</span>
+      <br><span style="font-size: 2vw; font-family: 'Helvetica'; color: black; white-space: nowrap;"> 🧑‍⚖️ Experimental support for Evaluator and Feedback capabilities 🧑‍⚖️</span>
     </span>
   </a>
 </div>
+<!-- <a href="https://huggingface.co/openchat/openchat_3.5">
+  <button class="common-button">Model Repo</button>
+</a>
+<a href="https://openchat.team">
+  <button class="common-button">OpenChatUI Demo</button>
+</a>
+<a href="https://huggingface.co/spaces/openchat/openchat_3.5">
+  <button class="common-button">HuggingFace Space</button>
+</a>
+<a href="https://arxiv.org/pdf/2309.11235.pdf">
+  <button class="common-button">Paper</button>
+</a>
+ -->
+</p>
 <div style="display: flex; justify-content: center; align-items: center">
+  <img src="https://github.com/alpayariyak/openchat/blob/master/assets/1210bench.png?raw=true" style="width: 100%; border-radius: 1em">">
 </div>
 <div>
 <h3> Table of Contents</h3>
 </div>
 1. [Usage](#usage)
 2. [Benchmarks](#benchmarks)
 3. [Limitations](#limitations)
 | OpenOrca Mistral   | 7B       | 52.7     | 6.86         | 38.4            | 49.4     | 42.9     | 45.9          | 59.3         | 59.1         | 58.1        |
 | Zephyr-β^          | 7B       | 34.6     | 7.34         | 22.0            | 40.6     | 39.0     | 40.8          | 39.8         | 5.1          | 16.0        |
 | Mistral            | 7B       | -        | 6.84         | 30.5            | 39.0     | 38.0     | -             | 60.1         | 52.2         | -           |
 <details>
   <summary>Evaluation Details(click to expand)</summary>
 *: ChatGPT (March) results are from [GPT-4 Technical Report](https://arxiv.org/abs/2303.08774), [Chain-of-Thought Hub](https://github.com/FranxYao/chain-of-thought-hub), and our evaluation. Please note that ChatGPT is not a fixed baseline and evolves rapidly over time.
 <h3>HumanEval+</h3>
 </div>
 | Model                       | Size     | HumanEval+ pass@1 |
 |-----------------------------|----------|------------|
 | ChatGPT (December 12, 2023) | -        | 64.6       |
 *: Grok results are reported by [X.AI](https://x.ai/).
 <div align="center">
 <h2> Limitations </h2>
 </div>
 **Safety**
 OpenChat may sometimes generate harmful, hate speech, biased responses, or answer unsafe questions. It's crucial to apply additional AI safety measures in use cases that require safe and moderated responses.
+## License
 <div align="center">
 <h2> License </h2>
 </div>
 <div align="center">
 <h2> Citation </h2>
 </div>
 ```
 @article{wang2023openchat,
   title={OpenChat: Advancing Open-source Language Models with Mixed-Quality Data},