Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ less ram intensive, inference engine).
|
|
33 |
|
34 |
The CPU used for each test was the AMD Ryzen 7 2700X<br>
|
35 |
RAM: 64GB DDR4<br>
|
36 |
-
### The SLMB (Small Language Model Benchmark)
|
37 |
#### Quick description
|
38 |
This is a very very simple model test I created to test the capabilies of much smaller LLMs. (The answers are included, though they aren't actually needed)
|
39 |
#### The Benchmark
|
@@ -60,4 +60,7 @@ What is a dog? (any answer that matches at least a very basic description of a d
|
|
60 |
<br>
|
61 |
Category 4: basic logic - x/2 (2 points for correct, 0 for wrong)<br>
|
62 |
I have three friends (Jeremy, Tyler, and Gabe) Friend #1 is Jeremy, Friend #3 is Tyler, who is friend #2? <br>
|
63 |
-
(Gabe)
|
|
|
|
|
|
|
|
33 |
|
34 |
The CPU used for each test was the AMD Ryzen 7 2700X<br>
|
35 |
RAM: 64GB DDR4<br>
|
36 |
+
### The SLMB (Small Language Model Benchmark) v1
|
37 |
#### Quick description
|
38 |
This is a very very simple model test I created to test the capabilies of much smaller LLMs. (The answers are included, though they aren't actually needed)
|
39 |
#### The Benchmark
|
|
|
60 |
<br>
|
61 |
Category 4: basic logic - x/2 (2 points for correct, 0 for wrong)<br>
|
62 |
I have three friends (Jeremy, Tyler, and Gabe) Friend #1 is Jeremy, Friend #3 is Tyler, who is friend #2? <br>
|
63 |
+
(Gabe)<br>
|
64 |
+
## Conclusion
|
65 |
+
While stok is definitely (in my opinion) pretty impressive -- especially given it's performance at such small sizes -- it has lots of room to go (also the
|
66 |
+
benchmark may include more tests in the future)
|