Commit 5ec7191 (1 parent: 86cf4e3)
Committed by MaxJeblick
Update README.md

README.md CHANGED
@@ -1,4 +1,4 @@
-Small dummy deberta-v3-type Model useable for Unit/Integration tests. Suitable for CPU only machines, see [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio/blob/main/tests/integration/test_integration.py) for an example integration test.
+Small dummy deberta-v3-type Reward Model useable for Unit/Integration tests for RLHF. Suitable for CPU only machines, see [H2O LLM Studio](https://github.com/h2oai/h2o-llmstudio/blob/main/tests/integration/test_integration.py) for an example integration test.
 
 Model was created as follows:
 ```python