burtenshaw committed
Commit b5eec3d · Parent(s): 8b0cd75
use lighteval for evaluation
docs.md CHANGED

@@ -36,16 +36,14 @@ Your trained model will be available at `your-username/your-model-name`. For det
 Now, we will need to evaluate the model. We will use `hf jobs` to evaluate the model as well and combine it with `openbench`. We will push the evaluation results to a dataset on the hub.
 
 ```sh
-
-
-
-
-
-bench eval mmlu --model vllm/<your-username>/your-model-name \ # define model repo
---hub-repo <your-username>/<your-model-name>-chapter-1 # define output dataset
+hf jobs uv run \ # run a hf jobs job with uv
+--flavor a10g-large \ # select the machine size
+--with "lighteval[vllm]" \ # install lighteval with vllm dependencies
+-s HF_TOKEN \ # share the huggingface write token
+lighteval vllm "model_name=<your-username>/<your-model-name>" "lighteval|gsm8k|0|0" --push-to-hub --results-org <your-username>
 ```
 
-This command will evaluate the model using `
+This command will evaluate the model using `lighteval` and `vllm` and save the results to the Hugging Face Hub in the dataset repo that you defined.
 
 <Tip>
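A note for readers following this diff: in `lighteval`, the task argument reads `suite|task|num_fewshot|auto-reduce-flag`, so `"lighteval|gsm8k|0|0"` runs GSM8K zero-shot. A minimal sketch of a variant, using the same `<your-username>` / `<your-model-name>` placeholders as the commit:

```sh
# Same evaluation, but 5-shot: the third field is the number of in-context
# examples; the last field (0/1) toggles auto-reducing few-shot examples
# that would overflow the context window.
lighteval vllm "model_name=<your-username>/<your-model-name>" "lighteval|gsm8k|5|0" \
  --push-to-hub --results-org <your-username>
```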
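Once submitted, the job can be watched from the same CLI; a brief sketch using the `hf jobs` subcommands (the job ID is the value printed when the job starts):

```sh
hf jobs ps                 # list your jobs and their IDs
hf jobs logs <job-id>      # stream the evaluation job's logs
hf jobs inspect <job-id>   # check the job's status and metadata
```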
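After the run finishes, the pushed results can be pulled locally for inspection. The repo name below is an assumption (lighteval typically pushes a `details_...` dataset under the `--results-org` account); verify the actual name on your profile:

```sh
# Download the evaluation results dataset (repo name is an assumption;
# check your profile for the dataset lighteval actually created).
hf download <your-username>/details_<your-model-name> \
  --repo-type dataset --local-dir eval-results
```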
