hazyresearch/MATH500_with_Llama_3.1_8B_Instruct_v1
Viewer
•
Updated
•
500
•
40
hazyresearch/GPQA_with_Llama_3.1_8B_Instruct_v1
Viewer
•
Updated
•
646
•
17
hazyresearch/MMLU_with_Llama_3.1_8B_Instruct_v1
Viewer
•
Updated
•
719
•
23
hazyresearch/MMLU-Pro_with_Llama_3.1_8B_Instruct_v1
Viewer
•
Updated
•
500
•
10
hazyresearch/MMLU-Pro_with_Llama_3.1_70B_Instruct_v1
Viewer
•
Updated
•
500
•
16
hazyresearch/MMLU_with_Llama_3.1_70B_Instruct_v1
Viewer
•
Updated
•
719
•
18
hazyresearch/GPQA_with_Llama_3.1_70B_Instruct_v1
Viewer
•
Updated
•
646
•
28
hazyresearch/MATH500_with_Llama_3.1_70B_Instruct_v1
Viewer
•
Updated
•
500
•
28
hazyresearch/MATH-500_with_Llama_3.1_8B_Instruct_v1
Viewer
•
Updated
•
500
•
7
hazyresearch/CodeContests_with_Llama_3.3_70B_Instruct_SCORED_RESULTS
Viewer
•
Updated
•
140
•
7
hazyresearch/CodeContests_with_Llama_3.3_8B_Instruct_SCORED_RESULTS
Viewer
•
Updated
•
140
•
8
hazyresearch/GPQA_GPT-4o-mini_v2
Viewer
•
Updated
•
646
•
30
hazyresearch/monkey_business_128_MATH_llama_70b_unittests_results
Viewer
•
Updated
•
128
•
7
Viewer
•
Updated
•
646
•
37
hazyresearch/GPQA_GPT-4o-mini
Viewer
•
Updated
•
646
•
31
hazyresearch/GSM8K_GPT-4o-mini_with_LM_Judges_and_RMs_v1
Viewer
•
Updated
•
127
•
5
hazyresearch/smoothie_data
Preview
•
Updated
•
337
•
1
hazyresearch/based_nq_1024
Viewer
•
Updated
•
3.16k
•
18
hazyresearch/based_nq_512
Viewer
•
Updated
•
3.16k
•
29
hazyresearch/based_nq_2048
Viewer
•
Updated
•
3.16k
•
595
hazyresearch/based_triviaqa
Viewer
•
Updated
•
1.69k
•
676
Viewer
•
Updated
•
2.09k
•
627
Viewer
•
Updated
•
2.98k
•
2.97k
•
2
Viewer
•
Updated
•
1.11k
•
2.82k
•
4
Viewer
•
Updated
•
1.1k
•
2.98k
•
3
hazyresearch/LoCoV1-Queries
Viewer
•
Updated
•
7.73k
•
64
•
2
hazyresearch/LoCoV1-Documents
Viewer
•
Updated
•
14.8k
•
58
•
4
hazyresearch/based-swde-deprecated
Viewer
•
Updated
•
12.4k
•
6
Viewer
•
Updated
•
1.1k
•
18
•
1
Updated
•
35
•
6