Wenhong
wh-zhu
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 19 hours ago
wh-zhu/short_cot_calibration
published
a dataset
about 19 hours ago
wh-zhu/short_cot_calibration
updated
a dataset
about 19 hours ago
wh-zhu/long_cot_calibration
Organizations
None yet
Collections
3
models
15

wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_2
Updated
•
49

wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_5
Updated
•
13

wh-zhu/DeepSeek-R1-TrRa-1.5B-lambda_10
Updated
•
5

wh-zhu/DeepSeek-R1-TrRa-iter2-1.5B-lambda_2
Updated
•
5

wh-zhu/DeepSeek-R1-TrRa-iter1-1.5B-lambda_2
Updated
•
5

wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_1.5
Updated
•
7

wh-zhu/DeepSeek-R1-TrRa-1.5B_lambda_0.5
Updated
•
5

wh-zhu/DeepScaleR-7B-WSPO
Updated
•
8

wh-zhu/qwen2_7B-ultrachatfeedback-wspo
Updated
•
6

wh-zhu/qwen2_1.5B-ultrachatfeedback-dpo
Updated
•
8