Wei Liu
PeterV09
AI & ML interests
Machine Learning, Natural Language Processing
Recent Activity
updated
a model
1 day ago
RL4Reasoning/verl-grpo-lr-deepscaler-bsz128-16384-norm-length-0.2-hf-1.5B-2_deepscaler_-380
published
a model
1 day ago
RL4Reasoning/verl-grpo-lr-deepscaler-bsz128-16384-norm-length-0.2-hf-1.5B-2_deepscaler_-380
updated
a model
1 day ago
RL4Reasoning/verl-grpo-lr-deepscaler-bsz128-16384-norm-length-0.4-hf-1.5B-2_deepscaler_-280
Organizations
Collections
2
Papers
2
models
18
PeterV09/llava-1.6-alignmentv2
Text Generation
•
Updated
•
1
PeterV09/llava-1.6-beta-26
Updated
PeterV09/llava-1.6-asft
Updated
PeterV09/llava-1.6-4sftmse
Updated
•
1
PeterV09/llava-1.6-3sft0.5
Updated
•
1
PeterV09/llava-1.6-2sft
Updated
PeterV09/llava-1.6-sft
Text Generation
•
Updated
•
1
PeterV09/mistral-7b-300k-6k-a100-6e-valid-hkust_2-l4k
Text Generation
•
Updated
PeterV09/deita-6k-sft-fordpo
Text Generation
•
Updated
PeterV09/mistral-7b-300k-6k-a100-6e-valid-7
Text Generation
•
Updated
datasets
None public yet