This collection contains four initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and the data needed for V-IRL training.
Tianzhe
tianzhechu
Collections: 1
Papers: 1
Models: 11
tianzhechu/qwen-3b-sft-1200 • 1
tianzhechu/qwen-3b-sft-800 • 8
tianzhechu/qwen-7b-sft • 9
tianzhechu/qwen-3b-sft • 3
tianzhechu/GP-L-Init-v2 • 5
tianzhechu/GP-L-RL20 • 16
tianzhechu/GL-L-RL20
tianzhechu/VIRL-VL-Init • 21
tianzhechu/VIRL-L-Init • 45 • 1
tianzhechu/GP-L-Init • 43