versions of vilaw model
Lê Võ Quyết Thắng PRO
thangvip
AI & ML interests
Adapting LLMs to specific domains
Recent Activity
upvoted a paper 5 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
upvoted an article 22 days ago
Vision Language Models (Better, Faster, Stronger)
Organizations
Collections 3
Spaces 2
Models 85

thangvip/qwen-2.5-vl-3b-lora-brainrot-new-128-256
thangvip/qwen-2.5-vl-3b-lora-brainrot-new
thangvip/qwen-2.5-vl-3b-lora-brainrot-256
thangvip/qwen-2.5-vl-3b-lora-brainrot
thangvip/qwen-2.5-vl-7b-4bit-lora-brainrot
thangvip/qwen-2.5-vl-3b-lora-brr
thangvip/qwen-2.5-vl-7b-4bit-brr
thangvip/vwen2.5-1.5b-evol • Question Answering • 31
thangvip/sailor2-1b-evol • Question Answering • 21
thangvip/vlama-1b-instruct • Text Generation • 12

Datasets 73
thangvip/image-query-vie • 117k • 142
thangvip/GeneralThought-Filtered-230k • 231k • 19
thangvip/brr_training_dataset • 2.38k • 25
thangvip/vilaw-sailor-dpo-output • 1k • 26
thangvip/vilaw-sailor-sft-output • 1k • 24
thangvip/vilaw-qwen-sft-output • 1k • 22
thangvip/vilaw-qwen-dpo-output • 1k • 20
thangvip/vilaw-sailor-dpo-ds • 3.15k • 22
thangvip/vilaw-qwen-dpo-ds • 3.15k • 23
thangvip/vilaw-qwen-sft-comparison • 3.15k • 18