5 10 27

Chi Tran

ambivalent02

baochi0212

AI & ML interests

LLM, RAG, VLLM

Recent Activity

updated a model 1 day ago

ambivalent02/eval_qwen25_lora

published a model 2 days ago

ambivalent02/eval_qwen25_lora

published a dataset 2 days ago

ambivalent02/eval_qwen25_lora

View all activity

Organizations

None yet

updated a model 1 day ago

ambivalent02/eval_qwen25_lora

Updated 1 day ago

published a model 2 days ago

ambivalent02/eval_qwen25_lora

Updated 1 day ago

published a dataset 2 days ago

ambivalent02/eval_qwen25_lora

Updated 2 days ago

liked a model 11 days ago

Qwen/Qwen2.5-Omni-3B

Any-to-Any • Updated Apr 30 • 91.2k • 241

liked a dataset 17 days ago

ICTNLP/InstructS2S-200K

Viewer • Updated May 19 • 200k • 1.46k • 3

liked a model 20 days ago

openbmb/MiniCPM4-0.5B

Text Generation • Updated 16 days ago • 5.91k • 49

liked a Space 28 days ago

2.72k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 collections about 1 month ago

VisionLM

Collection

1281 items • Updated 2 days ago • 77

RL+reason model

Collection

188 items • Updated about 13 hours ago • 11

liked a model about 1 month ago

ACE-Step/ACE-Step-v1-3.5B

Text-to-Audio • Updated May 22 • 525

upvoted a paper about 2 months ago

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Paper • 2505.00703 • Published May 1 • 43

updated a dataset 3 months ago

ambivalent02/harmful_hoangphan

Preview • Updated Mar 26 • 41

published a dataset 3 months ago

ambivalent02/harmful_hoangphan

Preview • Updated Mar 26 • 41

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 169

updated a dataset 4 months ago

ambivalent02/mulberry_proc

Updated Mar 4 • 32

published a dataset 4 months ago

ambivalent02/mulberry_proc

Updated Mar 4 • 32

updated a model 4 months ago

ambivalent02/vai_2.5_4epoch

Updated Feb 22 • 7

published a model 4 months ago

ambivalent02/vai_2.5_4epoch

Updated Feb 22 • 7

updated a model 4 months ago

ambivalent02/test_beacon_format

Updated Feb 15

published a model 4 months ago

ambivalent02/test_beacon_format

Updated Feb 15

Chi Tran

AI & ML interests

Recent Activity

Organizations

ambivalent02's activity

The Ultra-Scale Playbook

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge