Dehao Huang
red0orange
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
upvoted a paper about 2 months ago
Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start