zitian gao
zgao3186
AI & ML interests
None yet
Recent Activity
authored a paper 6 days ago
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
upvoted a paper 6 days ago
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
upvoted a paper 4 months ago
Interpretable Contrastive Monte Carlo Tree Search Reasoning
Organizations
None yet
zgao3186's activity
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
1
#1 opened 11 months ago by makongduo4112