Yury Panikov
panikov
AI & ML interests
None yet
Recent Activity
commented on a paper 3 days ago
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
upvoted a paper 3 days ago
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
commented on a paper 4 days ago
Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models
Organizations
None yet
Models: 0 (none public yet)
Datasets: 0 (none public yet)