Dataset and Models for "IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards"
guox18
AI & ML interests
Alignment
Recent Activity
Updated a model about 12 hours ago: guox18/Qwen2.5-7B-Instruct-IFDecorator
New activity about 12 hours ago: guox18/IFDecorator: Add paper, project page, and code links to dataset card
Organizations
None yet