IFDecorator - a guox18 Collection

guox18 's Collections

IFDecorator

updated 1 day ago

Dataset and Models for ''IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards''

guox18/IFDecorator

Preview • Updated about 17 hours ago • 55 • 1

Note Datasets
guox18/Qwen2.5-7B-Instruct-IFDecorator

Text Generation • 8B • Updated about 17 hours ago • 4
guox18/Llama3.1-8B-Instruct-IFDecorator

Updated 2 days ago
guox18/Qwen3-8B-IFDecorator

8B • Updated 1 day ago • 1
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Paper • 2508.04632 • Published 2 days ago • 2