Effectively Controlling Reasoning Models through Thinking Intervention Paper • 2503.24370 • Published 8 days ago • 18
Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy Paper • 2410.09102 • Published Oct 9, 2024 • 1
SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe Paper • 2410.05248 • Published Oct 7, 2024 • 8