RL trained models and datasets for instruction-following
AI & ML interests
None defined yet.
Recent Activity
《Constraint Back-translation Improves Complex Instruction Following of Large Language Models》
-
Constraint Back-translation Improves Complex Instruction Following of Large Language Models
Paper • 2410.24175 • Published • 18 -
THU-KEG/Mistral-Crab-SFT
Text Generation • 7B • Updated • 16 • 5 -
THU-KEG/Mistral-Crab-DPO
Text Generation • 7B • Updated • 15 • 4 -
THU-KEG/Llama3-Crab-SFT
Text Generation • Updated • 13
RL trained models and datasets for instruction-following
OpenSAE checkpoints for LLaMA 3.1 8B base model
《Constraint Back-translation Improves Complex Instruction Following of Large Language Models》
-
Constraint Back-translation Improves Complex Instruction Following of Large Language Models
Paper • 2410.24175 • Published • 18 -
THU-KEG/Mistral-Crab-SFT
Text Generation • 7B • Updated • 16 • 5 -
THU-KEG/Mistral-Crab-DPO
Text Generation • 7B • Updated • 15 • 4 -
THU-KEG/Llama3-Crab-SFT
Text Generation • Updated • 13
EMNLP2024 Main Conference: 《Aligning Large Language Models on Information Extraction》