Nellyw888/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 4 days ago • 19 • 1
RakshitAralimatti/Qwen2.5-Coder-1.5B-Instruct-RASA-CALM • Text Generation • Updated Dec 13, 2024 • 25 • 1
cutelemonlili/Qwen2.5-Coder-1.5B-Instruct_MATH_training_response_Qwen_QwQ_32B_Preview_common_correct_level • Text Generation • Updated Dec 29, 2024 • 27
cutelemonlili/Qwen2.5-Coder-1.5B-Instruct_MATH_training_Qwen2.5-32B-Instruct • Text Generation • Updated Dec 29, 2024 • 23
cutelemonlili/Qwen2.5-Coder-1.5B-Instruct_MATH_training_response_Qwen2.5-32B-Instruct_common_correct_level • Text Generation • Updated Dec 29, 2024 • 24
cutelemonlili/Qwen2.5-Coder-1.5B-Instruct_MATH_training_Qwen_QwQ_32B_Preview • Text Generation • Updated Dec 29, 2024 • 24