NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos-GGUF • Reinforcement Learning • Updated 15 days ago • 346 • 5
DeepMostInnovations/sales-conversion-model-reinf-learning • Reinforcement Learning • Updated about 17 hours ago • 93 • 15
mradermacher/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb-i1-GGUF • Reinforcement Learning • Updated 3 days ago • 580 • 2
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos • Reinforcement Learning • Updated 27 days ago • 74 • 5
Nellyw888/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 5 days ago • 929 • 1
Nellyw888/VeriReason-Qwen2.5-1.5b-RTLCoder-Verilog-GRPO-reasoning-tb • Reinforcement Learning • Updated 5 days ago • 19 • 1
mradermacher/VeriReason-Qwen2.5-7b-SFT-Reasoning-GGUF • Reinforcement Learning • Updated 3 days ago • 433 • 1
mradermacher/VeriReason-Qwen2.5-1.5B-grpo-small-GGUF • Reinforcement Learning • Updated 4 days ago • 310 • 1
mradermacher/VeriReason-Qwen2.5-7b-SFT-Reasoning-i1-GGUF • Reinforcement Learning • Updated 3 days ago • 537 • 1
mradermacher/VeriReason-Qwen2.5-7b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF • Reinforcement Learning • Updated 3 days ago • 274 • 1