view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 135
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 8 days ago • 87 • 4
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 8 days ago • 87 • 4
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 8 days ago • 106 • 2
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 8 days ago • 106 • 2
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 8 days ago • 63 • 3
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 8 days ago • 63 • 3