ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13 • 2.15k • 800