Experimental Model
Collection
Tried to mimic GRPO but using SFT
โข
2 items
โข
Updated
saishshinde15/TethysAI_Vortex_Reasoning_GGUF
TethysAI Vortex Reasoning is an experimental model designed to replicate the advanced reasoning abilities of TethysAI_Base_Reasoning, which was originally enhanced using GRPO. Instead of GRPO, this model was fine-tuned with high-quality structured data using high-end Supervised Fine-Tuning (SFT) to replicate the step-by-step thinking and self-questioning mechanisms seen in models like DeepSeek-R1.
This model has been optimized for efficient inference in GGUF format, allowing for deployment on CPU-based systems and lightweight edge devices without sacrificing reasoning capabilities.
๐น Advanced Self-Reasoning:
๐น No GRPO, Only High-End SFT:
๐น Optimized for GGUF Inference:
You are an advanced AI assistant. Provide answers in a clear, step-by-step manner.
Base model
Qwen/Qwen2.5-3B