Sebastian Stavar
sebastavar
AI & ML interests
Text Generation & Chat Assistants; Model Compression & Quantization (Q4/Q6/Q8, gs32); Inference & Serving (on-prem, low-latency); RAG / Retrieval; Agents & Tool Use; Distillation / LoRA / Fine-tuning
Recent Activity
updated
a model
about 10 hours ago
halley-ai/gpt-oss-120b-MLX-bf16
updated
a model
about 10 hours ago
halley-ai/gpt-oss-120b-MLX-8bit-gs32
liked
a model
about 10 hours ago
halley-ai/gpt-oss-120b-MLX-6bit-gs64