Sebastian Stavar

sebastavar
ยท

AI & ML interests

Text Generation & Chat Assistants; Model Compression & Quantization (Q4/Q6/Q8, gs32); Inference & Serving (on-prem, low-latency); RAG / Retrieval; Agents & Tool Use; Distillation / LoRA / Fine-tuning

Recent Activity

updated a model about 10 hours ago
halley-ai/gpt-oss-120b-MLX-bf16
updated a model about 10 hours ago
halley-ai/gpt-oss-120b-MLX-8bit-gs32
liked a model about 10 hours ago
halley-ai/gpt-oss-120b-MLX-6bit-gs64
View all activity

Organizations

Halley AI's profile picture