moonshotai/Kimi-VL-A3B-Thinking-2506 Image-Text-to-Text • 16B • Updated about 16 hours ago • 42.6k • 243
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 75 items • Updated 1 day ago • 173
view post Post 1341 🚨 MistralAI is back with the mistral small V3 model update and it is free! 👏https://docs.mistral.ai/getting-started/models/models_overview/#free-models🚀 Below is the the provider for reasoning over your dataset rows with custom schema 🧠https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/mistralai_150.pyMy personal usage experience and findings:⚠️The original API usage may constanly fail with the connection.To bypass this limitation, use --attempts [COUNT] to withstand connection loss while iterating through JSONL/CSV data (see 📷 below)💵 It is actually: ~0.18 USD 1M tokens🌟 Framework: https://github.com/nicolay-r/bulk-chain See translation 🔥 3 3 + Reply