Snowflake/snowflake-arctic-embed-l-v2.0 Sentence Similarity • 0.6B • Updated Apr 25 • 147k • • 186
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • 24B • Updated May 9 • 221k • • 1.29k
bartowski/google_gemma-3-27b-it-qat-GGUF Image-Text-to-Text • 27B • Updated Apr 22 • 4.35k • 32
bartowski/google_gemma-3-12b-it-qat-GGUF Image-Text-to-Text • 12B • Updated Apr 18 • 2.85k • 22
google/gemma-3-27b-it-qat-q4_0-unquantized Image-Text-to-Text • 27B • Updated Apr 15 • 7.79k • 33
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated 7 days ago • 162k • • 323
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other • Jan 16 • 75