NER ITA Collection This collection presents my best models tailored for Named Entity Recognition (NER) tasks, exclusively designed for the Italian language. β’ 3 items β’ Updated 5 days ago β’ 2
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen β’ Mar 26 β’ 149
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen β’ Jan 15 β’ 197
view article Article **Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs** By KnutJaegersberg β’ Dec 19, 2024 β’ 4
view article Article SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive By DavidGF β’ Nov 9, 2024 β’ 9
π«π· Calme-3 Collection Here you can find all the new Calme-3 models β’ 27 items β’ Updated Feb 9 β’ 16
Spectrum: Targeted Training on Signal to Noise Ratio Paper β’ 2406.06623 β’ Published Jun 7, 2024 β’ 14
view article Article Google releases Gemma 2 2B, ShieldGemma and Gemma Scope By Xenova and 3 others β’ Jul 31, 2024 β’ 60
VAGO solutions quants Collection Quantized version for the excellent german speaking models created by VAGO solutions. β’ 6 items β’ Updated Apr 20, 2024 β’ 2
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. β’ 39 items β’ Updated 4 days ago β’ 368
π Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets β’ 8 items β’ Updated Jun 12, 2024 β’ 40
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram β’ Apr 24, 2024 β’ 63
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Dec 6, 2024 β’ 817