Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? By Kseniase • Mar 17 • 324
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper • 2401.00812 • Published Jan 1, 2024 • 11
Article Mixture of Experts Explained By osanseviero and 5 others • Dec 11, 2023 • 797
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 6 days ago • 37
Article A failed experiment: Infini-Attention, and why we should keep trying? By neuralink and 2 others • Aug 14, 2024 • 68
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 244
Article Fine-Tune Whisper with 🤗 Transformers By sanchit-gandhi • Nov 3, 2022 • 269
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Paper • 2407.07895 • Published Jul 10, 2024 • 43
Gemma 2: Improving Open Language Models at a Practical Size Paper • 2408.00118 • Published Jul 31, 2024 • 80