Holo1 Collection Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10 • 48
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 52
Jina Reader-LM Collection Convert HTML content to LLM-friendly Markdown/JSON content • 4 items • Updated 8 days ago • 15
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 400
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91
view article Article Speculative Decoding for 2x Faster Whisper Inference By sanchit-gandhi • Dec 20, 2023 • 29
view article Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face By mhillsmith and 2 others • May 3, 2024 • 14
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity Paper • 2402.08846 • Published Feb 13, 2024 • 1
Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task Paper • 2401.02909 • Published Jan 5, 2024 • 2
view article Article Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers By patrickvonplaten • Nov 15, 2021 • 29
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts Paper • 2309.11977 • Published Sep 21, 2023 • 2
AgentTuning: Enabling Generalized Agent Abilities for LLMs Paper • 2310.12823 • Published Oct 19, 2023 • 36