view article Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • 21 days ago • 398
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.25k
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 2 days ago • 57
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 65
view article Article What We Learned About LLM/VLMs in Healthcare AI Evaluation: By shanchen • Nov 8, 2024 • 13
view article Article How to generate text: using different decoding methods for language generation with Transformers By patrickvonplaten • Mar 1, 2020 • 209
view article Article The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 58
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 69
view article Article SmolVLM Grows Smaller – Introducing the 250M & 500M Models! By andito and 2 others • Jan 23 • 180
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • Jan 23 • 68
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 41
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published Jan 16 • 72
Tools 4 Agents Collection This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools • 5 items • Updated Jan 14 • 7
mechanistic interpretability with sparse autoencoders Collection A collection of papers that I found useful for learning about using Sparse Autoencoders for finding interpretable features in language models • 9 items • Updated Sep 3, 2024 • 2
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others • Dec 23, 2024 • 19
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 33
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚 By Isayoften • Aug 26, 2024 • 64