LlaSMol Collection LLMs tuned on the SMolInstruct dataset for chemistry tasks. • 6 items • Updated Feb 4 • 2
AmpleGCG Collection Generative models to produce GCG-like adversarial suffixes • 7 items • Updated Feb 4 • 2
UGround Collection UGround: Universal GUI Visual Grounding for GUI Agents (ICLR'25 Oral) • 9 items • Updated 21 days ago • 4
A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Paper • 2311.04157 • Published Nov 7, 2023
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI Paper • 2311.16502 • Published Nov 27, 2023 • 35
BIOCLIP: A Vision Foundation Model for the Tree of Life Paper • 2311.18803 • Published Nov 30, 2023 • 1
Bootstrapping a User-Centered Task-Oriented Dialogue System Paper • 2207.05223 • Published Jul 11, 2022
arXivEdits: Understanding the Human Revision Process in Scientific Writing Paper • 2210.15067 • Published Oct 26, 2022
Sparse Autoencoders for Scientifically Rigorous Interpretation of Vision Models Paper • 2502.06755 • Published 27 days ago • 7