2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 25 days ago • 98
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published 30 days ago • 81
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published 27 days ago • 17
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published 27 days ago • 21
Can Large Language Models Help Developers with Robotic Finite State Machine Modification? Paper • 2412.05625 • Published Dec 7, 2024
MentalLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models Paper • 2309.13567 • Published Sep 24, 2023 • 3