2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 107
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 89
PERSE: Personalized 3D Generative Avatars from A Single Portrait Paper • 2412.21206 • Published Dec 30, 2024 • 19
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published Dec 30, 2024 • 23
Can Large Language Models Help Developers with Robotic Finite State Machine Modification? Paper • 2412.05625 • Published Dec 7, 2024
MentalLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models Paper • 2309.13567 • Published Sep 24, 2023 • 3
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Paper • 2501.12273 • Published Jan 21 • 14
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Paper • 2304.09842 • Published Apr 19, 2023 • 2