- Gemini Robotics: Bringing AI into the Physical World
  Paper • 2503.20020 • Published • 21
- Magma: A Foundation Model for Multimodal AI Agents
  Paper • 2502.13130 • Published • 57
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 50
- OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
  Paper • 2410.23218 • Published • 50
Jaehyun Jun
btjhjeon
AI & ML interests
Multimodal
Recent Activity
Updated a collection about 6 hours ago: Multimodal LLM
Organizations
Collections
11
models
None public yet
datasets
None public yet