Stoney Kang
sikang99
AI & ML interests
Remote Control based on Vision
Recent Activity
upvoted
a
paper
1 day ago
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
liked
a model
2 days ago
HuggingFaceTB/SmolLM3-3B
Organizations
Diffusion Model
Vision Processing
VLA Models
Vision Language Models for Robotics
-
Unified Vision-Language-Action Model
Paper • 2506.19850 • Published • 25 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 114 -
3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published • 10 -
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Paper • 2312.14457 • Published • 1
3D Generation
Diffusion Models
VLM, MLLM
Diffusion Model
Reinforcement Learning
Vision Processing
Simulation
VLA Models
Vision Language Models for Robotics
-
Unified Vision-Language-Action Model
Paper • 2506.19850 • Published • 25 -
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper • 2506.01844 • Published • 114 -
3D-VLA: A 3D Vision-Language-Action Generative World Model
Paper • 2403.09631 • Published • 10 -
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
Paper • 2312.14457 • Published • 1
AI Agents
3D Generation
Video Generation