-
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Paper • 2311.08046 • Published • 2 -
nvidia/GR00T-N1-2B
Robotics • 2B • Updated • 5.34k • 318 -
nvidia/Eagle2-1B
Image-Text-to-Text • 1B • Updated • 1.59k • 24 -
nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim
Updated • 475k • 133
kurogane himaki
kurogane
AI & ML interests
None yet
Recent Activity
liked
a model
18 days ago
BAAI/RoboBrain2.0-7B
upvoted
a
paper
about 1 month ago
Paper2Poster: Towards Multimodal Poster Automation from Scientific
Papers
upvoted
a
paper
about 1 month ago
VSSD: Vision Mamba with Non-Casual State Space Duality
Organizations
None yet