Manipulation
Collection
Manipulation-related datasets and models
•
15 items
•
Updated
•
4
InternVLA-M1 is an open-source, end-to-end vision–language–action (VLA) framework for building and researching generalist robot policies. The checkpoints in this repository were pretrained on the system2 dataset.
@misc{internvla2024,
title = {InternVLA-M1: Latent Spatial Grounding for Instruction-Following Robotic Manipulation},
author = {InternVLA-M1 Contributors},
year = {2025},
booktitle={arXiv},
}