
Siteng Huang

huangsiteng

https://kyonhuang.top/

AI & ML interests

Embodied AI, Multimodal AI, Generative AI, Efficient AI

Recent Activity

authored a paper about 2 months ago

Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration

authored a paper about 2 months ago

RynnBrain: Open Embodied Foundation Models

upvoted a paper about 2 months ago

RynnBrain: Open Embodied Foundation Models



upvoted a paper about 2 months ago

RynnBrain: Open Embodied Foundation Models

Paper • 2602.14979 • Published Feb 13 • 45

upvoted a collection 2 months ago

RynnBrain

Collection

11 items • Updated 2 days ago • 24

upvoted a paper 2 months ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

upvoted a paper 3 months ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 215

upvoted a paper 4 months ago

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Paper • 2512.09928 • Published Dec 10, 2025 • 14

upvoted a paper 5 months ago

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published Nov 21, 2025 • 28

upvoted a paper 6 months ago

High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting

Paper • 2510.10637 • Published Oct 12, 2025 • 15

upvoted 3 papers 7 months ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25, 2025 • 104

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Paper • 2509.15212 • Published Sep 18, 2025 • 22

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 254

upvoted a paper 8 months ago

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Paper • 2508.08896 • Published Aug 12, 2025 • 12

upvoted an article 8 months ago

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Article • Aug 11, 2025

upvoted a paper 10 months ago

WorldVLA: Towards Autoregressive Action World Model

Paper • 2506.21539 • Published Jun 26, 2025 • 40

upvoted 4 papers 11 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25, 2025 • 145

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Paper • 2505.15791 • Published May 21, 2025 • 6

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Paper • 2505.12448 • Published May 18, 2025 • 10

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

Paper • 2505.03912 • Published May 6, 2025 • 9

upvoted 2 papers about 1 year ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28, 2025 • 38

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Paper • 2503.21765 • Published Mar 27, 2025 • 11

upvoted a paper over 1 year ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Paper • 2412.06782 • Published Dec 9, 2024 • 7
