12 23 1

Siteng Huang

huangsiteng

https://kyonhuang.top/

AI & ML interests

Embodied AI, Multimodal AI, Generative AI, Efficient AI

Recent Activity

authored a paper about 2 months ago

Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration

authored a paper about 2 months ago

RynnBrain: Open Embodied Foundation Models

upvoted a paper about 2 months ago

RynnBrain: Open Embodied Foundation Models

View all activity

Organizations

commented 2 papers 5 months ago

RynnVLA-002: A Unified Vision-Language-Action and World Model

Paper • 2511.17502 • Published Nov 21, 2025 • 28 •

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Paper • 2508.08896 • Published Aug 12, 2025 • 12 •

commented a paper 6 months ago

High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting

Paper • 2510.10637 • Published Oct 12, 2025 • 15 •

commented a paper 7 months ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 254 •

commented a paper 8 months ago

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Paper • 2508.08896 • Published Aug 12, 2025 • 12 •

commented 3 papers 11 months ago

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Paper • 2505.15791 • Published May 21, 2025 • 6 •

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Paper • 2505.12448 • Published May 18, 2025 • 10 •

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

Paper • 2505.03912 • Published May 6, 2025 • 9 •

commented a paper about 1 year ago

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Paper • 2503.21765 • Published Mar 27, 2025 • 11 •

commented 3 papers over 1 year ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Paper • 2412.06782 • Published Dec 9, 2024 • 7 •

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published Nov 26, 2024 • 19 •

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11, 2024 • 15 •

Siteng Huang

AI & ML interests

Recent Activity

Organizations

huangsiteng's activity