Yi Yang (SII)

Yysrc

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

Yysrc/piper_unscrew_the_bottle_cap

published a dataset about 1 month ago

Yysrc/piper_unscrew_the_bottle_cap

updated a model about 1 month ago

Yysrc/robotwin_all_image_action

upvoted a paper about 1 month ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published Apr 2 • 32

upvoted a paper 4 months ago

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Paper • 2601.10332 • Published Jan 15 • 32

upvoted 3 papers 5 months ago

upvoted a paper 8 months ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 256

upvoted 3 papers 9 months ago

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13, 2024 • 47

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

Paper • 2412.18194 • Published Dec 24, 2024 • 1

Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Paper • 2508.09192 • Published Aug 8, 2025 • 30

upvoted 2 papers 11 months ago

WorldVLA: Towards Autoregressive Action World Model

Paper • 2506.21539 • Published Jun 26, 2025 • 40

OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning

Paper • 2505.11917 • Published May 17, 2025 • 1

upvoted 2 articles about 1 year ago

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Article • merve, andsteing, pcuenq • May 14, 2024 • 287

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Article • smangrul, sayakpaul • Feb 10, 2023 • 119

upvoted 4 papers about 1 year ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published Mar 13, 2025 • 57

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1, 2025 • 67

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

Paper • 2502.05415 • Published Feb 8, 2025 • 20

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published Feb 19, 2025 • 32