11 16 47

Zihao Wang

zhwang4ai

https://zhwang4ai.github.io

zhwang4ai

AI & ML interests

Machine Learning

Recent Activity

new activity 22 days ago

CraftJarvis/minecraft-motionha-qwen2vl-7b-2509:Access to the model

upvoted a paper about 2 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

upvoted a paper 3 months ago

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

View all activity

Organizations

New activity in CraftJarvis/minecraft-motionha-qwen2vl-7b-2509 22 days ago

Access to the model

#1 opened 22 days ago by

3ndetz

commented 5 papers 4 months ago

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 54 •

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 54 •

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 54 •

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 54 •

Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents

Paper • 2510.23691 • Published Oct 27, 2025 • 54 •

New activity in CraftJarvis/minecraft-vla-sft 11 months ago

Add paper link and task category

#1 opened 11 months ago by

nielsr

commented 2 papers 11 months ago

JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse

Paper • 2503.16365 • Published Mar 20, 2025 • 41 •

Open-World Skill Discovery from Unsegmented Demonstrations

Paper • 2503.10684 • Published Mar 11, 2025 • 5 •

commented 4 papers over 1 year ago

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27, 2024 • 13 •

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27, 2024 • 13 •

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27, 2024 • 13 •

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

Paper • 2407.00114 • Published Jun 27, 2024 • 13 •

New activity in codeparrot/github-code about 2 years ago

languges seems not working

👍 2

#5 opened about 3 years ago by

tianyang

New activity in llava-hf/llava-1.5-7b-hf about 2 years ago

How to translate the llava-1.5-7b into llava-1.5-7b-hf?

#3 opened about 2 years ago by

zhwang4ai

Zihao Wang

AI & ML interests

Recent Activity

Organizations

zhwang4ai's activity

Access to the model

Add paper link and task category

languges seems not working

How to translate the llava-1.5-7b into llava-1.5-7b-hf?