JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse • Paper • arXiv:2503.16365 • Published Mar 2025
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing • Paper • arXiv:2503.10639 • Published Mar 2025
Gemini Embedding: Generalizable Embeddings from Gemini • Paper • arXiv:2503.07891 • Published Mar 2025
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference • Paper • arXiv:2502.18411 • Published Feb 25, 2025
EgoLife • Collection • CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 28 days ago
Multimodal-SAE • Collection • Sparse autoencoders (SAEs) hooked on LLaVA • 5 items • Updated about 1 month ago
LLaVA-Video • Collection • Models focused on video understanding (previously known as LLaVA-NeXT-Video) • 8 items • Updated Feb 21, 2025
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models • Paper • arXiv:2412.09645 • Published Dec 10, 2024
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models • Paper • arXiv:2411.14982 • Published Nov 22, 2024
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels • Paper • arXiv:2405.07526 • Published May 13, 2024
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation • Paper • arXiv:2410.13861 • Published Oct 17, 2024
LLaVA-Critic • Collection • A general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024
Eureka: Human-Level Reward Design via Coding Large Language Models • Paper • arXiv:2310.12931 • Published Oct 19, 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback • Paper • arXiv:2310.08588 • Published Oct 12, 2023