6 23 42

Haoqin Tu

PahaII

https://www.haqtu.me/

ImKeTT

AI & ML interests

generation, latent variable models

Recent Activity

updated a model 1 day ago

PahaII/maplillary_results

published a model 4 days ago

PahaII/maplillary_results

liked a model about 1 month ago

UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B

View all activity

Organizations

updated a model 1 day ago

PahaII/maplillary_results

Updated 1 day ago

published a model 4 days ago

PahaII/maplillary_results

Updated 1 day ago

liked a model about 1 month ago

UCSC-VLAA/VLAA-Thinker-Qwen2.5VL-3B

4B • Updated Mar 31 • 5.85k • 5

upvoted an article about 2 months ago

Article

Vision Language Models Explained

and 1 other •

Apr 11, 2024

• 415

updated a dataset 2 months ago

UCSC-VLAA/PARADE_audio

Viewer • Updated May 11 • 938 • 40

upvoted 2 papers 2 months ago

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Paper • 2505.03981 • Published May 6 • 15

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7 • 27

commented a paper 2 months ago

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Paper • 2505.04601 • Published May 7 • 27 •

upvoted 2 collections 2 months ago

VLAA-Thinker

Collection

6 items • Updated Apr 17 • 4

OpenVision

Collection

27 items • Updated May 8 • 29

liked a model 3 months ago

Skywork/Skywork-VL-Reward-7B

Image-Text-to-Text • 8B • Updated Jun 10 • 410 • 41

authored 4 papers 3 months ago

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Paper • 2412.18551 • Published Dec 24, 2024

Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning

Paper • 2502.11751 • Published Feb 17

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Paper • 2504.01903 • Published Apr 2

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10 • 29

upvoted a paper 3 months ago

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Paper • 2504.11468 • Published Apr 10 • 29

upvoted an article 3 months ago

Article

What is test-time compute and how to scale it?

and 1 other •

Feb 6

• 96

liked a dataset 3 months ago

UCSC-VLAA/STAR-1

Viewer • Updated Apr 4 • 1k • 178 • 10

liked 2 datasets 4 months ago

UCSC-VLAA/ViLBench

Preview • Updated Mar 27 • 31 • 2

UCSC-VLAA/ViLReward-73K

Viewer • Updated Mar 27 • 73.6k • 43 • 2

Haoqin Tu

AI & ML interests

Recent Activity

Organizations

PahaII's activity

Vision Language Models Explained

What is test-time compute and how to scale it?