Yuseung "Phillip" Lee

phillipinseoul

https://phillipinseoul.github.io/

AI & ML interests

Computer Vision

Recent Activity

liked a model about 1 hour ago

liuhaotian/llava-v1.5-13b

Image-Text-to-Text • Updated May 9, 2024 • 330k • 505

upvoted a paper about 2 hours ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published about 13 hours ago • 35

upvoted a paper about 20 hours ago

RoboBrain 2.0 Technical Report

Paper • 2507.02029 • Published 7 days ago • 21

upvoted a paper 1 day ago

StreamDiT: Real-Time Streaming Text-to-Video Generation

Paper • 2507.03745 • Published 5 days ago • 20

liked a model 1 day ago

TRI-ML/prismatic-vlms

Image-to-Text • Updated May 6, 2024 • 21

upvoted a paper 1 day ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published 7 days ago • 27

upvoted a paper 2 days ago

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published 6 days ago • 22

upvoted 4 papers 5 days ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published 9 days ago • 74

upvoted 2 papers 7 days ago

MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings

Paper • 2506.23115 • Published 10 days ago • 36

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published 8 days ago • 174

liked a model 9 days ago

remyxai/SpaceThinker-Qwen2.5VL-3B

Image-Text-to-Text • 4B • Updated 18 days ago • 3.76k • 22

upvoted 2 papers 9 days ago

Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs

Paper • 2506.21656 • Published 13 days ago • 13

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

Paper • 2506.17450 • Published 18 days ago • 60

liked a dataset 10 days ago

uoft-cs/cifar10

Viewer • Updated Jan 4, 2024 • 60k • 57k • 77

upvoted 2 papers 12 days ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published 14 days ago • 59

WorldVLA: Towards Autoregressive Action World Model

Paper • 2506.21539 • Published 13 days ago • 36

upvoted a paper 14 days ago

Unified Vision-Language-Action Model

Paper • 2506.19850 • Published 15 days ago • 23
