2 12 4

Yifei Li

JoeLeelyf

https://joeleelyf.github.io/

JoeLeelyf

AI & ML interests

MLLMs, Deepfake Detection, Computer Vision

Recent Activity

upvoted a paper 1 day ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

updated a dataset about 1 month ago

JoeLeelyf/NeXT-IMDL

published a dataset about 1 month ago

JoeLeelyf/NeXT-IMDL

View all activity

Organizations

None yet

upvoted a paper 1 day ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published 2 days ago • 24

updated a dataset about 1 month ago

JoeLeelyf/NeXT-IMDL

Preview • Updated May 15 • 38

published a dataset about 1 month ago

JoeLeelyf/NeXT-IMDL

Preview • Updated May 15 • 38

upvoted a paper 3 months ago

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

updated a dataset 3 months ago

JoeLeelyf/OVO-Bench

Viewer • Updated Mar 23 • 2.06k • 962 • 5

New activity in JoeLeelyf/OVO-Bench 3 months ago

Maybe an error in "realtime"

#3 opened 4 months ago by

gogorunrun

upvoted a paper 4 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

liked a dataset 4 months ago

THUdyh/Ola-Data

Viewer • Updated Feb 24 • 363k • 1.54k • 8

upvoted a paper 4 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 42

liked a dataset 4 months ago

HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 2.04k • 311

liked a model 4 months ago

parler-tts/parler-tts-large-v1

Text-to-Speech • Updated Nov 22, 2024 • 23k • 255

upvoted 2 papers 5 months ago

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published Feb 7 • 65

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20 • 30

New activity in JoeLeelyf/OVO-Bench 5 months ago

Add task category, paper, code and project page link

#2 opened 5 months ago by

nielsr

authored a paper 5 months ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 44

upvoted a paper 5 months ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 44

upvoted 2 papers 6 months ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 45

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 37

liked a dataset 6 months ago

JoeLeelyf/OVO-Bench

Viewer • Updated Mar 23 • 2.06k • 962 • 5

updated a dataset 6 months ago

JoeLeelyf/OVO-Bench

Viewer • Updated Mar 23 • 2.06k • 962 • 5

Yifei Li

AI & ML interests

Recent Activity

Organizations

JoeLeelyf's activity

Maybe an error in "realtime"

Add task category, paper, code and project page link