Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 4 • 15
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness Paper • 2504.10514 • Published Apr 10 • 48
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges Paper • 2501.02189 • Published Jan 4 • 1
Self-Rewarding Vision-Language Model via Reasoning Decomposition Paper • 2508.19652 • Published Aug 27 • 84
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published Nov 19 • 52
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents Paper • 2306.06306 • Published Jun 9, 2023 • 1
MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning Paper • 2311.10774 • Published Nov 15, 2023 • 2
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published Nov 4 • 58
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Paper • 2504.15271 • Published Apr 21 • 67
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28, 2024 • 86