ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published 21 days ago • 63
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 265
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13 • 28
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated 30 days ago • 216
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published Nov 13, 2024 • 27
Chain of Ideas: Revolutionizing Research in Novel Idea Development with LLM Agents Paper • 2410.13185 • Published Oct 17, 2024 • 6
LLaVA-Video Collection Models focused on video understanding (previously known as LLaVA-NeXT-Video). • 8 items • Updated Feb 21 • 61
LLaVA-Critic Collection A general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024 • 10
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22, 2024 • 95
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper • 2410.02712 • Published Oct 3, 2024 • 37
ColPali: Efficient Document Retrieval with Vision Language Models 👀 Article • By manu • Jul 5, 2024 • 253
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages Paper • 2407.19672 • Published Jul 29, 2024 • 58