6 124 29

meng shao

meng-shao

shao__meng

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

upvoted a paper about 2 months ago

What Makes a Good Natural Language Prompt?

upvoted a paper 2 months ago

WebDancer: Towards Autonomous Information Seeking Agency

View all activity

Organizations

upvoted a paper 25 days ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published 27 days ago • 197

upvoted a paper about 2 months ago

What Makes a Good Natural Language Prompt?

Paper • 2506.06950 • Published Jun 7 • 11

upvoted 2 papers 2 months ago

WebDancer: Towards Autonomous Information Seeking Agency

Paper • 2505.22648 • Published May 28 • 23

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge

Paper • 2505.10468 • Published May 15 • 9

liked a Space 2 months ago

TEN Agent Demo

🔥

A Conversational Voice AI Agent powered by the TEN Framework

liked a model 2 months ago

TEN-framework/TEN_Turn_Detection

Text Generation • 8B • Updated May 27 • 1.4k • 37

upvoted 3 papers 3 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 66

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 52

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 48

liked a model 3 months ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • 685B • Updated Apr 30 • 1.71k • • 804

upvoted 4 papers 5 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 80

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 88

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Paper • 2502.20730 • Published Feb 28 • 38

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published Feb 14 • 54

upvoted 2 papers 6 months ago

LM2: Large Memory Models

Paper • 2502.06049 • Published Feb 9 • 30

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 155

liked a Space 6 months ago

507

Chat with DeepSeek-VL2-small

🌍

Generate responses using images and text input

upvoted a paper 6 months ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3 • 24

reacted to AdinaY's post with 🔥 6 months ago

Post

1464

VideoLLaMA 3🔥multimodal foundation models for Image and Video Understanding by DAMO Alibaba

Model: DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
Paper: VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (2501.13106)

✨ 2B/7B
✨ Apache2.0

1 reply