- ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges (arXiv:2503.06553, published 24 days ago, 8 upvotes)
- Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders (arXiv:2503.03601, published 28 days ago, 221 upvotes)
- MinorBench: A Hand-Built Benchmark for Content-Based Risks for Children (arXiv:2503.10242, published 20 days ago, 4 upvotes)
- VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering (arXiv:2503.06492, published 24 days ago, 10 upvotes)
- YuE: Scaling Open Foundation Models for Long-Form Music Generation (arXiv:2503.08638, published 22 days ago, 60 upvotes)
- SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation (arXiv:2502.08168, published Feb 12, 12 upvotes)
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning (arXiv:2502.06781, published Feb 10, 60 upvotes)
- OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in the Financial Domain (arXiv:2412.13018, published Dec 17, 2024, 41 upvotes)
- Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling (arXiv:2412.05271, published Dec 6, 2024, 150 upvotes)
- MindSearch: Mimicking Human Minds Elicits Deep AI Searcher (arXiv:2407.20183, published Jul 29, 2024, 43 upvotes)
- CompassJudger-1: All-in-One Judge Model Helps Model Evaluation and Evolution (arXiv:2410.16256, published Oct 21, 2024, 60 upvotes)
- Law of the Weakest Link: Cross Capabilities of Large Language Models (arXiv:2409.19951, published Sep 30, 2024, 54 upvotes)