new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Jun 17

Submitted by

MiniMax-AI

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

·
127 authors

Submitted by

schrodingers-tiger

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

·
27 authors

2

Submitted by

Ayanami0730

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

·
5 authors

1

Submitted by

saksornr

DoTA-RAG: Dynamic of Thought Aggregation RAG

·
5 authors

1

Submitted by

shulin16

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

·
10 authors

1

Submitted by

shuaishuaicdp

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

·
6 authors

Submitted by

Wangchunshu

TaskCraft: Automated Generation of Agentic Tasks

·
17 authors

Submitted by

rp-yu

Discrete Diffusion in Large Language and Multimodal Models: A Survey

·
3 authors

Submitted by

nopyyh

Test3R: Learning to Reconstruct 3D at Test Time

·
5 authors

Submitted by

zhendch

Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression

·
8 authors

1

Submitted by

stormthunder

VGR: Visual Grounded Reasoning

·
11 authors

1

Submitted by

WTNswaggy

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization

·
6 authors

Submitted by

IgnoraZ

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding

·
4 authors

1

Submitted by

LPY

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

·
9 authors

1

Submitted by

joanitolopo

Language Surgery in Multilingual Large Language Models

·
9 authors

1

Submitted by

JJ-TMT

AI Agent Behavioral Science

·
16 authors

1

Submitted by

iwiwi

ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering

·
6 authors

1

Submitted by

viswa-98

LETS Forecast: Learning Embedology for Time Series Forecasting

·
5 authors

2

Submitted by

pranavAL2109

Supernova Event Dataset: Interpreting Large Language Model's Personality through Critical Event Analysis

·
2 authors

1

Submitted by

ZhuangXialie

A Technical Study into Small Reasoning Language Models

·
5 authors

Submitted by

Franck-Dernoncourt

Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition

·
10 authors

Submitted by

Franck-Dernoncourt

MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos

·
8 authors

Submitted by

zainmujahid

Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts

·
4 authors

1

Submitted by

Owenngt

SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance

·
8 authors

1

Submitted by

yerim0210

Incorporating Domain Knowledge into Materials Tokenization

·
5 authors

1

Submitted by

senfu

Steering LLM Thinking with Budget Guidance

·
4 authors

Submitted by

Taegyeonglee

QGuard:Question-based Zero-shot Guard for Multi-modal LLM Safety

·
5 authors

Submitted by

TristanKe

Uncertainty-Aware Remaining Lifespan Prediction from Images

·
3 authors

1

Submitted by

huayangli

SeqPE: Transformer with Sequential Position Encoding

·
8 authors

Submitted by

PChemGuy

Ai-Facilitated Analysis of Abstracts and Conclusions: Flagging Unsubstantiated Claims and Ambiguous Pronouns

·
1 authors

1

Submitted by

williamium

EgoPrivacy: What Your First-Person Camera Says About You?

·
10 authors

Submitted by

dibo

Hatevolution: What Static Benchmarks Don't Tell Us

·
4 authors

1