Daily Papers

by AK and the research community

May 26

Submitted by

EilamSha

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

·
3 authors

4

Submitted by

Wanfq

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

·
10 authors

3

Submitted by

Nardien

Distilling LLM Agent into Small Models with Retrieval and Code Tools

·
5 authors

5

Submitted by

BlackSamorez

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

·
8 authors

2

Submitted by

yjyjyj98

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

·
5 authors

2

Submitted by

Ryan1122

One RL to See Them All: Visual Triple Unified Reinforcement Learning

·
10 authors

2

Submitted by

taki555

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

·
19 authors

4

Submitted by

shenwzh3

QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization

·
15 authors

Submitted by

RyanLiu112

Scaling Image and Video Generation via Test-Time Evolutionary Search

·
7 authors

Submitted by

ZonglinY

MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback

·
10 authors

3

Submitted by

kwanyoung

Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model

·
2 authors

Submitted by

Zigeng

VeriThinker: Learning to Verify Makes Reasoning Model Efficient

·
5 authors

2

Submitted by

Gigglingface

Diffusion Classifiers Understand Compositionality, but Conditions Apply

·
4 authors

3

Submitted by

JusperLee

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

·
32 authors

Submitted by

LoYoT

Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

·
11 authors

Submitted by

dalime

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

·
8 authors

Submitted by

pat-jj

s3: You Don't Need That Much Data to Train a Search Agent via RL

·
7 authors

2

Submitted by

SP2001

Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection

·
4 authors

2

Submitted by

Kuvvi

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

·
5 authors

2

Submitted by

Jinyang23

Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities

·
8 authors

2

Submitted by

m-serious

Time-R1: Towards Comprehensive Temporal Reasoning in LLMs

·
5 authors

3

Submitted by

Yunqiu

Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration

·
5 authors

2

Submitted by

alandao

Speechless: Speech Instruction Training Without Speech for Low Resource Languages

·
9 authors

2

Submitted by

MenghaoGuo

RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs

·
15 authors

3

Submitted by

ssz1111

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

·
14 authors

5

Submitted by

yanxi-chen

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

·
13 authors

2

Submitted by

ed1son

ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems

·
6 authors

2

Submitted by

oneonlee

Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study

·
4 authors

2

Submitted by

zguo0525

Synthetic Data RL: Task Definition Is All You Need

·
8 authors

2

Submitted by

mrwu

RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning

·
17 authors

Submitted by

yisuanwang

DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation

·
12 authors

2

Submitted by

Lingaaaaaaa

Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning

·
7 authors

Submitted by

yifAI

On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning

·
6 authors

2

Submitted by

tanshh97

Interactive Post-Training for Vision-Language-Action Models

·
4 authors

2

Submitted by

beanie00

ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection

·
7 authors

2

Submitted by

prateekv

Large Language Models Implicitly Learn to See and Hear Just By Reading

·
2 authors

Submitted by

kaiwenw

Value-Guided Search for Efficient Chain-of-Thought Reasoning

·
7 authors

Submitted by

ljcleo

Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

·
6 authors

2

Submitted by

HwanChang0106

Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering

·
4 authors

2

Submitted by

BootsofLagrangian

Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks

·
5 authors

2

Submitted by

Chaeeun-Kim

FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS

·
2 authors

2

Submitted by

rmahesh

Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA

·
8 authors

2

Submitted by

thinkwee

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

·
6 authors

5

Submitted by

songff

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

·
8 authors

2

Submitted by

3ebdola

NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

·
5 authors

Submitted by

liboaccn

FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation

·
4 authors

2

Submitted by

Wyattz23

Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing

·
9 authors