new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Feb 11

Submitted by

akhaliq

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

·
8 authors

Submitted by

etomoscow

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

·
5 authors

Submitted by

vanilla1116

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

·
17 authors

Submitted by

bidiptas

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

·
4 authors

Submitted by

JaxonK

LM2: Large Memory Models

·
8 authors

Submitted by

pengxiang

The Curse of Depth in Large Language Models

·
6 authors

Submitted by

akhaliq

Matryoshka Quantization

·
5 authors

Submitted by

ashraful

CODESIM: Multi-Agent Code Generation and Problem Solving through Simulation-Driven Planning and Debugging

·
3 authors

Submitted by

Lingaaaaaaa

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

·
4 authors

Submitted by

zhijie3

Show-o Turbo: Towards Accelerated Unified Multimodal Understanding and Generation

·
6 authors

Submitted by

Jiabin99

MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents

·
3 authors

Submitted by

zomss

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

·
9 authors

Submitted by

akhaliq

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

·
19 authors

Submitted by

Paranioar

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

·
9 authors

Submitted by

ztwang

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

·
10 authors

Submitted by

akhaliq

History-Guided Video Diffusion

·
6 authors

Submitted by

sahsaeedi

Dual Caption Preference Optimization for Diffusion Models

·
7 authors

Submitted by

akhaliq

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

·
12 authors

Submitted by

PY007

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

·
7 authors

Submitted by

zhenglin

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

·
6 authors

Submitted by

Hanyuezhuohua

APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding

·
3 authors

Submitted by

aaabiao

Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLM

·
4 authors

Submitted by

Hhaiduo

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE

·
10 authors

Submitted by

btrabucco

Towards Internet-Scale Training For Agents

·
4 authors

Submitted by

dnoever

Forbidden Science: Dual-Use AI Challenge Benchmark and Scientific Refusal Tests

·
2 authors

Submitted by

akhaliq

Embodied Red Teaming for Auditing Robotic Foundation Models

·
8 authors