ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer · 4 authors · submitted by xiaol · 23 upvotes · 2 comments
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation · 14 authors · submitted by HarryHe · 15 upvotes · 2 comments
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models · 6 authors · submitted by eliebak · 10 upvotes · 2 comments
iFormer: Integrating ConvNet and Transformer for Mobile Application · 1 author · submitted by akhaliq · 10 upvotes · 2 comments
Are Vision Language Models Texture or Shape Biased and Can We Steer Them? · 8 authors · submitted by nielsr · 9 upvotes · 2 comments
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity · 6 authors · submitted by akhaliq · 7 upvotes · 1 comment
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas · 6 authors · submitted by xywang1 · 6 upvotes · 2 comments
Return of the Encoder: Maximizing Parameter Efficiency for SLMs · 3 authors · submitted by melfeki11 · 5 upvotes · 2 comments