Evaluating and Steering Modality Preferences in Multimodal Large Language Model Paper β’ 2505.20977 β’ Published 6 days ago β’ 1 β’ 1
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models Paper β’ 2505.19223 β’ Published 8 days ago β’ 8 β’ 2
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Paper β’ 2406.20085 β’ Published Jun 28, 2024 β’ 13 β’ 3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper β’ 2504.07866 β’ Published Apr 10 β’ 11 β’ 3
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper β’ 2504.07866 β’ Published Apr 10 β’ 11 β’ 3
Boost Your Own Human Image Generation Model via Direct Preference Optimization with AI Feedback Paper β’ 2405.20216 β’ Published May 30, 2024 β’ 22 β’ 3
MoBA: Mixture of Block Attention for Long-Context LLMs Paper β’ 2502.13189 β’ Published Feb 18 β’ 17 β’ 2
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper β’ 2411.14405 β’ Published Nov 21, 2024 β’ 62 β’ 4
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper β’ 2410.11711 β’ Published Oct 15, 2024 β’ 9 β’ 4
Context is Key(NMF): Modelling Topical Information Dynamics in Chinese Diaspora Media Paper β’ 2410.12791 β’ Published Oct 16, 2024 β’ 5 β’ 3
Named Clinical Entity Recognition Benchmark Paper β’ 2410.05046 β’ Published Oct 7, 2024 β’ 17 β’ 3
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper β’ 2410.02749 β’ Published Oct 3, 2024 β’ 12 β’ 3
LLaVA-Critic: Learning to Evaluate Multimodal Models Paper β’ 2410.02712 β’ Published Oct 3, 2024 β’ 37 β’ 3
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper β’ 2409.12568 β’ Published Sep 19, 2024 β’ 51 β’ 4
Insights from Benchmarking Frontier Language Models on Web App Code Generation Paper β’ 2409.05177 β’ Published Sep 8, 2024 β’ 7 β’ 3