Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published 6 days ago • 48
What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs Paper • 2410.10863 • Published Oct 7, 2024 • 1
DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs Paper • 2407.11030 • Published Jul 3, 2024
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training Paper • 2411.15708 • Published Nov 24, 2024
RoRA-VLM: Robust Retrieval-Augmented Vision Language Models Paper • 2410.08876 • Published Oct 11, 2024
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation? Paper • 2411.03670 • Published Nov 6, 2024
Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published Dec 23, 2024 • 42
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Paper • 2501.03124 • Published 21 days ago • 14
Scaling Laws for Floating Point Quantization Training Paper • 2501.02423 • Published 23 days ago • 25
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Paper • 2501.05444 • Published 18 days ago
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information Paper • 2409.14083 • Published Sep 21, 2024
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 13 days ago • 268
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation Paper • 2410.05363 • Published Oct 7, 2024 • 45