Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 9 days ago • 32
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 9 days ago • 100
Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Paper • 2501.14334 • Published 14 days ago • 17
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 8 days ago • 22
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 13 days ago • 26
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model Paper • 2501.18636 • Published 9 days ago • 25
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published Dec 30, 2024 • 37
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published Dec 28, 2024 • 45
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization Paper • 2412.18525 • Published Dec 24, 2024 • 72
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 23 days ago • 61
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Paper • 2501.13007 • Published 15 days ago • 19
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 15 days ago • 301
Control LLM: Controlled Evolution for Intelligence Retention in LLM Paper • 2501.10979 • Published 19 days ago • 6
Hallucinations Can Improve Large Language Models in Drug Discovery Paper • 2501.13824 • Published 14 days ago • 9
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation Paper • 2403.14614 • Published Mar 21, 2024 • 3
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 13 days ago • 29