Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model • Paper • 2501.02790 • Published 21 days ago • 9
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers • Paper • 2501.02393 • Published 23 days ago • 8
Generalizable Origin Identification for Text-Guided Image-to-Image Diffusion Models • Paper • 2501.02376 • Published 23 days ago • 3
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control • Paper • 2501.02260 • Published 23 days ago • 5
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control • Paper • 2501.03847 • Published 20 days ago • 23
Demystifying Domain-adaptive Post-training for Financial LLMs • Paper • 2501.04961 • Published 18 days ago • 11
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization • Paper • 2412.18525 • Published Dec 24, 2024 • 70
MMFactory: A Universal Solution Search Engine for Vision-Language Tasks • Paper • 2412.18072 • Published Dec 24, 2024 • 17
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation • Paper • 2412.18176 • Published Dec 24, 2024 • 15