Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources Paper • 2504.00595 • Published 2 days ago • 23
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published 9 days ago • 64
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model Paper • 2503.21144 • Published 7 days ago • 23
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 8 days ago • 89
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published 10 days ago • 110
Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 13 days ago • 33
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Paper • 2503.16257 • Published 14 days ago • 23
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing Paper • 2503.13434 • Published 17 days ago • 25
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Paper • 2503.12885 • Published 17 days ago • 42
DPO Collection Various useful datasets with preference optimization • 16 items • Updated Jan 23 • 5
abliteration loras Collection Extracted adapters for removing censorship in models • 3 items • Updated Jan 21 • 2
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published 21 days ago • 17
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 22 days ago • 27
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 22 days ago • 363
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published Feb 27 • 28
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper • 2503.07027 • Published 24 days ago • 27