Packing Input Frame Context in Next-Frame Prediction Models for Video Generation Paper โข 2504.12626 โข Published 23 days ago โข 48
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper โข 2501.17161 โข Published Jan 28 โข 121
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper โข 2408.08872 โข Published Aug 16, 2024 โข 101
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Paper โข 2407.12784 โข Published Jul 17, 2024 โข 52