SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL Paper • 2504.11455 • Published 9 days ago • 12
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 9 items • Updated 7 days ago • 117
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 6 items • Updated 6 days ago • 28
Sheet Music Transformer Collection Collection of finetuned versions of the Sheet Music Transformer model. • 5 items • Updated Sep 23, 2024 • 1
Sheet Music Transformer Datasets Collection Datasets for the Sheet Music Transformer • 4 items • Updated Sep 23, 2024 • 3
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 4 items • Updated about 8 hours ago • 91
GenPRM Collection A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated 18 days ago • 5
WMDP Benchmark Collection The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning • 9 items • Updated Apr 23, 2024 • 7
WPO Collection Models and datasets in paper "WPO: Enhancing RLHF with Weighted Preference Optimization". • 11 items • Updated Aug 22, 2024 • 7
SciCap Challenge Collection The Second Scientific Figure Captioning Challenge (SCICAP) in IJCAI 2024 • 2 items • Updated Jul 25, 2024 • 2
The SPRIGHT T2I collection Collection This collection contains the datasets, model, paper, and demo associated with the SPRIGHT (SPatially RIGHT) release. • 5 items • Updated Apr 2, 2024 • 6