MIRA Collection Group-specific quality scorers from MIRA for mid-training data selection. • 12 items • Updated 2 days ago • 1
TapSampling Collection [ICML 2026] TapSampling: Inference-Time Sampling with a Task-Progress-Understanding Verifier for Robotic Manipulation • 3 items • Updated 20 days ago • 1
MiniCPM5 Collection A SOTA 1B on-device LLM, small yet powerful. • 11 items • Updated 4 days ago • 21
BitCPM-CANN Collection Full-pipeline ternary quantized model trained on CANN. • 12 items • Updated 5 days ago • 25
MolmoAct2-Cortex Eval rollouts Collection For all evaluation rollouts, see https://huggingface.co/collections/allenai/molmoact2-eval-rollouts • 6 items • Updated 10 days ago • 1
HoloMotion Collection HoloMotion: A Foundation Model for Whole-Body Humanoid Control. https://github.com/HorizonRobotics/HoloMotion • 3 items • Updated 9 days ago • 2
Rethinking OPD Collection This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip • 4 items • Updated 18 days ago • 2
PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling Paper • 2510.24235 • Published Oct 28, 2025 • 1
PaTaRM Collection PaTaRM is a Generative Reward Model (GRM) for RLHF alignment. • 4 items • Updated Apr 2 • 2
MolmoAct2-BimanualYAM Dataset Collection Collection of the MolmoAct2-BimanualYAM Dataset • 740 items • Updated 24 days ago • 14
GLiNER-relex Collection Zero-shot joint NER and relation extraction models • 4 items • Updated Mar 19 • 4
GLiClass-Multilang Collection Multi-lingual zero-shot text classification models • 3 items • Updated 29 days ago • 7
talkie-13b Collection talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated Apr 21 • 53