PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published 17 days ago • 123
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay Paper • 2504.03601 • Published 20 days ago • 16
Articulated Kinematics Distillation from Video Diffusion Models Paper • 2504.01204 • Published 23 days ago • 24
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper • 2503.19901 • Published 30 days ago • 39
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published 22 days ago • 36
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 21 days ago • 53
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 24 days ago • 256
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Paper • 2406.12644 • Published Jun 18, 2024 • 5
Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization Paper • 2406.15708 • Published Jun 22, 2024 • 1
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 192
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 124
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Paper • 2503.16252 • Published Mar 20 • 27
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 97