Hugging Face
tjw's Collections
Papers

updated Jun 20

  • Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

    Paper • 2505.02567 • Published May 5 • 79

  • TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

    Paper • 2505.18125 • Published May 23 • 113

  • Distilling LLM Agent into Small Models with Retrieval and Code Tools

    Paper • 2505.17612 • Published May 23 • 81

  • One RL to See Them All: Visual Triple Unified Reinforcement Learning

    Paper • 2505.18129 • Published May 23 • 60

  • MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

    Paper • 2506.13585 • Published Jun 16 • 261

  • Scaling Test-time Compute for LLM Agents

    Paper • 2506.12928 • Published Jun 15 • 61

  • Reinforcement Pre-Training

    Paper • 2506.08007 • Published Jun 9 • 255

  • Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

    Paper • 2506.07044 • Published Jun 8 • 112

  • ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

    Paper • 2506.09513 • Published Jun 11 • 98

  • Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

    Paper • 2506.06395 • Published Jun 5 • 129

  • Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

    Paper • 2505.24726 • Published May 30 • 269