Deping Zhang

Deping

AI & ML interests

Deep Reinforcement Learning, Computer Vision, Large Language Models (especially their "emergent" capabilities), Theoretical Condensed Matter Physics (superconductivity, ferromagnetism)

Organizations

None yet

Collections 10

LLM_VLM_R1
  • Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning

    Paper • 2502.19655 • Published Feb 27
  • MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

    Paper • 2502.19634 • Published Feb 26 • 63
  • R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning

    Paper • 2502.19735 • Published Feb 27 • 9
  • AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

    Paper • 2502.14669 • Published Feb 20 • 14
LLM_Infra
  • The Ultra-Scale Playbook 🌌

    Space • Running • 2.56k

    The ultimate guide to training LLM on large GPU Clusters

models 0

None public yet

datasets 0

None public yet