Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiang's picture
5 7 1

Jiang

Dongwei
dark-pen's profile picture rxlqn2's profile picture AZH04's profile picture
·
  • Some-random

AI & ML interests

None yet

Organizations

ESPnet's profile picture

Papers 3

arxiv:2410.01044
arxiv:2409.12183
arxiv:2407.09007

models 17

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata

Text Generation • Updated Feb 13 • 4

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer

Text Generation • Updated Feb 11 • 8

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr

Text Generation • Updated Feb 11 • 8

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata

Text Generation • Updated Feb 5 • 7

Dongwei/Qwen-2.5-7B_Base_Math_smalllr

Text Generation • Updated Feb 5 • 12 • 6

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr

Text Generation • Updated Feb 4 • 7

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr

Text Generation • Updated Feb 4 • 10

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr

Text Generation • Updated Feb 4 • 25

Dongwei/Qwen-2.5-7B_Math_smalllr

Text Generation • Updated Feb 4 • 9

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math

Text Generation • Updated Feb 4 • 14

datasets 2

Dongwei/Math_8K_for_GRPO

Viewer • Updated Feb 5 • 8.89k • 49 • 2

Dongwei/reasoning_world_model

Viewer • Updated Apr 22, 2024 • 15.2k • 15 • 5
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs