Jiang

Dongwei

·

Some-random

AI & ML interests

None yet

Organizations

Dongwei 's models 17

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata

Text Generation • 8B • Updated Feb 13, 2025 • 4

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer

Text Generation • 8B • Updated Feb 11, 2025 • 6

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr

Text Generation • 8B • Updated Feb 11, 2025 • 9

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata

Text Generation • 8B • Updated Feb 5, 2025 • 7

Dongwei/Qwen-2.5-7B_Base_Math_smalllr

Text Generation • 8B • Updated Feb 5, 2025 • 25 • • 6

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr

Text Generation • 8B • Updated Feb 4, 2025 • 3

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr

Text Generation • 2B • Updated Feb 4, 2025 • 3

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr

Text Generation • 2B • Updated Feb 4, 2025 • 2

Dongwei/Qwen-2.5-7B_Math_smalllr

Text Generation • 8B • Updated Feb 4, 2025 • 5

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math

Text Generation • 8B • Updated Feb 4, 2025 • 8

Dongwei/Qwen-2.5-7B_Math

Text Generation • 8B • Updated Feb 4, 2025 • 12

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math

Text Generation • 2B • Updated Feb 3, 2025 • 5

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math

Text Generation • 2B • Updated Feb 3, 2025 • 7

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Feb 3, 2025 • 10 • 1

Dongwei/Qwen-2.5-7B

Text Generation • 8B • Updated Feb 3, 2025 • 4

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 2, 2025 • 5 • 1

Dongwei/Rationalyst_reasoning_datasets

Text Generation • 8B • Updated Oct 13, 2024 • 6 • 4