Hugging Face
2391.9 TFLOPS
chansung park
PRO
chansung
4235 followers · 34 following
algo_diver
deep-diver
AI & ML interests
None yet
Recent Activity
Upvoted a paper 12 days ago
EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes
Reacted to their post with 👍 21 days ago
YAML engineering is becoming more important than ever, from infra provisioning to model training (recipes). I built a simple editor, starting with @dstackai, and I will share the live endpoint this week. Let me know what you think about this approach. If people find it useful, I am going to do the same for the LLM training recipes of popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me hear your thoughts.
Posted an update 21 days ago
Organizations
chansung's models (122)
chansung/Qwen2.5-7B-CCRL-1 • Text Generation • 8B • Updated Apr 28 • 4
chansung/Qwen2.5-1.5B-CCRL-1 • Text Generation • 2B • Updated Apr 9 • 10
chansung/Qwen2.5-1.5B-CCRL-2 • Text Generation • 2B • Updated Apr 2 • 7
chansung/Qwen2.5-1.5B-CRL-Code-GRPO-exp1 • Updated Mar 31
chansung/Qwen2.5-1.5B-Coder-CRL-GRPO-exp1 • Updated Mar 31
chansung/Qwen2.5-1.5B-Instruct-CRL-Open-R1-Code-GRPO-exp1 • Updated Mar 31
chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO-exp1 • Text Generation • 2B • Updated Mar 31 • 4
chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO • 2B • Updated Mar 30 • 1
chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO • 2B • Updated Mar 30 • 1
chansung/Qwen2.5-1.5B-Open-R1-GRPO • Updated Mar 25
chansung/Qwen-2.5-7B-Simple-RL • Text Generation • 8B • Updated Mar 18 • 4
chansung/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 18
chansung/llama3-8b-lora-thinkcoder-sft • Text Generation • 8B • Updated Mar 5 • 4
chansung/gemma7b-gpt4o_1k_summarize-lora2 • Updated Sep 25, 2024
chansung/gemma7b-gpt4o_1k_summarize-kasalora-auxloss • Updated Sep 24, 2024 • 3
chansung/gemma7b-gpt4o_1k_summarize-kasalora • Updated Sep 24, 2024 • 4
chansung/gemma7b-gpt4o_1k_summarize-lora • Updated Sep 24, 2024 • 3
chansung/flux-lora-test • Updated Aug 16, 2024 • 3 • 1
chansung/mental_health_counseling_merged_v0.1 • Text Generation • Updated May 29, 2024 • 15 • 3
chansung/coding_llamaduo_result111 • Updated May 27, 2024
chansung/mental_health_counseling_v0.1_merged • Text Generation • 9B • Updated May 18, 2024 • 6
chansung/mental_health_counseling_v0.1 • Updated May 17, 2024 • 3
chansung/llamaduo_synth_ds_v0.1 • Updated Apr 30, 2024 • 15 • 1
chansung/fsdp-qlora-test • Updated Apr 24, 2024
chansung/coding_llamaduo_60k_v0.2 • Updated Apr 24, 2024 • 3
chansung/coding_llamaduo_60k • Updated Apr 24, 2024 • 5
chansung/coding_llamaduo_result3 • Updated Apr 22, 2024 • 3
chansung/coding_llamaduo_result2 • Updated Apr 21, 2024 • 3
chansung/coding_llamaduo_result1 • Updated Apr 20, 2024 • 3
chansung/gemma-7b-sft-qlora-no-robots-99 • Updated Apr 18, 2024