chansung park's picture

chansung park PRO

chansung

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

reacted to their post with 👍 21 days ago

YAML engineering becomes more and more important than ever from infra provisioning to model training (recipes). Here, I built a simple editor first for @dstackai, and I will share the live endpoint this week. Let me know what you think about this approach. Based on this approach, if people think this is useful, I am going to do the same thing for the LLM training recipes for popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me hear.

posted an update 21 days ago

YAML engineering becomes more and more important than ever from infra provisioning to model training (recipes). Here, I built a simple editor first for @dstackai, and I will share the live endpoint this week. Let me know what you think about this approach. Based on this approach, if people think this is useful, I am going to do the same thing for the LLM training recipes for popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me hear.

View all activity

Organizations

chansung 's models 122

chansung/Qwen2.5-7B-CCRL-1

Text Generation • 8B • Updated Apr 28 • 4

chansung/Qwen2.5-1.5B-CCRL-1

Text Generation • 2B • Updated Apr 9 • 10

chansung/Qwen2.5-1.5B-CCRL-2

Text Generation • 2B • Updated Apr 2 • 7

chansung/Qwen2.5-1.5B-CRL-Code-GRPO-exp1

chansung/Qwen2.5-1.5B-Coder-CRL-GRPO-exp1

chansung/Qwen2.5-1.5B-Instruct-CRL-Open-R1-Code-GRPO-exp1

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO-exp1

Text Generation • 2B • Updated Mar 31 • 4

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO

2B • Updated Mar 30 • 1

chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO

2B • Updated Mar 30 • 1

chansung/Qwen2.5-1.5B-Open-R1-GRPO

chansung/Qwen-2.5-7B-Simple-RL

Text Generation • 8B • Updated Mar 18 • 4

chansung/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

chansung/llama3-8b-lora-thinkcoder-sft

Text Generation • 8B • Updated Mar 5 • 4

chansung/gemma7b-gpt4o_1k_summarize-lora2

Updated Sep 25, 2024

chansung/gemma7b-gpt4o_1k_summarize-kasalora-auxloss

Updated Sep 24, 2024 • 3

chansung/gemma7b-gpt4o_1k_summarize-kasalora

Updated Sep 24, 2024 • 4

chansung/gemma7b-gpt4o_1k_summarize-lora

Updated Sep 24, 2024 • 3

chansung/flux-lora-test

Updated Aug 16, 2024 • 3 • 1

chansung/mental_health_counseling_merged_v0.1

Text Generation • Updated May 29, 2024 • 15 • 3

chansung/coding_llamaduo_result111

Updated May 27, 2024

chansung/mental_health_counseling_v0.1_merged

Text Generation • 9B • Updated May 18, 2024 • 6

chansung/mental_health_counseling_v0.1

Updated May 17, 2024 • 3

chansung/llamaduo_synth_ds_v0.1

Updated Apr 30, 2024 • 15 • 1

chansung/fsdp-qlora-test

Updated Apr 24, 2024

chansung/coding_llamaduo_60k_v0.2

Updated Apr 24, 2024 • 3

chansung/coding_llamaduo_60k

Updated Apr 24, 2024 • 5

chansung/coding_llamaduo_result3

Updated Apr 22, 2024 • 3

chansung/coding_llamaduo_result2

Updated Apr 21, 2024 • 3

chansung/coding_llamaduo_result1

Updated Apr 20, 2024 • 3

chansung/gemma-7b-sft-qlora-no-robots-99

Updated Apr 18, 2024