2 19 1

Chenlu Ye

Chenlu123

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually

published a model 8 days ago

Chenlu123/grpo_qwen3_4b_guru_n8_delta10_lam1e-11_bz128_mini_bz32_fsdp2_kl0.001

upvoted a paper 17 days ago

Recursive Multi-Agent Systems

View all activity

Organizations

upvoted a paper 1 day ago

Learning, Fast and Slow: Towards LLMs That Adapt Continually

Paper • 2605.12484 • Published 4 days ago • 15

published a model 8 days ago

Chenlu123/grpo_qwen3_4b_guru_n8_delta10_lam1e-11_bz128_mini_bz32_fsdp2_kl0.001

Updated 8 days ago

upvoted a paper 17 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 18 days ago • 266

updated a model 19 days ago

Chenlu123/shampoo_npg_tr_scale_delta40-lam1e-11_bsz512-32_qwen2_5_math_1_5b

Updated 19 days ago

published a model 19 days ago

Chenlu123/shampoo_npg_tr_scale_delta40-lam1e-11_bsz512-32_qwen2_5_math_1_5b

Updated 19 days ago

updated a model 21 days ago

Chenlu123/grpo_qwen_qwen2_5_math_1_5b_guru_n8_bz512_mini_bz32_fsdp2_kl0.001

Updated 21 days ago

published a model 21 days ago

Chenlu123/grpo_qwen_qwen2_5_math_1_5b_guru_n8_bz512_mini_bz32_fsdp2_kl0.001

Updated 21 days ago

updated a model 23 days ago

Chenlu123/shampoo_npg_tr_scale_delta20_lam1e-12_warmup_1_graftTrue_qwen2_5_math_1_5b

Updated 22 days ago

published a model 23 days ago

Chenlu123/shampoo_npg_tr_scale_delta20_lam1e-12_warmup_1_graftTrue_qwen2_5_math_1_5b

Updated 22 days ago

upvoted a paper 24 days ago

AgentSPEX: An Agent SPecification and EXecution Language

Paper • 2604.13346 • Published Apr 14 • 164

published a model 25 days ago

Chenlu123/grpo_qwen_qwen2_5_math_1_5b_guru_cliph0.2_n16_bz64_mini_bz64_fsdp2

Updated 25 days ago

updated a model 25 days ago

Chenlu123/shampoo_npg_tr_scale_delta0.1-lam1e-12_qwen2_5_math_1_5b

Updated 25 days ago

published a model 25 days ago

Chenlu123/shampoo_npg_tr_scale_delta0.1-lam1e-12_qwen2_5_math_1_5b

Updated 25 days ago

updated a model about 1 month ago

Chenlu123/grpo_warmup_graftTrue_qwen2_5_math_1_5b_guru_n16_bz64_mini_bz64_global_step_80

Updated Apr 8

published a model about 1 month ago

Chenlu123/grpo_warmup_graftTrue_qwen2_5_math_1_5b_guru_n16_bz64_mini_bz64_global_step_80

Updated Apr 8

upvoted a paper about 2 months ago

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

Paper • 2603.19470 • Published Mar 19 • 3

submitted a paper to Daily Papers about 2 months ago

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

Paper • 2603.19470 • Published Mar 19 • 3

updated a model about 2 months ago

Chenlu123/teacher_Qwen3-4B_dapo-math-17k_n8_prompt_bsz_128_mini_bsz_32_step460

2B • Updated Mar 20 • 4

published a model about 2 months ago

Chenlu123/teacher_Qwen3-4B_dapo-math-17k_n8_prompt_bsz_128_mini_bsz_32_step460

2B • Updated Mar 20 • 4

updated a model about 2 months ago

Chenlu123/teacher_Qwen3-4B_dapo-math-17k_n8_prompt_bsz_128_mini_bsz_32_step440

2B • Updated Mar 20 • 1

Chenlu Ye

AI & ML interests

Recent Activity

Organizations

Chenlu123's activity