Hamish Ivison's picture

Hamish Ivison

hamishivi

·

https://ivison.id.au

AI & ML interests

NLP :)

Recent Activity

updated a model 2 days ago

hamishivi/swerl_qwen36_27b_base_tmax_10k_grpo_tp2_sp4_step100

published a model 2 days ago

hamishivi/swerl_qwen36_27b_base_tmax_10k_grpo_tp2_sp4_step100

updated a model 5 days ago

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit_10pct_step160

View all activity

Organizations

Collections 8

View 8 collections

Papers 14

arxiv:2512.13961

arxiv:2511.19399

arxiv:2511.07317

arxiv:2503.01807

models 273

hamishivi/swerl_qwen36_27b_base_tmax_10k_grpo_tp2_sp4_step100

2.65M • Updated 2 days ago • 234

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit_10pct_step160

9B • Updated 5 days ago • 151

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_no_submit421777143486_step_200

9B • Updated 7 days ago • 156

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo_mask_overlong421777163763_step_200

9B • Updated 7 days ago • 157

hamishivi/vip_grpo_base_p32_2403_qwen3_4b_math11774385112_step500

196k • Updated 9 days ago • 14

hamishivi/vip_grpo_base_p32_2403_qwen3_4b_math11774385112_step1000

196k • Updated 9 days ago • 298

hamishivi/qwen3.5_sft

9B • Updated 10 days ago • 166

hamishivi/qwen3.5_sft_w_incompletes

9B • Updated 10 days ago • 297

hamishivi/qwen3.5_tmax_breakdown_test_step100

9B • Updated 10 days ago • 267

hamishivi/swerl_qwen35_9b_base_tmax_10k_grpo421776749915_step400

9B • Updated 12 days ago • 156

View 273 models

datasets 219

hamishivi/dapo-grpo-1000-steps-math-pairs-sae

Viewer • Updated 8 days ago • 200 • 28

hamishivi/dapo-math-pairs-value-sae

Viewer • Updated 11 days ago • 200 • 35

hamishivi/dapo-math-pairs-value

Viewer • Updated 11 days ago • 200 • 40

hamishivi/agent-task-termigen

Viewer • Updated 11 days ago • 3.56k • 39

hamishivi/swerl-tmax-10k-verified

Viewer • Updated 11 days ago • 6.17k • 102

hamishivi/swerl-tmax-10k

Viewer • Updated 11 days ago • 9.46k • 308

hamishivi/agent-task-terminal-traj

Viewer • Updated 11 days ago • 5.65k • 53

hamishivi/agent-task-r2e-gym

Viewer • Updated 12 days ago • 8.1k • 32

hamishivi/agent-task-endless-terminals

Viewer • Updated 12 days ago • 2.49k • 44

hamishivi/agent-task-swe-gym

Viewer • Updated 12 days ago • 407 • 93

View 219 datasets