37 15 8

Hamish Ivison

hamishivi

penfever's profile picture

0xLaszlo's profile picture

shuyuej's profile picture

https://ivison.id.au

hamishivi
hamishivi

AI & ML interests

NLP :)

Recent Activity

updated a dataset 1 day ago

rl-rag/gpt-oss-20b-eval-react-serper

published a dataset 1 day ago

rl-rag/gpt-oss-20b-eval-react-serper

updated a model 4 days ago

hamishivi/1708_miromind_8b_dpo_rl_rag__1__1755577229_step_100

View all activity

Organizations

hamishivi 's collections 7

Large-Scale Data Selection for Instruction Tuning

Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)

Large-Scale Data Selection for Instruction Tuning

Paper • 2503.01807 • Published Mar 3 • 13
hamishivi/tulu-2-multitask-rrmax-326k-sft

7B • Updated Mar 4 • 8
hamishivi/rds-sels-multitask-rrmax-top326k

Viewer • Updated Mar 4 • 326k • 16
hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

Updated Mar 4 • 5

Tulu 2 Llama 3 Update

Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).

allenai/llama-3.1-tulu-2-dpo-70b

71B • Updated Aug 15, 2024 • 29
allenai/llama-3.1-tulu-2-70b

71B • Updated Aug 15, 2024 • 41
allenai/llama-3.1-tulu-2-70b-uf-mean-rm

70B • Updated Aug 15, 2024 • 29
allenai/llama-3.1-tulu-2-dpo-8b

8B • Updated Aug 15, 2024 • 31 • 2

Tulu V2 Suite

The set of models associated with the Tulu V2 technical report.

allenai/tulu-2-dpo-70b

Text Generation • 69B • Updated Jan 31, 2024 • 4.53k • 157
allenai/tulu-2-dpo-13b

Text Generation • 13B • Updated May 17, 2024 • 3.05k • • 20
allenai/tulu-2-dpo-7b

Text Generation • Updated May 14, 2024 • 3.1k • 20
allenai/tulu-2-70b

Text Generation • Updated Apr 19, 2024 • 48 • 8

LM Preference Datasets

lmsys/chatbot_arena_conversations

Viewer • Updated Sep 30, 2023 • 33k • 1.54k • 407
Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 16.3k • 1.41k
openai/summarize_from_feedback

Viewer • Updated Jan 3, 2023 • 194k • 1.29k • 213
openai/webgpt_comparisons

Viewer • Updated Dec 19, 2022 • 19.6k • 753 • 236

TESS 2

Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2

TESS 2: A Large-Scale Generalist Diffusion Language Model

Paper • 2502.13917 • Published Feb 19 • 6
hamishivi/tess2-v0.3

7B • Updated Feb 20 • 7 • 3
hamishivi/tess2-v0.1

7B • Updated Feb 20 • 2
hamishivi/tess2-v0.3-base

7B • Updated Feb 20 • 2

7b tulu 2.5

a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.

hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm

Text Generation • 7B • Updated Jun 25, 2024 • 3
hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value

Token Classification • 7B • Updated Jun 25, 2024 • 2
hamishivi/tulu-v2.5-7b-uf-rm

Text Classification • 7B • Updated Jun 25, 2024 • 1

Tulu V1 Suite

The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".

allenai/tulu-65b

Text Generation • Updated Jun 29, 2023 • 55 • 21
allenai/tulu-30b

Text Generation • Updated Jun 20, 2023 • 58 • 18
allenai/tulu-13b

Text Generation • Updated Jun 20, 2023 • 55 • 8
allenai/tulu-7b

Text Generation • Updated Jun 20, 2023 • 132 • 9

Large-Scale Data Selection for Instruction Tuning

Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)

Large-Scale Data Selection for Instruction Tuning

Paper • 2503.01807 • Published Mar 3 • 13
hamishivi/tulu-2-multitask-rrmax-326k-sft

7B • Updated Mar 4 • 8
hamishivi/rds-sels-multitask-rrmax-top326k

Viewer • Updated Mar 4 • 326k • 16
hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft

Updated Mar 4 • 5

TESS 2

Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2

TESS 2: A Large-Scale Generalist Diffusion Language Model

Paper • 2502.13917 • Published Feb 19 • 6
hamishivi/tess2-v0.3

7B • Updated Feb 20 • 7 • 3
hamishivi/tess2-v0.1

7B • Updated Feb 20 • 2
hamishivi/tess2-v0.3-base

7B • Updated Feb 20 • 2

Tulu 2 Llama 3 Update

Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).

allenai/llama-3.1-tulu-2-dpo-70b

71B • Updated Aug 15, 2024 • 29
allenai/llama-3.1-tulu-2-70b

71B • Updated Aug 15, 2024 • 41
allenai/llama-3.1-tulu-2-70b-uf-mean-rm

70B • Updated Aug 15, 2024 • 29
allenai/llama-3.1-tulu-2-dpo-8b

8B • Updated Aug 15, 2024 • 31 • 2

7b tulu 2.5

a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.

hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm

Text Generation • 7B • Updated Jun 25, 2024 • 3
hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value

Token Classification • 7B • Updated Jun 25, 2024 • 2
hamishivi/tulu-v2.5-7b-uf-rm

Text Classification • 7B • Updated Jun 25, 2024 • 1

Tulu V2 Suite

The set of models associated with the Tulu V2 technical report.

allenai/tulu-2-dpo-70b

Text Generation • 69B • Updated Jan 31, 2024 • 4.53k • 157
allenai/tulu-2-dpo-13b

Text Generation • 13B • Updated May 17, 2024 • 3.05k • • 20
allenai/tulu-2-dpo-7b

Text Generation • Updated May 14, 2024 • 3.1k • 20
allenai/tulu-2-70b

Text Generation • Updated Apr 19, 2024 • 48 • 8

Tulu V1 Suite

The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".

allenai/tulu-65b

Text Generation • Updated Jun 29, 2023 • 55 • 21
allenai/tulu-30b

Text Generation • Updated Jun 20, 2023 • 58 • 18
allenai/tulu-13b

Text Generation • Updated Jun 20, 2023 • 55 • 8
allenai/tulu-7b

Text Generation • Updated Jun 20, 2023 • 132 • 9

LM Preference Datasets

lmsys/chatbot_arena_conversations

Viewer • Updated Sep 30, 2023 • 33k • 1.54k • 407
Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 16.3k • 1.41k
openai/summarize_from_feedback

Viewer • Updated Jan 3, 2023 • 194k • 1.29k • 213
openai/webgpt_comparisons

Viewer • Updated Dec 19, 2022 • 19.6k • 753 • 236