Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
1 day ago
rl-rag/gpt-oss-20b-eval-react-serper
published
a dataset
1 day ago
rl-rag/gpt-oss-20b-eval-react-serper
updated
a model
4 days ago
hamishivi/1708_miromind_8b_dpo_rl_rag__1__1755577229_step_100
Organizations
Tulu 2 Llama 3 Update
Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).
Tulu V2 Suite
The set of models associated with the Tulu V2 technical report.
LM Preference Datasets
TESS 2
Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2
7b tulu 2.5
a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.
Tulu V1 Suite
The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".
Large-Scale Data Selection for Instruction Tuning
Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
TESS 2
Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2
Tulu 2 Llama 3 Update
Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).
7b tulu 2.5
a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.
Tulu V2 Suite
The set of models associated with the Tulu V2 technical report.
Tulu V1 Suite
The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".
LM Preference Datasets