Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
Updated a model 6 days ago: hamishivi/2006rl_long_qwen2_5_integration_mix_5838__1__1750473014_checkpoints_step_2000
Published a model 6 days ago: hamishivi/2006rl_long_qwen2_5_integration_mix_5838__1__1750473014_checkpoints_step_2000
Updated a dataset 7 days ago: hamishivi/hamishivi_rlvr_orz_math_57k_collected_all_tight_filtered_hamishivi_qwen2_5_openthoughts2
Tulu 2 Llama 3 Update
Llama 3 models trained on the Tulu dataset, following https://arxiv.org/abs/2311.10702 (Tulu 2) and https://arxiv.org/abs/2406.09279 (Tulu 2.5).
Tulu V2 Suite
The set of models associated with the Tulu V2 technical report.
LM Preference Datasets
TESS 2
Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2
7b tulu 2.5
A small run at 7B scale with PPO, following the "Unpacking DPO and PPO" paper.
Tulu V1 Suite
The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".
Large-Scale Data Selection for Instruction Tuning
Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)