Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning • arXiv:2408.10075 • Published Aug 19, 2024
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training • arXiv:2411.15124 • Published Nov 22, 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback • arXiv:2406.09279 • Published Jun 13, 2024
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 • arXiv:2311.10702 • Published Nov 17, 2023
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources • arXiv:2306.04751 • Published Jun 7, 2023
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation • arXiv:2212.10315 • Published Dec 20, 2022
TESS: Text-to-Text Self-Conditioned Simplex Diffusion • arXiv:2305.08379 • Published May 15, 2023