30 17 203

Huseyin ABANOZ

habanoz

AI & ML interests

LLM, RL

Recent Activity

liked a model 18 days ago

allenai/Llama-3.1-Tulu-3-8B

liked a dataset 18 days ago

allenai/RLVR-GSM-MATH-IF-Mixed-Constraints

liked a dataset 18 days ago

allenai/llama-3.1-tulu-3-405b-preference-mixture

View all activity

Organizations

None yet

habanoz's activity

upvoted a collection 18 days ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 19 days ago • 80

upvoted a paper 7 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 55

upvoted an article 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

and 2 others •

Jul 16, 2024

• 367

upvoted a paper 12 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

upvoted an article about 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

and 4 others •

Apr 18, 2024

• 289

upvoted a paper about 1 year ago

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 27

upvoted 2 articles about 1 year ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

and 2 others •

Apr 15, 2024

• 179

Article

CodeGemma - an official Google release for code LLMs

and 5 others •

Apr 9, 2024

• 101

upvoted 3 collections about 1 year ago

upvoted a paper about 1 year ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 45

upvoted 3 papers over 1 year ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 58

Effective Long-Context Scaling of Foundation Models

Paper • 2309.16039 • Published Sep 27, 2023 • 30

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 243

upvoted 2 papers almost 2 years ago

Stay on topic with Classifier-Free Guidance

Paper • 2306.17806 • Published Jun 30, 2023 • 27

WizardLM: Empowering Large Language Models to Follow Complex Instructions

Paper • 2304.12244 • Published Apr 24, 2023 • 14