Haris Jabbar's picture

Haris Jabbar

maveriq

·

AI & ML interests

Tokenization, language generation, normalizing flows, language modeling, document ai

Organizations

upvoted a paper 3 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By

•

Feb 7

• 199

upvoted an article 6 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

By

and 5 others •

Feb 4

• 99

upvoted a paper 6 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

upvoted an article 6 months ago

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.28k

upvoted a collection 8 months ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27

upvoted a paper 8 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 64

upvoted a collection 9 months ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Apr 30 • 88

upvoted 2 articles 11 months ago

Article

Selective fine-tuning of Language Models with Spectrum

By

•

Sep 3, 2024

• 36

Article

Uncensor any LLM with abliteration

By

•

Jun 13, 2024

• 642

upvoted a collection 11 months ago

INT8 LLMs for vLLM

Accurate INT8 quantized models by Neural Magic, ready for use with vLLM! • 50 items • Updated Sep 26, 2024 • 17

upvoted 2 papers about 1 year ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12, 2024 • 138

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Paper • 2305.11738 • Published May 19, 2023 • 8

upvoted a collection over 1 year ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 86

upvoted 2 papers over 1 year ago

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 118

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 258

upvoted 2 collections over 1 year ago

Handbook v0.1 models and datasets

Models and datasets for v0.1 of the alignment handbook • 6 items • Updated Nov 10, 2023 • 24

⭐ StarCoder

All models, datasets, and demos related to StarCoder! • 11 items • Updated Feb 27, 2024 • 26