danielkorat (Daniel Korat)

upvoted an article 2 months ago

Article

Getting More from Your Test-Time Compute Budget with Portfolio Beam Search

danelbaz

•

Feb 24

• 8

upvoted an article 5 months ago

Article

DeepMath: A lightweight math reasoning Agent with smolagents

+1

danf, mber, moshew

•

Dec 4, 2025

• 40

upvoted 2 articles about 1 year ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

+5

hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131

•

Apr 16, 2025

• 42

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

jmamou

•

Mar 24, 2025

• 20

upvoted a paper about 1 year ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published Feb 13, 2025 • 16

upvoted a paper over 1 year ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 11

upvoted 4 articles over 1 year ago

Article

Assisted Generation: a new direction toward low-latency text generation

joaogante

•

May 11, 2023

• 78

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

+4

danielkorat, tomaarsen, orenpereg, moshew, echarlaix, aprabh2

•

Apr 3, 2024

• 11

Article

Faster Assisted Generation with Dynamic Speculation

+5

jmamou, orenpereg, joaogante, lewtun, danielkorat, Nadav-Timor, moshew

•

Oct 8, 2024

• 51

Article

SetFit: Efficient Few-Shot Learning Without Prompts

+4

Unso, lewtun, luketheduke, danielkorat, orenpereg, moshew

•

Sep 26, 2022

• 40

upvoted a paper almost 2 years ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 40

upvoted 2 articles almost 2 years ago

Article

Our Transformers Code Agent beats the GAIA benchmark 🏅

m-ric, sergeipetrov

•

Jul 1, 2024

• 100

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen

•

May 28, 2024

• 274

upvoted 2 papers almost 2 years ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 18

upvoted 2 articles almost 2 years ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

+6

juliensimon, Haihao, antonyvance, MatrixYao, lianglv, gserochi, Debbh, kding1

•

May 9, 2024

• 12

Article

Introducing the Open Leaderboard for Hebrew LLMs!

+2

Shaltiel, TalGeva, OmerKo, clefourrier

•

May 5, 2024

• 56

upvoted a paper about 2 years ago

Improving Classification Performance With Human Feedback: Label a few, we label the rest

Paper • 2401.09555 • Published Jan 17, 2024 • 6

upvoted a paper almost 3 years ago

H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 14

Daniel Korat

AI & ML interests

Organizations

Getting More from Your Test-Time Compute Budget with Portfolio Beam Search

DeepMath: A lightweight math reasoning Agent with smolagents

Introducing HELMET: Holistically Evaluating Long-context Language Models

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

FastDraft: How to Train Your Draft

Assisted Generation: a new direction toward low-latency text generation

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Faster Assisted Generation with Dynamic Speculation

SetFit: Efficient Few-Shot Learning Without Prompts

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Our Transformers Code Agent beats the GAIA benchmark 🏅

Training and Finetuning Embedding Models with Sentence Transformers

Accelerating Speculative Decoding using Dynamic Speculation Length

Distributed Speculative Inference of Large Language Models

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Introducing the Open Leaderboard for Hebrew LLMs!

Improving Classification Performance With Human Feedback: Label a few, we label the rest

H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Daniel Korat

AI & ML interests

Organizations

danielkorat's activity

Getting More from Your Test-Time Compute Budget with Portfolio Beam Search

DeepMath: A lightweight math reasoning Agent with smolagents

Introducing HELMET: Holistically Evaluating Long-context Language Models

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

Assisted Generation: a new direction toward low-latency text generation

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Faster Assisted Generation with Dynamic Speculation

SetFit: Efficient Few-Shot Learning Without Prompts

Our Transformers Code Agent beats the GAIA benchmark 🏅

Training and Finetuning Embedding Models with Sentence Transformers

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Introducing the Open Leaderboard for Hebrew LLMs!