18 50 77

Asaf Yehudai

Asaf-Yehudai

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Effective Red-Teaming of Policy-Adherent Agents

upvoted a paper 12 days ago

Discrete Audio Tokens: More Than a Survey!

upvoted a paper 13 days ago

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

View all activity

Organizations

upvoted a paper 9 days ago

Effective Red-Teaming of Policy-Adherent Agents

Paper • 2506.09600 • Published 14 days ago • 37

upvoted a paper 12 days ago

Discrete Audio Tokens: More Than a Survey!

Paper • 2506.10274 • Published 13 days ago • 30

upvoted a paper 13 days ago

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

Paper • 2506.05309 • Published 19 days ago • 14

liked a dataset 17 days ago

ibm-research/justrank_judge_scores

Viewer • Updated 17 days ago • 1.51M • 155 • 2

upvoted a paper 28 days ago

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Paper • 2505.17813 • Published May 23 • 56

upvoted a paper about 1 month ago

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 22

liked a Space about 1 month ago

556

DreamO

🐨

A Unified Framework for Image Customization

upvoted a paper about 1 month ago

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Paper • 2505.02847 • Published May 1 • 27

liked a dataset about 2 months ago

ibm-research/watsonxDocsQA

Viewer • Updated May 7 • 1.22k • 269 • 3

upvoted an article about 2 months ago

Article

Bamba-9B-v2 - Fast and powerful!

and 12 others •

Apr 29

• 32

liked a model about 2 months ago

deepseek-ai/DeepSeek-Prover-V2-671B

Text Generation • Updated Apr 30 • 3.75k • • 801

upvoted 2 papers 2 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 112

RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation

Paper • 2504.17502 • Published Apr 24 • 56

liked a Space 2 months ago

8.59k

DeepSite v2

🐳

Generate any application with DeepSeek

commented 2 papers 3 months ago

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Paper • 2504.02605 • Published Apr 3 • 48 •

LiveVQA: Live Visual Knowledge Seeking

Paper • 2504.05288 • Published Apr 7 • 15 •

upvoted 2 papers 3 months ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 55

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published Apr 3 • 30

liked a model 3 months ago

openfree/flux-chatgpt-ghibli-lora

Text-to-Image • Updated 25 days ago • 1.73k • • 312

upvoted a paper 3 months ago

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published Mar 25 • 75