2 42 34

Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Recent Activity

liked a model about 4 hours ago

agentica-org/DeepSWE-Preview

upvoted a paper 22 days ago

Reinforcement Pre-Training

upvoted a paper 29 days ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

View all activity

Organizations

None yet

liked a model about 4 hours ago

agentica-org/DeepSWE-Preview

Text Generation • 33B • Updated 1 day ago • 403 • 70

upvoted a paper 22 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published 24 days ago • 238

upvoted a paper 29 days ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 257

liked a model about 1 month ago

mistralai/Devstral-Small-2505

Text Generation • 24B • Updated May 26 • 125k • • 810

upvoted 2 papers about 1 month ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 27

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 119

liked a model about 2 months ago

facebook/KernelLLM

Text Generation • 8B • Updated 1 day ago • 3.03k • • 162

upvoted a paper about 2 months ago

Generating Physically Stable and Buildable LEGO Designs from Text

Paper • 2505.05469 • Published May 8 • 27

liked a model about 2 months ago

ServiceNow-AI/Apriel-Nemotron-15b-Thinker

Text Generation • 15B • Updated May 15 • 6.48k • 88

upvoted 2 papers 2 months ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

liked a model 2 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated May 21 • 148k • • 965

upvoted 2 papers 2 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 116

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 112

upvoted a paper 3 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 172

upvoted 3 papers 4 months ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published Mar 4 • 27

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160

upvoted a paper 5 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50

upvoted a paper 6 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 280

Denis Akhiyarov

AI & ML interests

Recent Activity

Organizations

dtanow's activity