2 42 34

Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Recent Activity

liked a model about 4 hours ago

agentica-org/DeepSWE-Preview

upvoted a paper 22 days ago

Reinforcement Pre-Training

upvoted a paper 29 days ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper 22 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published 24 days ago • 238

upvoted a paper 29 days ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 257

upvoted 2 papers about 1 month ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 27

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 119

upvoted a paper about 2 months ago

Generating Physically Stable and Buildable LEGO Designs from Text

Paper • 2505.05469 • Published May 8 • 27

upvoted 4 papers 2 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 112

upvoted a paper 3 months ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 172

upvoted 3 papers 4 months ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published Mar 4 • 27

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160

upvoted a paper 5 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 50

upvoted 2 papers 6 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 280

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39

upvoted a paper 7 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 64

upvoted a paper 8 months ago

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 22

upvoted an article 10 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

•

May 28, 2024

• 230

upvoted a paper 11 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 126

Denis Akhiyarov

AI & ML interests

Recent Activity

Organizations

dtanow's activity

Training and Finetuning Embedding Models with Sentence Transformers v3