1 126 40

js

rldy

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Elevating 3D Models: High-Quality Texture and Geometry Refinement from a Low-Quality Model

upvoted a paper 11 days ago

A Survey of Context Engineering for Large Language Models

upvoted a paper 14 days ago

One Token to Fool LLM-as-a-Judge

View all activity

Organizations

upvoted a paper 5 days ago

Elevating 3D Models: High-Quality Texture and Geometry Refinement from a Low-Quality Model

Paper • 2507.11465 • Published 14 days ago • 13

upvoted a paper 11 days ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published 12 days ago • 215

upvoted a paper 14 days ago

One Token to Fool LLM-as-a-Judge

Paper • 2507.08794 • Published 18 days ago • 31

upvoted a paper 15 days ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published 27 days ago • 98

upvoted an article 19 days ago

Article

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

and 17 others •

19 days ago

• 46

upvoted a paper 19 days ago

Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving

Paper • 2507.06804 • Published 22 days ago • 15

upvoted 2 papers 20 days ago

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published 21 days ago • 107

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published 21 days ago • 20

upvoted a paper 28 days ago

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published 29 days ago • 33

upvoted a paper about 1 month ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 58

upvoted 4 papers about 2 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 253

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 176

FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation

Paper • 2506.01144 • Published Jun 1 • 14

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 267

upvoted 2 papers 2 months ago

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

Paper • 2505.23754 • Published May 29 • 16

Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model

Paper • 2505.17561 • Published May 23 • 31

upvoted 4 papers 3 months ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 86

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 132

Tina: Tiny Reasoning Models via LoRA

Paper • 2504.15777 • Published Apr 22 • 55

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

js

AI & ML interests

Recent Activity

Organizations

rldy's activity

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models