-
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper • 2408.08152 • Published • 60 -
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition
Paper • 2402.15220 • Published • 22 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 57 -
Simple linear attention language models balance the recall-throughput tradeoff
Paper • 2402.18668 • Published • 21
Kiran Kamble
kiranr
AI & ML interests
nlp,llm
Recent Activity
authored
a paper
1 day ago
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
upvoted
a
paper
1 day ago
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
liked
a dataset
5 days ago
open-r1/Mixture-of-Thoughts
Organizations
Collections
1
models
1
datasets
0
None public yet