zhang's picture

6 1

zhang

kekueknu2

·

AI & ML interests

None yet

Organizations

upvoted an article 6 months ago

Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

By

•

Feb 4

• 16

upvoted an article 11 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 320

upvoted a collection over 1 year ago

LLM papers

It is a collection of papers that are useful in studying LLM. • 14 items • Updated Apr 3, 2024 • 14

upvoted a paper over 1 year ago

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 110

upvoted 2 collections over 1 year ago

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 34

Reading Papers

231 items • Updated 22 days ago • 11