Prakhar Dixit

pdx97

upvoted a paper 4 months ago

SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation

Paper • 2410.13293 • Published Oct 17, 2024 • 3

upvoted a paper 6 months ago

ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents

Paper • 2308.08737 • Published Aug 17, 2023 • 1

upvoted 2 papers about 1 year ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 39

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 60

upvoted an article about 1 year ago

Preference Tuning LLMs with Direct Preference Optimization Methods

Article • Jan 18, 2024 • 66