Pratik Bhavsar PRO

pratikbhavsar

https://pakodas.substack.com

AI & ML interests

LLM agents, evaluation & reasoning

Recent Activity

liked a model 1 day ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

updated a Space 9 days ago

galileo-ai/agent-leaderboard

liked a model 10 days ago

Qwen/Qwen3-Coder-480B-A35B-Instruct

upvoted a collection 10 days ago

Qwen3

80 items • Updated 3 days ago • 978

upvoted an article 16 days ago

Article

5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub

By

and 1 other •

18 days ago

• 21

upvoted an article 19 days ago

Article

Combining Remote Reasoning with Local Models

By

•

Jun 26

• 12

upvoted 2 papers about 2 months ago

M^3FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset

Paper • 2506.02510 • Published Jun 3 • 3

DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models

Paper • 2504.15716 • Published Apr 22 • 11

upvoted an article 5 months ago

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 305

upvoted an article 6 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 877

upvoted a collection 6 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 162

upvoted a collection 12 months ago

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25, 2024 • 15

upvoted 2 collections over 1 year ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 68

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 137