Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Pratik Bhavsar's picture
4 8 94

Pratik Bhavsar

pratikbhavsar
Almazyood's profile picture the12kilu's profile picture aimlresearch2023's profile picture
·
https://pakodas.substack.com
  • nlpguy_
  • bhavsarpratik
  • bhavsarpratik

AI & ML interests

LLM agents, evaluation & reasoning

Recent Activity

upvoted a paper 2 days ago
M^3FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset
upvoted a paper 2 days ago
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models
liked a model 4 days ago
mistralai/Magistral-Small-2506
View all activity

Organizations

Maximalists's profile picture Galileo's profile picture

Articles 1

Article
22

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

Collections 1

Agent Leaderboard
  • Running on CPU Upgrade
    333
    333

    Agent Leaderboard

    💬

    Ranking of LLMs for agentic tasks

  • galileo-ai/agent-leaderboard

    Viewer • Updated Feb 11 • 1.28k • 215 • 27

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs