Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kunal Suri's picture
19 8

Kunal Suri

suryakiran786
Tropy007's profile picture
·
  • suri-kunal

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago
Reward Bench 2
upvoted a paper 19 days ago
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
upvoted an article 30 days ago
TinyAgents: A Minimal Experiment with Code Agents and MCP Tools
View all activity

Organizations

Hugging Face MCP Course's profile picture

suryakiran786's activity

liked 2 Spaces 4 months ago
Running on CPU Upgrade
208
208

MMLU-Pro Leaderboard

🥇

More advanced and challenging multi-task evaluation

Running
568
568

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked 5 datasets 4 months ago

galileo-ai/agent-leaderboard

Viewer • Updated Feb 11 • 1.28k • 215 • 27

m-ric/agents_small_benchmark

Viewer • Updated Jan 19, 2024 • 100 • 149 • 11

Writer/omniact

Updated Apr 29, 2024 • 579 • 36

rabbit-hmi/MM-Mind2Web-tilde_test_snapshot_20dist

Viewer • Updated Jul 2, 2024 • 4.92k • 58 • 2

osunlp/Multimodal-Mind2Web

Viewer • Updated Jun 5, 2024 • 14.2k • 2.09k • 73
liked a Space 5 months ago
Running
19
19

TravelPlannerLeaderboard

💻

Display and submit evaluation results for travel planning

Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs