Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
3
Yao Xuesong
NathanYao
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
about 18 hours ago
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
authored
a paper
about 18 hours ago
Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?
upvoted
a
paper
3 months ago
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
View all activity
Organizations
None yet
Papers
2
arxiv:
2504.00509
arxiv:
2501.02506
models
None public yet
datasets
None public yet