Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
Boxi Yu
Bertsekas
Follow
0 followers
·
1 following
https://boxiyu.github.io/
BoshCavendish
BoxiYu
AI & ML interests
Coding Agent, Automated Operator
Recent Activity
authored
a paper
7 days ago
How Should I Build A Benchmark? Revisiting Code-Related Benchmarks For LLMs
authored
a paper
7 days ago
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
upvoted
a
paper
8 days ago
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
View all activity
Organizations
None yet
Bertsekas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
12 days ago
princeton-nlp/SWE-bench_Lite
Viewer
•
Updated
Mar 3
•
323
•
28.3k
•
40
liked
a Space
3 months ago
Running
on
Zero
327
327
Describe Anything
⚡
Generate descriptions from images using masks