Sherlock
eyuansu71
AI & ML interests
None yet
Recent Activity
commented on
a paper
24 days ago
One Token to Fool LLM-as-a-Judge
upvoted
a
paper
about 1 month ago
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in
Real-World Applications
updated
a dataset
3 months ago
FlagEval/HMMT_2025