Josh Harris
jah242
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health
Information
upvoted
a
paper
about 1 year ago
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls
and Complex Instructions
upvoted
a
paper
about 1 year ago
Are We Done with MMLU?