Reliable and Efficient Amortized Model-Based Evaluation

updated 8 days ago

Datasets and Models for the REEval project

  • stair-lab/reeval

    Viewer • Updated 4 days ago • 5.69M • 53

  • stair-lab/reeval-difficulty-for-helm

    Viewer • Updated Mar 18 • 217k • 42