Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
stair-lab 's Collections
Reliable and Efficient Amortized Model-Based Evaluation
Nonmyopic Bayesian Optimization in Dynamic Cost Settings
Bayes Optimal Survey Design
Finetuning and Comprehensive Evaluation of Vietnamese LLM
Dynamics of Learning

Reliable and Efficient Amortized Model-Based Evaluation

updated Dec 25, 2024

Datasets and Models for the REEval project

Upvote
-

  • stair-lab/reeval_responses

    Viewer • Updated Nov 29, 2024 • 500k • 55

  • stair-lab/reeval_jsons

    Updated Nov 22, 2024 • 106

  • stair-lab/reeval_results_Mistral-7B-v0.3

    Updated Jan 1 • 6.53k

  • stair-lab/reeval-sft-archived

    Updated Dec 30, 2024 • 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs