Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kenhktsui 's Collections
Self Correction Bench
FastText Model for Pretraining Data Curation
LongTalk
textbook-quality-classifier
CoT
nano-phi
VLM Data

Self Correction Bench

updated 25 days ago

Benchmarking LLM capability of external and internal error correction

Upvote
1

  • kenhktsui/scli5

    Viewer • Updated 23 days ago • 286 • 123

  • kenhktsui/gsm8k_sc

    Viewer • Updated 23 days ago • 1.31k • 92

  • kenhktsui/prm800k_sc

    Viewer • Updated 23 days ago • 448 • 92

  • Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

    Paper • 2507.02778 • Published 26 days ago • 9
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs