ankner's Collections

Oracle 2 Proxy Data

updated Jan 21

  • ankner/gsm8k-CoT

    Viewer • Updated Jan 17 • 8.78k • 104 • 1

  • ankner/gsm8k-sft

    Viewer • Updated Jan 19 • 1.1k • 30 • 1

  • ankner/gsm8k-rl

    Viewer • Updated Jan 19 • 7.68k • 16

  • ankner/gsm8k-rl-llama3-8b-base-labeled

    Viewer • Updated Jan 20 • 7.68k • 16

  • ankner/apps-sft

    Viewer • Updated Jan 12 • 3.51k • 22

  • ankner/apps-rl

    Viewer • Updated Jan 21 • 5.25k • 13

  • ankner/apps-rl-deepseek-7b-inst-labeled

    Viewer • Updated Jan 13 • 5.25k • 26

  • ankner/chat-pref

    Viewer • Updated Jan 17 • 39.7k • 9