Model Card for Model ID

Model Details

Model Description

Tasks Version Filter n-shot Metric Value Stderr
arc_challenge 1 none 25 acc 0.1792 ยฑ 0.0112
none 25 acc_norm 0.2065 ยฑ 0.0118
truthfulqa_mc2 2 none 0 acc 0.4553 ยฑ 0.0154
winogrande 1 none 5 acc 0.4972 ยฑ 0.0141
hellaswag 1 none 10 acc 0.2703 ยฑ 0.0044
none 10 acc_norm 0.2796 ยฑ 0.0045
gsm8k 3 strict-match 5 exact_match 0.0000 ยฑ 0.0000
flexible-extract 5 exact_match 0.0144 ยฑ 0.0033

(0.24656842105263158, 0.004373961821155628)

Tasks Version Filter n-shot Metric Value Stderr
world_religions 0 none 5 acc 0.2573 ยฑ 0.0335
virology 0 none 5 acc 0.2831 ยฑ 0.0351
us_foreign_policy 0 none 5 acc 0.2500 ยฑ 0.0435
sociology 0 none 5 acc 0.2438 ยฑ 0.0304
security_studies 0 none 5 acc 0.2327 ยฑ 0.0270
public_relations 0 none 5 acc 0.2273 ยฑ 0.0401
professional_psychology 0 none 5 acc 0.2500 ยฑ 0.0175
professional_medicine 0 none 5 acc 0.4485 ยฑ 0.0302
professional_law 0 none 5 acc 0.2458 ยฑ 0.0110
professional_accounting 0 none 5 acc 0.2624 ยฑ 0.0262
prehistory 0 none 5 acc 0.2130 ยฑ 0.0228
philosophy 0 none 5 acc 0.1929 ยฑ 0.0224
nutrition 0 none 5 acc 0.2222 ยฑ 0.0238
moral_scenarios 0 none 5 acc 0.2380 ยฑ 0.0142
moral_disputes 0 none 5 acc 0.2486 ยฑ 0.0233
miscellaneous 0 none 5 acc 0.2644 ยฑ 0.0158
medical_genetics 0 none 5 acc 0.3000 ยฑ 0.0461
marketing 0 none 5 acc 0.1752 ยฑ 0.0249
management 0 none 5 acc 0.1748 ยฑ 0.0376
machine_learning 0 none 5 acc 0.2500 ยฑ 0.0411
logical_fallacies 0 none 5 acc 0.2945 ยฑ 0.0358
jurisprudence 0 none 5 acc 0.2593 ยฑ 0.0424
international_law 0 none 5 acc 0.2479 ยฑ 0.0394
human_sexuality 0 none 5 acc 0.2595 ยฑ 0.0384
human_aging 0 none 5 acc 0.2466 ยฑ 0.0289
high_school_world_history 0 none 5 acc 0.2911 ยฑ 0.0296
high_school_us_history 0 none 5 acc 0.2794 ยฑ 0.0315
high_school_statistics 0 none 5 acc 0.4722 ยฑ 0.0340
high_school_psychology 0 none 5 acc 0.1927 ยฑ 0.0169
high_school_physics 0 none 5 acc 0.1987 ยฑ 0.0326
high_school_microeconomics 0 none 5 acc 0.2227 ยฑ 0.0270
high_school_mathematics 0 none 5 acc 0.2667 ยฑ 0.0270
high_school_macroeconomics 0 none 5 acc 0.2103 ยฑ 0.0207
high_school_government_and_politics 0 none 5 acc 0.2435 ยฑ 0.0310
high_school_geography 0 none 5 acc 0.1717 ยฑ 0.0269
high_school_european_history 0 none 5 acc 0.2485 ยฑ 0.0337
high_school_computer_science 0 none 5 acc 0.2700 ยฑ 0.0446
high_school_chemistry 0 none 5 acc 0.2906 ยฑ 0.0319
high_school_biology 0 none 5 acc 0.2774 ยฑ 0.0255
global_facts 0 none 5 acc 0.1600 ยฑ 0.0368
formal_logic 0 none 5 acc 0.1508 ยฑ 0.0320
elementary_mathematics 0 none 5 acc 0.2540 ยฑ 0.0224
electrical_engineering 0 none 5 acc 0.2414 ยฑ 0.0357
econometrics 0 none 5 acc 0.2544 ยฑ 0.0410
conceptual_physics 0 none 5 acc 0.2638 ยฑ 0.0288
computer_security 0 none 5 acc 0.2600 ยฑ 0.0441
college_physics 0 none 5 acc 0.2157 ยฑ 0.0409
college_medicine 0 none 5 acc 0.2081 ยฑ 0.0310
college_mathematics 0 none 5 acc 0.2300 ยฑ 0.0423
college_computer_science 0 none 5 acc 0.3100 ยฑ 0.0465
college_chemistry 0 none 5 acc 0.2000 ยฑ 0.0402
college_biology 0 none 5 acc 0.2431 ยฑ 0.0359
clinical_knowledge 0 none 5 acc 0.2415 ยฑ 0.0263
business_ethics 0 none 5 acc 0.1600 ยฑ 0.0368
astronomy 0 none 5 acc 0.1776 ยฑ 0.0311
anatomy 0 none 5 acc 0.3407 ยฑ 0.0409
abstract_algebra 0 none 5 acc 0.2200 ยฑ 0.0416
  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month
44
Safetensors
Model size
111M params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support