Model Card for Model ID

Model Details

Model Description

Tasks Version Filter n-shot Metric Value Stderr
arc_challenge 1 none 25 acc 0.1775 ยฑ 0.0112
none 25 acc_norm 0.2065 ยฑ 0.0118
truthfulqa_mc2 2 none 0 acc 0.4633 ยฑ 0.0155
winogrande 1 none 5 acc 0.5075 ยฑ 0.0141
hellaswag 1 none 10 acc 0.2685 ยฑ 0.0044
none 10 acc_norm 0.2746 ยฑ 0.0045
gsm8k 3 strict-match 5 exact_match 0.0023 ยฑ 0.0013
flexible-extract 5 exact_match 0.0152 ยฑ 0.0034

(0.26113333333333333, 0.004443523026985591)

Tasks Version Filter n-shot Metric Value Stderr
world_religions 0 none 5 acc 0.2047 ยฑ 0.0309
virology 0 none 5 acc 0.1807 ยฑ 0.0300
us_foreign_policy 0 none 5 acc 0.2700 ยฑ 0.0446
sociology 0 none 5 acc 0.2488 ยฑ 0.0306
security_studies 0 none 5 acc 0.3347 ยฑ 0.0302
public_relations 0 none 5 acc 0.2273 ยฑ 0.0401
professional_psychology 0 none 5 acc 0.2042 ยฑ 0.0163
professional_medicine 0 none 5 acc 0.4485 ยฑ 0.0302
professional_law 0 none 5 acc 0.2458 ยฑ 0.0110
professional_accounting 0 none 5 acc 0.2163 ยฑ 0.0246
prehistory 0 none 5 acc 0.2222 ยฑ 0.0231
philosophy 0 none 5 acc 0.2379 ยฑ 0.0242
nutrition 0 none 5 acc 0.2810 ยฑ 0.0257
moral_scenarios 0 none 5 acc 0.2659 ยฑ 0.0148
moral_disputes 0 none 5 acc 0.2428 ยฑ 0.0231
miscellaneous 0 none 5 acc 0.2375 ยฑ 0.0152
medical_genetics 0 none 5 acc 0.3000 ยฑ 0.0461
marketing 0 none 5 acc 0.1966 ยฑ 0.0260
management 0 none 5 acc 0.1553 ยฑ 0.0359
machine_learning 0 none 5 acc 0.3304 ยฑ 0.0446
logical_fallacies 0 none 5 acc 0.2331 ยฑ 0.0332
jurisprudence 0 none 5 acc 0.2407 ยฑ 0.0413
international_law 0 none 5 acc 0.3306 ยฑ 0.0429
human_sexuality 0 none 5 acc 0.2595 ยฑ 0.0384
human_aging 0 none 5 acc 0.2063 ยฑ 0.0272
high_school_world_history 0 none 5 acc 0.2658 ยฑ 0.0288
high_school_us_history 0 none 5 acc 0.2745 ยฑ 0.0313
high_school_statistics 0 none 5 acc 0.4722 ยฑ 0.0340
high_school_psychology 0 none 5 acc 0.2330 ยฑ 0.0181
high_school_physics 0 none 5 acc 0.3311 ยฑ 0.0384
high_school_microeconomics 0 none 5 acc 0.3403 ยฑ 0.0308
high_school_mathematics 0 none 5 acc 0.2630 ยฑ 0.0268
high_school_macroeconomics 0 none 5 acc 0.3205 ยฑ 0.0237
high_school_government_and_politics 0 none 5 acc 0.3679 ยฑ 0.0348
high_school_geography 0 none 5 acc 0.3283 ยฑ 0.0335
high_school_european_history 0 none 5 acc 0.2606 ยฑ 0.0343
high_school_computer_science 0 none 5 acc 0.2800 ยฑ 0.0451
high_school_chemistry 0 none 5 acc 0.2956 ยฑ 0.0321
high_school_biology 0 none 5 acc 0.3194 ยฑ 0.0265
global_facts 0 none 5 acc 0.1600 ยฑ 0.0368
formal_logic 0 none 5 acc 0.1825 ยฑ 0.0346
elementary_mathematics 0 none 5 acc 0.2487 ยฑ 0.0223
electrical_engineering 0 none 5 acc 0.2966 ยฑ 0.0381
econometrics 0 none 5 acc 0.2632 ยฑ 0.0414
conceptual_physics 0 none 5 acc 0.2553 ยฑ 0.0285
computer_security 0 none 5 acc 0.1800 ยฑ 0.0386
college_physics 0 none 5 acc 0.2451 ยฑ 0.0428
college_medicine 0 none 5 acc 0.2312 ยฑ 0.0321
college_mathematics 0 none 5 acc 0.3200 ยฑ 0.0469
college_computer_science 0 none 5 acc 0.3000 ยฑ 0.0461
college_chemistry 0 none 5 acc 0.1800 ยฑ 0.0386
college_biology 0 none 5 acc 0.2778 ยฑ 0.0375
clinical_knowledge 0 none 5 acc 0.2340 ยฑ 0.0261
business_ethics 0 none 5 acc 0.2100 ยฑ 0.0409
astronomy 0 none 5 acc 0.1776 ยฑ 0.0311
anatomy 0 none 5 acc 0.2296 ยฑ 0.0363
abstract_algebra 0 none 5 acc 0.2200 ยฑ 0.0416
  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month
54
Safetensors
Model size
111M params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support