Model Card for Model ID

Model Details

Model Description

Tasks Version Filter n-shot Metric Value Stderr
arc_challenge 1 none 25 acc 0.1775 ยฑ 0.0112
none 25 acc_norm 0.2150 ยฑ 0.0120
truthfulqa_mc2 2 none 0 acc 0.4571 ยฑ 0.0154
winogrande 1 none 5 acc 0.5107 ยฑ 0.014
hellaswag 1 none 10 acc 0.2735 ยฑ 0.0044
none 10 acc_norm 0.2801 ยฑ 0.0045
gsm8k 3 strict-match 5 exact_match 0.0030 ยฑ 0.0015
flexible-extract 5 exact_match 0.0099 ยฑ 0.0027

(0.25676491228070175, 0.004430608580958243)

Tasks Version Filter n-shot Metric Value Stderr
world_religions 0 none 5 acc 0.2281 ยฑ 0.0322
virology 0 none 5 acc 0.2289 ยฑ 0.0327
us_foreign_policy 0 none 5 acc 0.3000 ยฑ 0.0461
sociology 0 none 5 acc 0.2438 ยฑ 0.0304
security_studies 0 none 5 acc 0.2327 ยฑ 0.0270
public_relations 0 none 5 acc 0.2091 ยฑ 0.0390
professional_psychology 0 none 5 acc 0.2516 ยฑ 0.0176
professional_medicine 0 none 5 acc 0.4522 ยฑ 0.0302
professional_law 0 none 5 acc 0.2484 ยฑ 0.0110
professional_accounting 0 none 5 acc 0.2518 ยฑ 0.0259
prehistory 0 none 5 acc 0.2654 ยฑ 0.0246
philosophy 0 none 5 acc 0.2315 ยฑ 0.0240
nutrition 0 none 5 acc 0.2059 ยฑ 0.0232
moral_scenarios 0 none 5 acc 0.2380 ยฑ 0.0142
moral_disputes 0 none 5 acc 0.2486 ยฑ 0.0233
miscellaneous 0 none 5 acc 0.2874 ยฑ 0.0162
medical_genetics 0 none 5 acc 0.2900 ยฑ 0.0456
marketing 0 none 5 acc 0.2009 ยฑ 0.0262
management 0 none 5 acc 0.1845 ยฑ 0.0384
machine_learning 0 none 5 acc 0.2857 ยฑ 0.0429
logical_fallacies 0 none 5 acc 0.3190 ยฑ 0.0366
jurisprudence 0 none 5 acc 0.2685 ยฑ 0.0428
international_law 0 none 5 acc 0.2149 ยฑ 0.0375
human_sexuality 0 none 5 acc 0.2137 ยฑ 0.0360
human_aging 0 none 5 acc 0.2466 ยฑ 0.0289
high_school_world_history 0 none 5 acc 0.2616 ยฑ 0.0286
high_school_us_history 0 none 5 acc 0.2402 ยฑ 0.0300
high_school_statistics 0 none 5 acc 0.4722 ยฑ 0.0340
high_school_psychology 0 none 5 acc 0.2128 ยฑ 0.0175
high_school_physics 0 none 5 acc 0.2781 ยฑ 0.0366
high_school_microeconomics 0 none 5 acc 0.3067 ยฑ 0.0300
high_school_mathematics 0 none 5 acc 0.2630 ยฑ 0.0268
high_school_macroeconomics 0 none 5 acc 0.2590 ยฑ 0.0222
high_school_government_and_politics 0 none 5 acc 0.3005 ยฑ 0.0331
high_school_geography 0 none 5 acc 0.3030 ยฑ 0.0327
high_school_european_history 0 none 5 acc 0.2667 ยฑ 0.0345
high_school_computer_science 0 none 5 acc 0.2900 ยฑ 0.0456
high_school_chemistry 0 none 5 acc 0.2956 ยฑ 0.0321
high_school_biology 0 none 5 acc 0.2871 ยฑ 0.0257
global_facts 0 none 5 acc 0.2600 ยฑ 0.0441
formal_logic 0 none 5 acc 0.1667 ยฑ 0.0333
elementary_mathematics 0 none 5 acc 0.2566 ยฑ 0.0225
electrical_engineering 0 none 5 acc 0.2414 ยฑ 0.0357
econometrics 0 none 5 acc 0.2719 ยฑ 0.0419
conceptual_physics 0 none 5 acc 0.3319 ยฑ 0.0308
computer_security 0 none 5 acc 0.2000 ยฑ 0.0402
college_physics 0 none 5 acc 0.1863 ยฑ 0.0387
college_medicine 0 none 5 acc 0.1965 ยฑ 0.0303
college_mathematics 0 none 5 acc 0.2700 ยฑ 0.0446
college_computer_science 0 none 5 acc 0.2300 ยฑ 0.0423
college_chemistry 0 none 5 acc 0.2000 ยฑ 0.0402
college_biology 0 none 5 acc 0.2431 ยฑ 0.0359
clinical_knowledge 0 none 5 acc 0.2226 ยฑ 0.0256
business_ethics 0 none 5 acc 0.2100 ยฑ 0.0409
astronomy 0 none 5 acc 0.1842 ยฑ 0.0315
anatomy 0 none 5 acc 0.3407 ยฑ 0.0409
abstract_algebra 0 none 5 acc 0.2400 ยฑ 0.0429
  • Developed by: me
  • Funded by [optional]: nobody
  • Shared by [optional]: me
  • Model type: mistral
  • Language(s) (NLP): english (pile)
  • License: apache
  • Finetuned from model [optional]: none

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month
11
Safetensors
Model size
111M params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support