File size: 7,165 Bytes

d078752
 
7aeaa4c
 
 
 
 
 
 
 
 
d078752
 
e309fb3
83a2612
d078752
e309fb3
 
 
 
 
 
d078752
 
bd123b8
d078752
bd123b8
d078752
bd123b8
 
d078752
3414f7e
 
 
bd123b8
3414f7e
 
 
 
7fe1cda
 
 
7aeaa4c
7fe1cda
d078752
7aeaa4c
 
d078752
7aeaa4c
 
 
d078752
7aeaa4c
02652df
7aeaa4c
d078752
7aeaa4c
bd123b8
d078752
1f12d7a
bd123b8
d078752
 
7aeaa4c
 
 
042f08e
7aeaa4c
bd123b8
d078752
7aeaa4c
d078752
7aeaa4c
d078752
1f12d7a
7aeaa4c
d078752
bd123b8
d078752
7aeaa4c
d078752
 
bd123b8
d078752
 
 
 
7aeaa4c
d078752
 
 
 
 
1f12d7a
7aeaa4c
 
 
e5fd2ca
7aeaa4c
d078752
 
 
1f12d7a
bd123b8
d078752
 
 
 
bd123b8
 
d078752
 
 
 
 
45377fb
 
7aeaa4c
 
45377fb
d078752
7aeaa4c
d078752
 
 
7aeaa4c
 
56714bb
7aeaa4c
 
 
45377fb
7aeaa4c

---
library_name: transformers
license: cc-by-nc-4.0
datasets:
- oumi-ai/oumi-c2d-d2c-subset
- oumi-ai/oumi-synthetic-claims
- oumi-ai/oumi-synthetic-document-claims
language:
- en
base_model:
- meta-llama/Llama-3.1-8B-Instruct
---

[![oumi logo](https://oumi.ai/logo_lockup_black.svg)](https://github.com/oumi-ai/oumi)
[![Made with Oumi](https://badgen.net/badge/Made%20with/Oumi/%23085CFF?icon=https%3A%2F%2Foumi.ai%2Flogo_dark.svg)](https://github.com/oumi-ai/oumi)

[![Documentation](https://img.shields.io/badge/Documentation-oumi-blue.svg)](https://oumi.ai/docs/en/latest/index.html)
[![Blog](https://img.shields.io/badge/Blog-oumi-blue.svg)](https://oumi.ai/blog)
[![Discord](https://img.shields.io/discord/1286348126797430814?label=Discord)](https://discord.gg/oumi)

# oumi-ai/HallOumi-8B-classifier

<!-- Provide a quick summary of what the model is/does. -->

Introducing **HallOumi-8B-classifier**, a _fast_ **SOTA hallucination detection model**, outperforming DeepSeek R1, OpenAI o1, Google Gemini 1.5 Pro, and Claude Sonnet 3.5 at only **8 billion parameters!**

Give HallOumi a try now!

* Demo: https://oumi.ai/halloumi-demo
* Github: https://github.com/oumi-ai/oumi/tree/main/configs/projects/halloumi

| Model                 | Macro F1 Score | Open?             | Model Size |
| --------------------- | -------------- | ----------------- | ---------- |
| **HallOumi-8B**           | **77.2% ± 2.2%**   | Truly Open Source | 8B         |
| Claude Sonnet 3.5     | 69.6% ± 2.8%   | Closed            | ??         |
| OpenAI o1-preview     | 65.9% ± 2.3%   | Closed            | ??         |
| DeepSeek R1           | 61.6% ± 2.5%   | Open Weights      | 671B       |
| Llama 3.1 405B        | 58.8% ± 2.4%   | Open Weights      | 405B       |
| Google Gemini 1.5 Pro | 48.2% ± 1.8%   | Closed            | ??         |

**HallOumi-8B-classifier**, the hallucination classification model built with Oumi, is an end-to-end binary classification system that enables *fast and accurate* assessment of the hallucination probability of any written content (AI or human-generated).
* ✔️ Fast with high accuracy
* ✔️ Per-claim support (must call once per claim)


## Hallucinations
Hallucinations are often cited as the most important issue with being able to deploy generative models in numerous commercial and personal applications, and for good reason:

* [Lawyers sanctioned for briefing where ChatGPT cited 6 fictitious cases](https://www.reuters.com/legal/new-york-lawyers-sanctioned-using-fake-chatgpt-cases-legal-brief-2023-06-22/)
* [Air Canada required to honor refund policy made up by its AI support chatbot](https://www.wired.com/story/air-canada-chatbot-refund-policy/)
* [AI suggesting users should make glue pizza and eat rocks](https://www.bbc.com/news/articles/cd11gzejgz4o)

It ultimately comes down to an issue of **trust** — generative models are trained to produce outputs which are **probabilistically likely**, but not necessarily **true**. 
While such tools are useful in the right hands, being unable to trust them prevents AI from being adopted more broadly, 
where it can be utilized safely and responsibly.

## Building Trust with Verifiability
To be able to begin trusting AI systems, we have to be able to verify their outputs. To verify, we specifically mean that we need to:

* Understand the **truthfulness** of a particular statement produced by any model (the key focus of **HallOumi-8B-classifier** model).
* Understand what **information supports that statement’s truth** and have **full traceability** connecting the statement to that information (provided by our *generative* [HallOumi model](https://huggingface.co/oumi-ai/HallOumi-8B))


- **Developed by:** [Oumi AI](https://oumi.ai/)
- **Model type:** Small Language Model
- **Language(s) (NLP):** English
- **License:** [CC-BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/deed.en) (due to ANLI data falling under the same license)
- **Finetuned from model:** [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
- **Demo:** [HallOumi Demo](https://oumi.ai/halloumi)

---

## Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users and those affected by the model. -->
Use to verify claims/detect hallucinations in scenarios where a known source of truth is available.

Demo: https://oumi.ai/halloumi-demo

## Out-of-Scope Use

<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
Smaller LLMs have limited capabilities and should be used with caution. Avoid using this model for purposes outside of claim verification.

## Bias, Risks, and Limitations

<!-- This section is meant to convey both technical and sociotechnical limitations. -->
This model was finetuned with Llama-3.1-405B-Instruct data on top of a Llama-3.1-8B-Instruct model, so any biases or risks associated with those models may be present.

## Training Details

### Training Data

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about and documentation related to data pre-processing or additional filtering. -->
Training data: 
- [oumi-ai/oumi-synthetic-document-claims](https://huggingface.co/datasets/oumi-ai/oumi-synthetic-document-claims)
- [oumi-ai/oumi-synthetic-claims](https://huggingface.co/datasets/oumi-ai/oumi-synthetic-claims)
- [oumi-ai/oumi-anli-subset](https://huggingface.co/datasets/oumi-ai/oumi-anli-subset)
- [oumi-ai/oumi-c2d-d2c-subset](https://huggingface.co/datasets/oumi-ai/oumi-c2d-d2c-subset)

### Training Procedure

<!-- This relates heavily to the Technical Specifications. Content here should link to that section when relevant to the training procedure. -->
For information on training, see https://oumi.ai/halloumi

## Evaluation

<!-- This section describes the evaluation protocols and provides the results. -->
Follow along with our notebook on how to evaluate hallucination with HallOumi and other popular models:
https://github.com/oumi-ai/oumi/blob/main/configs/projects/halloumi/halloumi_eval_notebook.ipynb

## Environmental Impact

<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->

- **Hardware Type:** A100-80GB
- **Hours used:** 1.5 (4 * 8 GPUs)
- **Cloud Provider:** Google Cloud Platform
- **Compute Region:** us-east5
- **Carbon Emitted:** 0.15 kg 

## Citation

<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->

```
@misc{oumiHalloumi8BClassifier,
  author = {Panos Achlioptas, Jeremy Greer, Konstantinos Aisopos, Michael Schuler, Oussama Elachqar, Emmanouil Koukoumidis},
  title = {HallOumi-8B-classifier},
  month = {March},
  year = {2025},
  url = {https://huggingface.co/oumi-ai/HallOumi-8B-classifier}
}

@software{oumi2025,
  author = {Oumi Community},
  title = {Oumi: an Open, End-to-end Platform for Building Large Foundation Models},
  month = {January},
  year = {2025},
  url = {https://github.com/oumi-ai/oumi}
}
```