gcuomo
/

open-source-ai-t5-liar-lens

Text Generation

text-classification

text2text-generation

Model card Files Files and versions

gcuomo commited on May 9

Commit

2abcc91

·

verified ·

1 Parent(s): 98b0d4d

Update README.md

Files changed (1) hide show

README.md +28 -25

README.md CHANGED Viewed

@@ -29,26 +29,28 @@ So while the original LIAR dataset supplies factual claims from political discou
 ### Task Format
-The model reframes classification as a text-to-text task, generating a numeric label (as a string) for each claim:
-- `0`: pants-fire
-- `1`: false
-- `2`: barely-true
-- `3`: half-true
-- `4`: mostly-true
-- `5`: true
-**Example Input**:
 ```
-veracity: Open-source AI systems cannot hallucinate because they're transparent.
 ```
 **Example Output**:
 ```
-0
 ```
-You can map the numeric labels back to human-readable categories using a simple dictionary.
 ### Training Details
@@ -75,8 +77,11 @@ It is **not** intended for production-grade fact-checking or regulatory enforcem
 ### Example Usage
 ```python
 from transformers import T5ForConditionalGeneration, T5Tokenizer
 model = T5ForConditionalGeneration.from_pretrained(
     "gcuomo/open-source-ai-t5-liar-lens"
 )
@@ -84,20 +89,18 @@ tokenizer = T5Tokenizer.from_pretrained(
     "gcuomo/open-source-ai-t5-liar-lens"
 )
-label_map = {
-    "0": "pants-fire",
-    "1": "false",
-    "2": "barely-true",
-    "3": "half-true",
-    "4": "mostly-true",
-    "5": "true"
-}
-input_text = "veracity: Blockchain guarantees ethical outcomes in all AI systems."
-inputs = tokenizer(input_text, return_tensors="pt")
-output = model.generate(**inputs)
-prediction = tokenizer.decode(output[0], skip_special_tokens=True).strip()
-print("Predicted label:", label_map.get(prediction, prediction))
 ```
 ### Citation

 ### Task Format
+This model treats classification as a **text-to-text generation task**. Each input is a short claim or quote, and the model responds with one of six factuality labels, generated directly as a lowercase string:
+- `pants-fire`
+- `false`
+- `barely-true`
+- `half-true`
+- `mostly-true`
+- `true`
+The input format uses a summarization-style prefix to frame the task:
+**Example Input**:
 ```
+summarize: Python is the fastest programming language available.
 ```
 **Example Output**:
 ```
+half-true
 ```
+This response reflects the model’s ability to evaluate short-form claims with nuance, producing a graded label based on its understanding of truthfulness.
 ### Training Details
 ### Example Usage
 ```python
+### Example Usage
 from transformers import T5ForConditionalGeneration, T5Tokenizer
+# Load the fine-tuned model and tokenizer
 model = T5ForConditionalGeneration.from_pretrained(
     "gcuomo/open-source-ai-t5-liar-lens"
 )
     "gcuomo/open-source-ai-t5-liar-lens"
 )
+# Prepare input
+statement = "Blockchain guarantees ethical outcomes in all AI systems."
+prompt = f"summarize: {statement}"
+inputs = tokenizer(prompt, return_tensors="pt", padding=True, truncation=True, max_length=128)
+# Generate prediction
+output = model.generate(**inputs, max_new_tokens=8)
+prediction = tokenizer.decode(output[0], skip_special_tokens=True).strip().lower()
+# Print result
+print("Predicted label:", prediction)
 ```
 ### Citation