przvl
/

PopEuroBERT-binary-210m

Text Classification

political-speech

Model card Files Files and versions

przvl commited on Mar 21

Commit

c8fe128

·

verified ·

1 Parent(s): 6edbff0

Update README.md

Files changed (1) hide show

README.md +17 -18

README.md CHANGED Viewed

@@ -117,24 +117,23 @@ Predicted class: Populist (Confidence: 0.90)
 ## Evaluation
-### Test Set Results (Threshold = 0.5)
-| Metric    | Score  |
-| --------- | ------ |
-| Accuracy  | 75.99% |
-| Precision | 73.78% |
-| Recall    | 80.66% |
-| F1 Score  | 77.07% |
-| Loss      | 0.4959 |
-### Test Set Results (Optimized Threshold = 0.56)
-| Metric    | Score  |
-| --------- | ------ |
-| Accuracy  | 76.00% |
-| Precision | 76.00% |
-| Recall    | 76.00% |
-| F1 Score  | 76.00% |
 ## Limitations

 ## Evaluation
+For transparency, we compare this model with its smaller variant ([PopEuroBERT-210m](https://huggingface.co/przvl/PopEuroBERT-binary-210m)), both trained and evaluated on the same dataset and splits.
+### Test Set Performance (Threshold = 0.5)
+| Model              | Accuracy | Precision | Recall | F1 Score | Loss   |
+|--------------------|----------|-----------|--------|----------|--------|
+| **210M**           | 75.99%   | 73.78%    | 80.66% | 77.07%   | 0.4959 |
+| **610M (this)**    | 80.26%   | 78.42%    | 83.50% | 80.89%   | 0.4631 |
+### Test Set Performance (Optimized Threshold)
+| Model              | Threshold | Accuracy | Precision | Recall | F1 Score |
+|--------------------|-----------|----------|-----------|--------|----------|
+| **210M**           | 0.56      | 76.00%   | 76.00%    | 76.00% | 76.00%   |
+| **610M (this)**    | 0.43      | 79.81%   | 76.63%    | 85.78% | 80.94%   |
+PopEuroBERT-610m consistently outperforms the 210m variant across all metrics. It especially improves recall and F1 score, suggesting better identification of populist speech. The decision threshold (0.43) was tuned for balanced performance.
 ## Limitations