Anirbanbhk
commited on
Commit
·
6a445c4
1
Parent(s):
018871a
updated readme
Browse files
README.md
CHANGED
@@ -6,4 +6,16 @@ datasets:
|
|
6 |
- movies subtitles
|
7 |
---
|
8 |
|
9 |
-
# bert-based-uncased-hatespeech-movies:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
- movies subtitles
|
7 |
---
|
8 |
|
9 |
+
# bert-based-uncased-hatespeech-movies:
|
10 |
+
A hatespeech model used to classify text as **normal**, **offensive**, **hatespeech** in Movie subtitles. The model is initially a pre-trained transformer model(bert-based-uncased) which is further trained on Twitter comments which can be normal, offensive and hate to learn the context from social media data. It is then fine-tuned using the movie subtitles dataset.
|
11 |
+
|
12 |
+
Please check our paper and if used please cite
|
13 |
+
```
|
14 |
+
@article{von2021hateful,
|
15 |
+
title={How Hateful are Movies? A Study and Prediction on Movie Subtitles},
|
16 |
+
author={von Boguszewski, Niklas and Moin, Sana and Bhowmick, Anirban and Yimam, Seid Muhie and Biemann, Chris},
|
17 |
+
journal={arXiv preprint arXiv:2108.10724},
|
18 |
+
year={2021}
|
19 |
+
}
|
20 |
+
```
|
21 |
+
The dataset and models are available on https://github.com/uhh-lt/hatespeech
|