finiteautomata commited on
Commit
276c733
1 Parent(s): 900dd9f

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+
5
+ tags:
6
+ - twitter
7
+ - hate-speech
8
+
9
+ ---
10
+ # Hate Speech detection in Spanish
11
+ ## robertuito-hate-speech
12
+
13
+ Repository: [https://github.com/pysentimiento/pysentimiento/](https://github.com/finiteautomata/pysentimiento/)
14
+
15
+
16
+
17
+ Model trained with SemEval 2019 Task 5: HatEval (SubTask B) corpus for Hate Speech detection in English. Base model is [BERTweet](https://huggingface.co/vinai/bertweet-base), a RoBERTa model trained in English tweets.
18
+
19
+ It is a multi-classifier model, with the following classes:
20
+
21
+ - **HS**: is it hate speech?
22
+ - **TR**: is it targeted to a specific individual?
23
+ - **AG**: is it aggressive?
24
+
25
+
26
+ ## License
27
+
28
+ `pysentimiento` is an open-source library for non-commercial use and scientific research purposes only. Please be aware that models are trained with third-party datasets and are subject to their respective licenses.
29
+
30
+ 1. [TASS Dataset license](http://tass.sepln.org/tass_data/download.php)
31
+ 2. [SEMEval 2017 Dataset license]()
32
+
33
+ ## Citation
34
+
35
+ If you use `pysentimiento` in your work, please cite [this paper](https://arxiv.org/abs/2106.09462)
36
+
37
+ ```
38
+ @misc{perez2021pysentimiento,
39
+ title={pysentimiento: A Python Toolkit for Sentiment Analysis and SocialNLP tasks},
40
+ author={Juan Manuel Pérez and Juan Carlos Giudici and Franco Luque},
41
+ year={2021},
42
+ eprint={2106.09462},
43
+ archivePrefix={arXiv},
44
+ primaryClass={cs.CL}
45
+ }
46
+ ```
47
+ Enjoy! 🤗