Commit History

Swap test dataset
183f8cd

Tymec commited on

Parallelize text cleaning
1414454

Tymec commited on

Fix wrong sentiment mapping on the test dataset
3a96048

Tymec commited on

Fix broken tokenization
447f97e

Tymec commited on

Add slang map
e1645d7

Tymec commited on

Refactor typing and update tokenization rules
228859a

Tymec commited on

Ignore amazonreviews test
d09d1f6

Tymec commited on

Tokenization rework
2c1f9dd

Tymec commited on

Slight optimizations
0ca5366

Tymec commited on

Use spacy instead of nltk and move data functions to separate module
a092d54

Tymec commited on