Commit History

Fix broken tokenization
447f97e

Tymec committed on

Add slang map
e1645d7

Tymec committed on

Refactor typing and update tokenization rules
228859a

Tymec committed on

Ignore amazonreviews test
d09d1f6

Tymec committed on

Tokenization rework
2c1f9dd

Tymec committed on

Slight optimizations
0ca5366

Tymec committed on

Use spacy instead of nltk and move data functions to separate module
a092d54

Tymec committed on