Commit History

Update docstrings and comments
d4ef46b

Tymec commited on

Swap test dataset
183f8cd

Tymec commited on

Parallelize text cleaning
1414454

Tymec commited on

Cache label data along with tokenized text data
af84d9b

Tymec commited on

Add progress bar to serialize
632adc4

Tymec commited on

Refactor typing and update tokenization rules
228859a

Tymec commited on

Chunked serialization
afaacd1

Tymec commited on

Completely change the structure of the project
85ac990

Tymec commited on

Restructure project into package structure
667fe9d

Tymec commited on