Commit History

Update python dependencies
9b6760b

Tymec commited on

Update docstrings and comments
d4ef46b

Tymec commited on

Update notebook and add results
68bf0ed

Tymec commited on

Replace notebook with updated and improved version
e3c426e

Tymec commited on

Updated models
639911c

Tymec commited on

Update README with images
370bb72

Tymec commited on

Swap test dataset
183f8cd

Tymec commited on

Experiment notebook
2b747dc

Tymec commited on

Add amazonreviews model
baf0dee

Tymec commited on

Parallelize text cleaning
1414454

Tymec commited on

Add notebook for experimentation
a0c00be

Tymec commited on

Fix wrong sentiment mapping on the test dataset
3a96048

Tymec commited on

Update dependencies
53bc5fb

Tymec commited on

Add min-df option
8b10b79

Tymec commited on

Improved models
7f29122

Tymec commited on

Change df
cc21abf

Tymec commited on

Add new model trained on sentiment140
419453c

Tymec commited on

Cache label data along with tokenized text data
af84d9b

Tymec commited on

Add imdb50k model
e3095cd

Tymec commited on

Remove all pre-trained models
0fce9f0

Tymec commited on

Fix broken tokenization
447f97e

Tymec commited on

Add slang map
e1645d7

Tymec commited on

Update README architecture
d29d6fe

Tymec commited on

Add progress bar to serialize
632adc4

Tymec commited on

Add emoji dependency
ac221ce

Tymec commited on

More pre-trained models
421ea0c

Tymec commited on

Add more vectorizers, classifiers and CLI options
b0ade1a

Tymec commited on

Refactor typing and update tokenization rules
228859a

Tymec commited on

Update documentation
71069d7

Tymec commited on

Add document frequency threshold
c5ed75e

Tymec commited on

Ignore amazonreviews test
d09d1f6

Tymec commited on

Chunked serialization
afaacd1

Tymec commited on

Update options, force GC, tweak parameters and add flags
18cc46a

Tymec commited on

Ability to change number of parallel jobs for search
8471e78

Tymec commited on

Create model in train_model
3854a1f

Tymec commited on

New model trained on imbb50k dataset
23e75e7

Tymec commited on

Tokenization rework
2c1f9dd

Tymec commited on

Add dataset for testing
e50b20c

Tymec commited on

Add prototyping notebook
a5c3a23

Tymec commited on

Rename app command
db8f6b2

Tymec commited on

Slight optimizations
0ca5366

Tymec commited on

Remove prev entry point
63ffb6b

Tymec commited on

Change HF entry point and add examples
b42b884

Tymec commited on

Handle missing spacy model
7ce074d

Tymec commited on

Remove check file size action
308dcf9

Tymec commited on

fix typo
3178817

Tymec commited on

Import spacy model as module
7b9e59d

Tymec commited on

Remove unused dependencies
16e15df

Tymec commited on

Update HF config
bf1042d

Tymec commited on

Add github actions
edfb539

Tymec commited on