Add imdb50k model
- README.md +5 -10
- models/imdb50k_tfidf_ft20000.pkl +3 -0
README.md
CHANGED
@@ -209,16 +209,11 @@ graph LR
 
 ### Pre-trained Models
 The following pre-trained models are available for use:
-| Dataset | Vectorizer | Features |
-| --- | --- | --- | --- | --- | --- |
-| `
-| `
-| `amazonreviews` | `
-
-The accuracy of the models is based on the cross-validation score using the `test` dataset and `5` folds.
-
-#### Note
-Due to the size of the `amazonreviews` dataset, it was not possible to train a model with a vectorizer other than `hashing`.
+| Dataset | Vectorizer | Classifier | Features | Accuracy on test | Accuracy on self | Model |
+| --- | --- | --- | --- | --- | --- | --- |
+| `imdb50k` | `tfidf` | `LinearRegression` | 20 000 | 59.59% ± 0.73% | 74.08% ± 0.14% | [Here](models/imdb50k_tfidf_ft20000.pkl) |
+| `sentiment140` | `tfidf` | `LinearRegression` | 20 000 | β | β | [Here](models/sentiment140_tfidf_ft20000.pkl) |
+| `amazonreviews` | `hashing` | `LinearRegression` | 1 048 576 | β | β | [Here](models/amazonreviews_hashing_ft1048576.pkl) |
 
 
 ## License
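The accuracy columns report values of the form `mean ± spread`, and the removed README text says the scores came from cross-validation with `5` folds. A minimal sketch of how such a cell could be produced from per-fold accuracies follows. It assumes the `±` term is a population standard deviation, which the diff does not confirm; `summarize_folds` and the fold scores are hypothetical and are not the values behind the table.

```python
from statistics import mean, pstdev


def summarize_folds(scores):
    """Format per-fold accuracies (fractions in [0, 1]) as 'mean% ± std%'."""
    if not scores:
        raise ValueError("need at least one fold score")
    avg = mean(scores) * 100
    # Assumption: the spread is a population standard deviation.
    spread = pstdev(scores) * 100
    return f"{avg:.2f}% ± {spread:.2f}%"


# Hypothetical accuracies from 5 folds, for illustration only.
fold_scores = [0.70, 0.72, 0.71, 0.69, 0.73]
print(summarize_folds(fold_scores))
```

A small spread relative to the mean, as in the `imdb50k` row, suggests the folds agree with each other; it says nothing about whether the mean itself is good.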
models/imdb50k_tfidf_ft20000.pkl
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f06582370d3fe31c878c0feec254d2a3fcd7f6f6140de10d1bd11e528853d66e
+size 3288