Spaces:

Tymec
/

sentiment-analysis

Running

Tymec commited on May 31, 2024

Commit

e50b20c

1 Parent(s): a5c3a23

Add dataset for testing

Files changed (2) hide show

README.md CHANGED Viewed

@@ -11,6 +11,7 @@ datasets:
   - mrshu/amazonreviews
   - stanfordnlp/sentiment140
   - stanfordnlp/imdb
 models:
   - spacy/en_core_web_sm
 ---
@@ -21,12 +22,13 @@ models:
 1. Clone the repository
 2. `cd` into the repository
 3. Run `just install` to install the dependencies
-4. Run `just app --help` to see the available commands
 ### Datasets
 - [Sentiment140](https://www.kaggle.com/datasets/kazanova/sentiment140)
 - [Amazon Reviews](https://www.kaggle.com/datasets/bittlingmayer/amazonreviews)
 - [IMDB](https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews)
 ### Required tools
 - `just`

   - mrshu/amazonreviews
   - stanfordnlp/sentiment140
   - stanfordnlp/imdb
+  - Sp1786/multiclass-sentiment-analysis-dataset
 models:
   - spacy/en_core_web_sm
 ---
 1. Clone the repository
 2. `cd` into the repository
 3. Run `just install` to install the dependencies
+4. Run `just run --help` to see the available commands
 ### Datasets
 - [Sentiment140](https://www.kaggle.com/datasets/kazanova/sentiment140)
 - [Amazon Reviews](https://www.kaggle.com/datasets/bittlingmayer/amazonreviews)
 - [IMDB](https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews)
+- [Multiclass Sentiment Analysis](https://huggingface.co/datasets/Sp1786/multiclass-sentiment-analysis-dataset) (Used only testing)
 ### Required tools
 - `just`

app/constants.py CHANGED Viewed

@@ -16,6 +16,9 @@ AMAZONREVIEWS_URL = "https://www.kaggle.com/datasets/bittlingmayer/amazonreviews
 IMDB50K_PATH = DATA_DIR / "imdb50k.csv"
 IMDB50K_URL = "https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews"
 CACHE_DIR.mkdir(exist_ok=True, parents=True)
 DATA_DIR.mkdir(exist_ok=True, parents=True)
 MODELS_DIR.mkdir(exist_ok=True, parents=True)

 IMDB50K_PATH = DATA_DIR / "imdb50k.csv"
 IMDB50K_URL = "https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews"
+TEST_DATASET_PATH = DATA_DIR / "test.csv"
+TEST_DATASET_URL = "https://huggingface.co/datasets/Sp1786/multiclass-sentiment-analysis-dataset"
 CACHE_DIR.mkdir(exist_ok=True, parents=True)
 DATA_DIR.mkdir(exist_ok=True, parents=True)
 MODELS_DIR.mkdir(exist_ok=True, parents=True)