Deepak Sahu committed
Commit bd396f2 · 1 Parent(s): 0720e54
Files changed (4):
  1. .resources/eval7.png +3 -0
  2. .resources/preview.png +2 -2
  3. README.md +4 -4
  4. app.py +1 -1
.resources/eval7.png ADDED

Git LFS Details

  • SHA256: 9b6fe8f989149bbb095ed133bf619670616842ce39bb5362327bd6a0334d468b
  • Pointer size: 130 Bytes
  • Size of remote file: 21.4 kB
.resources/preview.png CHANGED

Git LFS Details (before)

  • SHA256: a3d938839b6f20377cb870e9ed9373a59e1069d349b0e67558008bea7f61d94f
  • Pointer size: 131 Bytes
  • Size of remote file: 125 kB

Git LFS Details (after)

  • SHA256: deb5bde342ac1dbdc52563bcb159dc17fd6fe5c3e296e2e182d8cf52428861d4
  • Pointer size: 131 Bytes
  • Size of remote file: 104 kB
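For context, the "Pointer size" above is the size of the small pointer file that Git actually stores in place of the image. A Git LFS pointer is a plain-text stub of roughly this shape (the oid below is the eval7.png SHA256 listed above; the `size` field holds the exact byte count of the real file, which the page only reports rounded, so the number here is approximate):

```text
version https://git-lfs.github.com/spec/v1
oid sha256:9b6fe8f989149bbb095ed133bf619670616842ce39bb5362327bd6a0334d468b
size 21400
```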
README.md CHANGED

@@ -236,14 +236,12 @@ The generation is handled by functions in script `z_hypothetical_summary.py`. Un
 
 ![image](.resources/eval5.png)
 
-### Evaluation Metric
+### Evaluation Metric & Result
 
 So for a given input title we can get the rank (by descending cosine similarity) of the stored title. To evaluate the entire approach we are going to use a modified version of **Mean Reciprocal Rank (MRR)**.
 
 ![image](.resources/eval6.png)
 
-
-
 Test Plan:
 - Take 30 random samples and compute the mean of their reciprocal ranks.
 - If we want our known book titles to be in the top 5 results then MRR >= 1/5 = 0.2
@@ -254,12 +252,14 @@ Test Plan:
 python z_evaluate.py
 ```
 
-![image](https://github.com/user-attachments/assets/d2c77d47-9244-474a-a850-d31fb914c9ca)
+![image](.resources/eval7.png)
 
 The values of TOP_P and TOP_K (i.e. token sampling for our generator model) are set as `CONST`s in `z_evaluate.py`; the current values are borrowed from https://www.kaggle.com/code/tuckerarrants/text-generation-with-huggingface-gpt2#Top-K-and-Top-P-Sampling
 
 MRR = 0.311 implies that there's a good chance that the target book will be at rank (1/0.311) ~ 3 (third rank), **i.e. within the top 5 recommendations**
 
+> TODO: A sampling study could be done to better support this conclusion.
+
 ## Inference
 
 `app.py` is written so that it works best with the Gradio interface on Hugging Face, although you can try it out locally as well :)
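The metric in the README hunk above is straightforward to state in code. Here is a minimal, self-contained sketch of the mean-reciprocal-rank check from the test plan; `mean_reciprocal_rank` and the rank values are illustrative, not taken from the repo, and the real pipeline would obtain each rank from the descending cosine-similarity position of the stored title.

```python
# Minimal sketch of the MRR check described in the README's test plan.
# The ranks below are made-up example values; in the project they would
# come from the retrieval step (cosine-similarity rank of the target title).

def mean_reciprocal_rank(ranks):
    """Mean of reciprocal ranks; ranks are 1-based positions."""
    return sum(1.0 / r for r in ranks) / len(ranks)

# Example: target titles found at these ranks for 5 sampled queries.
ranks = [1, 3, 2, 5, 10]
mrr = mean_reciprocal_rank(ranks)

# Pass criterion from the test plan: MRR >= 1/5 = 0.2 means known titles
# tend to land within the top 5 results.
print(round(mrr, 3), mrr >= 0.2)  # → 0.427 True
```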
app.py CHANGED

@@ -93,4 +93,4 @@ demo = gr.Interface(
     description=GRADIO_DESCRIPTION
 )
 
-demo.launch()
+demo.launch(share=True)
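The README hunk above mentions the TOP_P and TOP_K constants that control token sampling for the generator. As a toy, self-contained illustration of what combined top-k / top-p (nucleus) filtering does to a next-token distribution — the helper name, the distribution, and the cutoff values are invented for this sketch, not taken from `z_evaluate.py`:

```python
# Toy illustration of top-k / top-p (nucleus) filtering of a next-token
# distribution. All names and numbers here are illustrative.

def top_k_top_p_filter(probs, top_k, top_p):
    """Keep the top_k most likely tokens, then the smallest prefix of them
    whose cumulative probability reaches top_p; renormalize the survivors."""
    items = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    kept, cum = [], 0.0
    for tok, p in items:
        kept.append((tok, p))
        cum += p
        if cum >= top_p:
            break
    total = sum(p for _, p in kept)
    return {tok: p / total for tok, p in kept}

probs = {"book": 0.5, "novel": 0.2, "story": 0.15, "title": 0.1, "misc": 0.05}
# "title" and "misc" are cut: the top three tokens already reach top_p=0.8.
print(top_k_top_p_filter(probs, top_k=4, top_p=0.8))
```

Sampling then draws from the filtered, renormalized distribution, which is why these two constants change the generated hypothetical summaries and hence the evaluation result.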