Spaces:
Sleeping
Sleeping
Deepak Sahu
commited on
Commit
·
3bfe553
1
Parent(s):
26c4757
updating references
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- .gitattributes +1 -0
- README.md +45 -2
- _data/Cricket World Cup - Wikipedia.htm +3 -0
- _data/Cricket World Cup - Wikipedia_files/200px-Australian_World_Cup_treble.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/220px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/28px-Cricketball.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/300px-Australian_World_Cup_treble.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/31px-Sports_icon.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/330px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/35px-East_Africa_Cricket_Team_Flag.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/400px-Australian_World_Cup_treble.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/42px-Cricketball.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/440px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/47px-Sports_icon.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/56px-Cricketball.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/62px-Sports_icon.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Bla_002.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Bla_003.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Blades_.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blade_002.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blade_003.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blades_of.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC_002.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC_003.jpg +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg_002.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg_003.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg_002.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/East_Africa_Cricket_Team_Flag.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/East_Africa_Cricket_Team_Flag_002.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/Englandvictorylap.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/Englandvictorylap_002.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/Englandvictorylap_003.png +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013%E2%80%932021).svg_002.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013%E2%80%932021).svg_003.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013–2021).svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_002.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_003.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_004.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_005.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_006.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg_002.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg_003.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Bermuda.svg.webp +3 -0
- _data/Cricket World Cup - Wikipedia_files/Flag_of_Bermuda.svg_002.webp +3 -0
.gitattributes
CHANGED
@@ -36,3 +36,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
36 |
cache_vector_store_text/** filter=lfs diff=lfs merge=lfs -text
|
37 |
cache_vector_store_images/** filter=lfs diff=lfs merge=lfs -text
|
38 |
sentencepiece-0.1.91-cp37-cp37m-manylinux1_x86_64.whl filter=lfs diff=lfs merge=lfs -text
|
|
|
|
36 |
cache_vector_store_text/** filter=lfs diff=lfs merge=lfs -text
|
37 |
cache_vector_store_images/** filter=lfs diff=lfs merge=lfs -text
|
38 |
sentencepiece-0.1.91-cp37-cp37m-manylinux1_x86_64.whl filter=lfs diff=lfs merge=lfs -text
|
39 |
+
_data/** filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -10,7 +10,30 @@ pinned: false
|
|
10 |
short_description: Just another rag but with Images 🖼️
|
11 |
---
|
12 |
|
13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
14 |
|
15 |
## Local Debug
|
16 |
|
@@ -29,4 +52,24 @@ Highly recommend VS Code, makes life easy.
|
|
29 |
|
30 |
1. UI Blocks Concepts: https://huggingface.co/learn/nlp-course/en/chapter9/7
|
31 |
2. UI Row-Column Arrangement: https://www.gradio.app/guides/controlling-layout
|
32 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
short_description: Just another rag but with Images 🖼️
|
11 |
---
|
12 |
|
13 |
+
# Just another RAG, dont bother much!!
|
14 |
+
|
15 |
+
## File Descriptions
|
16 |
+
|
17 |
+
### `z_document_reader.py`
|
18 |
+
|
19 |
+
Images are only useful to limited resource computer if it has a caption. So this file helps parse the wikipedia html strips it off the tags.
|
20 |
+
|
21 |
+
### `z_embedding.py`
|
22 |
+
|
23 |
+
Generates vector store.
|
24 |
+
|
25 |
+
### `z_generate.py`
|
26 |
+
|
27 |
+
Use LLM and prompting to find the relevant texts and images stored in the vector stores.
|
28 |
+
|
29 |
+
## Adding more data sources
|
30 |
+
|
31 |
+
Currently limited to wikipedia pages downloaded as HTML.
|
32 |
+
|
33 |
+
1. Place the html in the folder `_data`
|
34 |
+
2. Run command
|
35 |
+
`python z_embedding.py`
|
36 |
+
3. Output will be two FAISS vectors stores in the folder `cache_vector...`
|
37 |
|
38 |
## Local Debug
|
39 |
|
|
|
52 |
|
53 |
1. UI Blocks Concepts: https://huggingface.co/learn/nlp-course/en/chapter9/7
|
54 |
2. UI Row-Column Arrangement: https://www.gradio.app/guides/controlling-layout
|
55 |
+
2. Show caption in image gallery: https://github.com/gradio-app/gradio/issues/3364
|
56 |
+
2. HF Implementation of basic: https://huggingface.co/learn/cookbook/en/advanced_rag
|
57 |
+
2. https://python.langchain.com/docs/integrations/vectorstores/faiss/
|
58 |
+
|
59 |
+
|
60 |
+
## Ideas
|
61 |
+
|
62 |
+
1. Shows frames of design patterns: https://www.falkordb.com/blog/advanced-rag/
|
63 |
+
2. HF Implementation of basic: https://huggingface.co/learn/cookbook/en/advanced_rag
|
64 |
+
2. HF RAG Evaluation: https://huggingface.co/learn/cookbook/en/rag_evaluation
|
65 |
+
2. HF Implementation by someone: https://medium.aiplanet.com/advanced-rag-implementation-on-custom-data-using-hybrid-search-embed-caching-and-mistral-ai-ce78fdae4ef6
|
66 |
+
2. HF Agentic Rag: https://huggingface.co/learn/cookbook/en/agent_rag
|
67 |
+
2. Future read, tooning https://huggingface.co/blog/lucifertrj/finetune-embeddings
|
68 |
+
2. Opinion on instruct embeddings: https://huggingface.co/blog/Tonic/instruct-embeddings-and-advanced-rag
|
69 |
+
2. Another Implementation: https://huggingface.co/learn/cookbook/en/rag_zephyr_langchain
|
70 |
+
2. Another Opinion on ray: https://www.anyscale.com/blog/retrieval-augmented-generation-with-huggingface-transformers-and-ray
|
71 |
+
2. Ray Follow up: https://github.com/run-llama/ai-engineer-workshop/blob/main/presentation.pdf?__s=2il5g6hpfc4mtmydioir
|
72 |
+
2. llama Index rag implementation: https://docs.llamaindex.ai/en/latest/optimizing/production_rag/
|
73 |
+
2. Just some termino book: https://www.projectpro.io/article/advanced-rag-techniques/1063
|
74 |
+
2. Another Evaluation Guide: https://pub.towardsai.net/evaluating-rag-metrics-across-different-retrieval-methods-770aa01380c8
|
75 |
+
2. Oracle Garbage: https://blogs.oracle.com/ai-and-datascience/post/ai-health-mixtral-oracle-23ai-rag-langchain-streamlit
|
_data/Cricket World Cup - Wikipedia.htm
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bef451e07204b8a4775d08482b4062432b7afddc7d05d9775f64279e384398b6
|
3 |
+
size 993621
|
_data/Cricket World Cup - Wikipedia_files/200px-Australian_World_Cup_treble.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/220px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/28px-Cricketball.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/300px-Australian_World_Cup_treble.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/31px-Sports_icon.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/330px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/35px-East_Africa_Cricket_Team_Flag.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/400px-Australian_World_Cup_treble.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/42px-Cricketball.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/440px-Glenn_McGrath_in_Circular_Quay,_Sydney,_Australia,_201.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/47px-Sports_icon.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/56px-Cricketball.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/62px-Sports_icon.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Bla_002.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Bla_003.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bat_of_ODI_World_Cup_winning_captains_at_Blades_.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blade_002.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blade_003.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Autographed_bats_of_ODI_World_Cup_winning_teams_at_Blades_of.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC_002.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Civic_Centre-2003_CWC_003.jpg
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg_002.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_Ireland_flag.svg_003.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Cricket_current_event.svg_002.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/East_Africa_Cricket_Team_Flag.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/East_Africa_Cricket_Team_Flag_002.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Englandvictorylap.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Englandvictorylap_002.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Englandvictorylap_003.png
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013%E2%80%932021).svg_002.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013%E2%80%932021).svg_003.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Afghanistan_(2013–2021).svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_002.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_003.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_004.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_005.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Australia_(converted).svg_006.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg_002.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Bangladesh.svg_003.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Bermuda.svg.webp
ADDED
![]() |
Git LFS Details
|
_data/Cricket World Cup - Wikipedia_files/Flag_of_Bermuda.svg_002.webp
ADDED
![]() |
Git LFS Details
|