peralp24 commited on
Commit
8c2d5e5
·
verified ·
1 Parent(s): 7c6354e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md CHANGED
@@ -131,6 +131,35 @@ print("Steered embedding causes higher similarity of query to TV show:")
131
  print(f"Similarity query/TV show ({sim1}) > similarity query/Italian polymath: ({sim2})")
132
  ```
133
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
134
 
135
 
136
 
 
131
  print(f"Similarity query/TV show ({sim1}) > similarity query/Italian polymath: ({sim2})")
132
  ```
133
 
134
+ ### Explanation of the instruct embedding code example
135
+
136
+ Pharia-1-Embedding-4608-control is useful for any use-case that relates to estimating the similarity/relevance between
137
+ text fragments. This is relevant for use-cases such as information retrieval, semantic search, re-ranking and clustering.
138
+ We use the task of information retrieval as a guiding example where we assume the
139
+ following query: “Which country is Galileo from?” and two documents:
140
+ - Galileo is a German television program series produced and broadcast on ProSieben television network. It is also sold to broadcasters in other countries (namely Russia and Poland). The first show was broadcast in 1998, and is now stored in the Arctic World Archive in Svalbard, Norway, after being transferred to special film created by Piql.
141
+ - Galileo di Vincenzo Bonaiuti de' Galilei (15 February 1564 - 8 January 1642), commonly referred to as Galileo Galilei or mononymously as Galileo, was an Italian (Florentine) astronomer, physicist and engineer, sometimes described as a polymath. He was born in the city of Pisa, then part of the Duchy of Florence and present-day Italy.
142
+ Source: Wikipedia
143
+ For our guiding example we assume the context of this use-case is a Question-Answer system for movies and TV shows.
144
+
145
+ **Step 1:**
146
+
147
+ Embed the Query
148
+ "input": "Which country is Galileo from?"
149
+ → Embedding: [-0.6780134, 0.61449033, 0.102911085, ...]
150
+
151
+
152
+
153
+
154
+
155
+
156
+
157
+
158
+
159
+
160
+
161
+
162
+
163
 
164
 
165