davidmezzetti committed on
Commit 7515d5e · 1 Parent(s): 6bf7048
Files changed (6)
  1. .gitattributes +2 -0
  2. README.md +30 -0
  3. config.json +8 -0
  4. model.safetensors +3 -0
  5. model.sqlite +3 -0
  6. vocab.json +3 -0
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ model.sqlite filter=lfs diff=lfs merge=lfs -text
+ vocab.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,30 @@
+ ---
+ tags:
+ - sentence-similarity
+ inference: false
+ license: pddl
+ language: en
+ library_name: staticvectors
+ ---
+
+ # GloVe 2024 WikiGiga StaticVectors model
+
+ This model is an export of the new [GloVe 2024 WikiGiga Vectors](https://nlp.stanford.edu/projects/glove/) (_300d_) for [`staticvectors`](https://github.com/neuml/staticvectors). `staticvectors` enables running inference in Python with NumPy. This helps it maintain solid runtime performance.
+
+ ## Usage with StaticVectors
+
+ ```python
+ from staticvectors import StaticVectors
+
+ model = StaticVectors("neuml/glove-2024-wikigiga")
+ model.embeddings(["word"])
+ ```
+
+ Given that pre-trained embeddings models can get quite large, there is also a SQLite version that lazily loads vectors.
+
+ ```python
+ from staticvectors import StaticVectors
+
+ model = StaticVectors("neuml/glove-2024-wikigiga/model.sqlite")
+ model.embeddings(["word"])
+ ```
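The README tags the model for sentence similarity, so a typical follow-up to `model.embeddings(...)` is comparing two vectors. The following is a minimal sketch and not part of the commit: `cosine_similarity` is a hypothetical helper, and the toy vectors stand in for embedding rows, assuming `model.embeddings([...])` yields one numeric vector per input word.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (hypothetical helper)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Treat a zero vector (for example, a missing word) as having no similarity.
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

# Toy 3-dimensional vectors for illustration; real rows would have 300 values.
toy_a = [1.0, 2.0, 3.0]
toy_b = [2.0, 4.0, 6.0]
print(cosine_similarity(toy_a, toy_b))
```

With a loaded model, the two arguments would be two rows of the embeddings result instead of the toy lists.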
config.json ADDED
@@ -0,0 +1,8 @@
+ {
+   "model_type": "staticvectors",
+   "storage": "safetensors",
+   "format": "text",
+   "source": "wiki_giga_2024_300_MFT20_vectors_seed_2024_alpha_0.75_eta_0.05_combined.txt",
+   "total": 1200000,
+   "dim": 300
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cee5f03734ac8206ee553740aa3e15b2f0dcf16787983eb83ac65a053696909e
+ size 1549376488
model.sqlite ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a2a2862a93b421bd5226ee434d923df8f66068f095989f3d03bb130009d1501e
+ size 1791074304
vocab.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:614fd4f9a1dc2862bad8406806c241e005707226dfaac8f5a4c1f1a44bde3eeb
+ size 25852205
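The three-line files above are Git LFS pointers: each records the spec version, a SHA-256 object id, and the size in bytes of the real file. As a minimal sketch of reading one, the hypothetical `parse_lfs_pointer` helper below splits each line into a key and a value. It assumes the simple `key value` layout shown in this diff and does no validation beyond that.

```python
def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a small dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }

# Pointer contents copied from the vocab.json hunk in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:614fd4f9a1dc2862bad8406806c241e005707226dfaac8f5a4c1f1a44bde3eeb
size 25852205
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algorithm"], info["size"])
```

The `size` field makes it possible to see the download cost of each file before fetching it; here `vocab.json` is about 26 MB, while the two model files are each over 1.5 GB.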