s-emanuilov committed on
Commit
a9317c3
·
verified ·
1 Parent(s): 6d7eaf5

Update README.md

Files changed (1)
  1. README.md +69 -3
README.md CHANGED
@@ -1,3 +1,69 @@
- ---
- license: llama3.2
- ---
+ ---
+ license: llama3.2
+ tags:
+ - llama.cpp
+ - gguf
+ - query-expansion
+ datasets:
+ - s-emanuilov/query-expansion
+ base_model:
+ - meta-llama/Llama-3.2-3B-GGUF
+ ---
+ # Query Expansion GGUF - based on Llama-3.2-3B
+
+ GGUF quantized version of Llama-3.2-3B for the query expansion task.
+ Part of a collection of query expansion models available in different architectures and sizes.
+
+ ## Overview
+
+ **Task:** Search query expansion
+ **Base model:** [Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct)
+ **Training data:** [Query Expansion Dataset](https://huggingface.co/datasets/s-emanuilov/query-expansion)
+
+ <img src="static/query-expansion-model.jpg" alt="Query Expansion Model" width="600px" />
+
+ ## Quantized Versions
+
+ The model is available in multiple quantization formats:
+ - F16 (Original size)
+ - Q8_0 (~8-bit quantization)
+ - Q5_K_M (~5-bit quantization)
+ - Q4_K_M (~4-bit quantization)
+ - Q3_K_M (~3-bit quantization)
+
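As a rough illustration (not taken from the README itself), the quantization list can be turned into ballpark file sizes with `size ≈ parameters × bits / 8`. The sketch below assumes about 3.2 billion parameters for Llama-3.2-3B and uses the nominal bit widths from the "~N-bit" labels. Both are assumptions: real GGUF files come out somewhat larger because K-quants mix precisions and store per-block scales and metadata. The names `PARAM_COUNT`, `NOMINAL_BITS`, and `estimate_size_gb` are hypothetical.

```python
# Rough weight-size estimate for each quantization level.
# Assumptions (not from the README): ~3.2e9 parameters, nominal bit widths.

PARAM_COUNT = 3.2e9  # assumed parameter count for Llama-3.2-3B

# Nominal bits per weight, read off the "~N-bit" labels in the list.
NOMINAL_BITS = {
    "F16": 16,
    "Q8_0": 8,
    "Q5_K_M": 5,
    "Q4_K_M": 4,
    "Q3_K_M": 3,
}


def estimate_size_gb(bits_per_weight: float, params: float = PARAM_COUNT) -> float:
    """Return the approximate weight size in gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for name, bits in NOMINAL_BITS.items():
        print(f"{name}: ~{estimate_size_gb(bits):.1f} GB")
```

Treat the numbers as a lower bound when deciding which variant fits in memory; the published file sizes are the authoritative figures.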
+ ## Related Models
+
+ ### LoRA Adapters
+ - [Qwen2.5-3B](https://huggingface.co/s-emanuilov/query-expansion-Qwen2.5-3B)
+ - [Qwen2.5-7B](https://huggingface.co/s-emanuilov/query-expansion-Qwen2.5-7B)
+ - [Llama-3.2-3B](https://huggingface.co/s-emanuilov/query-expansion-Llama-3.2-3B)
+
+ ### GGUF Variants
+ - [Qwen2.5-3B-GGUF](https://huggingface.co/s-emanuilov/query-expansion-Qwen2.5-3B-GGUF)
+ - [Qwen2.5-7B-GGUF](https://huggingface.co/s-emanuilov/query-expansion-Qwen2.5-7B-GGUF)
+
+ ## Details
+ This model is designed for enhancing search and retrieval systems by generating semantically relevant query expansions.
+
+ It could be useful for:
+ - Advanced RAG systems
+ - Search enhancement
+ - Query preprocessing
+ - Low-latency query expansion
+
+ ## Example
+
+ **Input:** "apple stock"
+ **Expansions:**
+ - "apple market"
+ - "apple news"
+ - "apple stock price"
+ - "apple stock forecast"
+
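One way expansions like these might be used downstream is to merge them with the original query before sending them to a retriever. The sketch below is an assumption about usage, not the author's pipeline: `build_search_queries` is a hypothetical helper that normalizes whitespace and case, drops duplicates, and keeps the original query first.

```python
from __future__ import annotations


def build_search_queries(query: str, expansions: list[str], limit: int = 5) -> list[str]:
    """Return the original query followed by unique, non-empty expansions.

    Hypothetical helper for illustration; it is not part of the model or
    of any library.
    """
    seen: set[str] = set()
    result: list[str] = []
    for candidate in [query, *expansions]:
        # Lowercase and collapse runs of whitespace so near-duplicates match.
        normalized = " ".join(candidate.lower().split())
        if normalized and normalized not in seen:
            seen.add(normalized)
            result.append(normalized)
    # Cap the number of queries sent to the retriever.
    return result[:limit]


if __name__ == "__main__":
    expansions = [
        "apple market",
        "apple news",
        "apple stock price",
        "apple stock forecast",
    ]
    for q in build_search_queries("apple stock", expansions):
        print(q)
```

Keeping the original query first means a retriever that weights earlier queries more heavily still prioritizes what the user actually typed.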
+ ## Citation
+
+ If you find my work helpful, feel free to cite it.
+
+ ```
+
+