tsunemoto commited on
Commit
5067634
·
verified ·
1 Parent(s): d808b72

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,17 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ nexonimbus-7b.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ nexonimbus-7b.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ nexonimbus-7b.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ nexonimbus-7b.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ nexonimbus-7b.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ nexonimbus-7b.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
42
+ nexonimbus-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
43
+ nexonimbus-7b.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
44
+ nexonimbus-7b.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
45
+ nexonimbus-7b.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
46
+ nexonimbus-7b.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
47
+ nexonimbus-7b.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
48
+ nexonimbus-7b.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
49
+ nexonimbus-7b.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: "NexoNimbus-7B Quantized in GGUF"
3
+ tags:
4
+ - GGUF
5
+ language: en
6
+ ---
7
+ ![Image description](https://i.postimg.cc/MGwhtFfF/tsune-fixed.png)
8
+
9
+ # Tsunemoto GGUF's of NexoNimbus-7B
10
+
11
+ This is a GGUF quantization of NexoNimbus-7B.
12
+
13
+ ## Original Repo Link:
14
+ [Original Repository](https://huggingface.co/abideen/NexoNimbus-7B)
15
+
16
+ ## Original Model Card:
17
+ ---
18
+
19
+ # NexoNimbus-7B
20
+
21
+
22
+
23
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e380b2e12618b261fa6ba0/9lIzCPqDYR6nnLgoH6kMp.png)
24
+
25
+
26
+ NexoNimbus-7B is a merge of the following models:
27
+ * [abideen/DareVox-7B](https://huggingface.co/abideen/DareVox-7B)
28
+ * [udkai/Garrulus](https://huggingface.co/udkai/Garrulus)
29
+
30
+ 🏆 Evaluation
31
+ NexoNimbus-7B is the 5th best-performing 7B LLM on the Open LLM Leaderboard:
32
+
33
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e380b2e12618b261fa6ba0/MIkOaXVGJ0T5UVYIEhtYA.png)
34
+
35
+
36
+ | Task |Version| Metric |Value| |Stderr|
37
+ |-------------|------:|--------|----:|---|-----:|
38
+ |arc_challenge| 0|acc |68.25|± | 1.36|
39
+ | | |acc_norm|70.81|± | 1.38|
40
+ |hellaswag | 0|acc |70.86|± | 0.45|
41
+ | | |acc_norm|87.86|± | 0.32|
42
+ |gsm8k | 0|acc |70.35|± | 1.25|
43
+ |winogrande | 0|acc |84.84|± | 1.00|
44
+ |mmlu | 0|acc |64.69|± | 1.00|
45
+
46
+ Average: 73.5%
47
+
48
+ ### TruthfulQA
49
+ | Task |Version|Metric|Value| |Stderr|
50
+ |-------------|------:|------|----:|---|-----:|
51
+ |truthfulqa_mc| 1|mc1 |46.26|± | 1.74|
52
+ | | |mc2 |62.42|± | 1.54|
53
+
54
+
55
+ ## 🧩 Configuration
56
+
57
+ ```yaml
58
+ slices:
59
+ - sources:
60
+ - model: abideen/DareVox-7B
61
+ layer_range: [0, 32]
62
+ - model: udkai/Garrulus
63
+ layer_range: [0, 32]
64
+ merge_method: slerp
65
+ base_model: abideen/DareVox-7B
66
+ parameters:
67
+ t:
68
+ - filter: self_attn
69
+ value: [0, 0.5, 0.3, 0.7, 1]
70
+ - filter: mlp
71
+ value: [1, 0.5, 0.7, 0.3, 0]
72
+ - value: 0.5
73
+ dtype: bfloat16
74
+
75
+ ```
76
+
77
+ ## 💻 Usage
78
+
79
+ Here's a [Colab notebook](https://colab.research.google.com/drive/1F9lzL1IeZRMgiSbY9UbgCR__RreIflJh?usp=sharing) to run NexoNimbus-7B in 4-bit precision on a free T4 GPU.
80
+
81
+ ```python
82
+ !pip install -qU transformers accelerate
83
+
84
+ from transformers import AutoTokenizer
85
+ import transformers
86
+ import torch
87
+
88
+ model = "abideen/NexoNimbus-7B"
89
+ messages = [{"role": "user", "content": "Explain what is Machine learning."}]
90
+
91
+ tokenizer = AutoTokenizer.from_pretrained(model)
92
+ prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
93
+ pipeline = transformers.pipeline(
94
+ "text-generation",
95
+ model=model,
96
+ torch_dtype=torch.float16,
97
+ device_map="auto",
98
+ )
99
+
100
+ outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
101
+ print(outputs[0]["generated_text"])
102
+ ```
103
+
104
+ "Machine learning is a subfield of artificial intelligence that focuses on developing algorithms and models that allow computers to learn and improve their performance over time, without being explicitly programmed. It involves the use of statistical techniques and data analysis to identify patterns and make predictions based on input data.
105
+ In machine learning, data is fed into a model, which then adjusts its internal parameters to minimize the difference between the predicted output and the actual output. This process is called training, and as the model is exposed to more data, it becomes better at making predictions or classifications.
106
+ Machine learning can be divided into several categories, including supervised learning, unsupervised learning, and reinforcement learning. Supervised learning involves using labeled data, where the desired output is known, and the model learns to map inputs to outputs. Unsupervised learning, on the other hand, does not have a predefined output, and the model learns to identify patterns or relationships within the data. Reinforcement learning involves learning through trial and error, with the model receiving feedback in the form of rewards or penalties based on its actions.
107
+ Some common applications of machine learning include image recognition, natural language processing, recommendation systems, fraud detection, and self-driving."
nexonimbus-7b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:24908261edae2c198762440dc69fb8fea5c546c35c15087fabd3a0676aed5c62
3
+ size 3084034048
nexonimbus-7b.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f2e6358ea7b1911a81b2a8162f1709cd9294598d740e95653462c077912d66e
3
+ size 3822960640
nexonimbus-7b.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:82228f5006e8860a6a005b0dfea4a3162699c8bf73c16f6f3b5f04930719ebab
3
+ size 3519922176
nexonimbus-7b.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a377c635daada6ae3e33ce18bf403cd31b9cbfaa4fc2ad57a8b2ea36c556aac1
3
+ size 3165503488
nexonimbus-7b.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df6d8334cb4386622deeaa8f104030aa5b82a47e866fcd7dcf511d8c9caeed74
3
+ size 4109852672
nexonimbus-7b.Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7f2324cd3337a49f0a3e34b73af27584ea0f4c50eb14e2f2ddeddffc99ed4f1
3
+ size 4554252288
nexonimbus-7b.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:625474567deff612f5d7f0fdc96b9afb4709f1adad00a8909d9c59b6225fbd0d
3
+ size 4369375232
nexonimbus-7b.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f2a8c9bbea58d303a11b2448436beaa7bd03feeb28048534feee90503b641953
3
+ size 4141309952
nexonimbus-7b.Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9969b5f21935ea2030f10f19b0db312701f1d6e7ce7e3f31c9a36b9cbe6758e6
3
+ size 4998651904
nexonimbus-7b.Q5_1.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ccd5b1f14b0b90f05882850be79421f96fdfa8b41878abe7e409d01c30e7b387
3
+ size 5443051520
nexonimbus-7b.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:609feb8159870f0c67cc9e0c6ff2e125b85b30dc4cb147e4073cecae769dd103
3
+ size 5132345344
nexonimbus-7b.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f9436a724138802d3764eb84b7597f9e24a1e9b1573c676b232e099c9b58ffd
3
+ size 4998651904
nexonimbus-7b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:55087e5ff8ba233df4b796bbcb5a7a076bd78a9b8283c04efb8b134482242011
3
+ size 5943001088
nexonimbus-7b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b687de85326bbe5db62325cd8080948e8994fb7233a493bd3b63829a0aaddc7
3
+ size 7696793600