Xois
/

Text Generation
Xois commited on
Commit
ee44bf2
·
verified ·
1 Parent(s): cc86af5

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +351 -0
README.md ADDED
@@ -0,0 +1,351 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - fka/awesome-chatgpt-prompts
5
+ - agibot-world/AgiBotWorld-Alpha
6
+ language:
7
+ - aa
8
+ - ab
9
+ - ae
10
+ - af
11
+ - ak
12
+ - am
13
+ - an
14
+ - ar
15
+ - as
16
+ - av
17
+ - ay
18
+ - az
19
+ - ba
20
+ - be
21
+ - bg
22
+ - bh
23
+ - bi
24
+ - bm
25
+ - bn
26
+ - bo
27
+ - br
28
+ - bs
29
+ - ca
30
+ - ce
31
+ - ch
32
+ - co
33
+ - cr
34
+ - cs
35
+ - cu
36
+ - cv
37
+ - cy
38
+ - da
39
+ - de
40
+ - dv
41
+ - dz
42
+ - ee
43
+ - el
44
+ - en
45
+ - eo
46
+ - es
47
+ - et
48
+ - eu
49
+ - fa
50
+ - ff
51
+ - fi
52
+ - fj
53
+ - fo
54
+ - fr
55
+ - fy
56
+ - ga
57
+ - gd
58
+ - gl
59
+ - gn
60
+ - gu
61
+ - gv
62
+ - ha
63
+ - he
64
+ - hi
65
+ - ho
66
+ - hr
67
+ - ht
68
+ - hu
69
+ - hy
70
+ - hz
71
+ - ia
72
+ - id
73
+ - ie
74
+ - ig
75
+ - ii
76
+ - ik
77
+ - io
78
+ - is
79
+ - it
80
+ - iu
81
+ - ja
82
+ - jv
83
+ - ka
84
+ - kg
85
+ - ki
86
+ - kj
87
+ - kk
88
+ - km
89
+ - kl
90
+ - kn
91
+ - ko
92
+ - kr
93
+ - ks
94
+ - ku
95
+ - kv
96
+ - kw
97
+ - ky
98
+ - la
99
+ - lb
100
+ - lg
101
+ - li
102
+ - ln
103
+ - lo
104
+ - lu
105
+ - lt
106
+ - lv
107
+ - mg
108
+ - mh
109
+ - mk
110
+ - ml
111
+ - mn
112
+ - mr
113
+ - ms
114
+ - mt
115
+ - my
116
+ - na
117
+ - nb
118
+ - nd
119
+ - ne
120
+ - ng
121
+ - nl
122
+ - mi
123
+ - 'no'
124
+ - nn
125
+ - nr
126
+ - nv
127
+ - ny
128
+ - oc
129
+ - oj
130
+ - om
131
+ - or
132
+ - os
133
+ - pa
134
+ - pi
135
+ - pl
136
+ - ps
137
+ - pt
138
+ - qu
139
+ - rm
140
+ - rn
141
+ - ro
142
+ - ru
143
+ - rw
144
+ - sa
145
+ - sc
146
+ - sd
147
+ - se
148
+ metrics:
149
+ - accuracy
150
+ base_model:
151
+ - deepseek-ai/DeepSeek-V3
152
+ - deepseek-ai/DeepSeek-V3-Base
153
+ - DevQuasar/deepseek-ai.DeepSeek-V3-Base-GGUF
154
+ - meta-llama/Llama-3.3-70B-Instruct
155
+ new_version: meta-llama/Llama-3.3-70B-Instruct
156
+ pipeline_tag: text-generation
157
+ ---
158
+ # Model Card for Model ID
159
+
160
+ <!-- Provide a quick summary of what the model is/does. -->
161
+
162
+ This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
163
+
164
+ ## Model Details
165
+
166
+ ### Model Description
167
+
168
+ <!-- Provide a longer summary of what this model is. -->
169
+
170
+
171
+
172
+ - **Developed by:** [More Information Needed]
173
+ - **Funded by [optional]:** [More Information Needed]
174
+ - **Shared by [optional]:** [More Information Needed]
175
+ - **Model type:** [More Information Needed]
176
+ - **Language(s) (NLP):** [More Information Needed]
177
+ - **License:** [More Information Needed]
178
+ - **Finetuned from model [optional]:** [More Information Needed]
179
+
180
+ ### Model Sources [optional]
181
+
182
+ <!-- Provide the basic links for the model. -->
183
+
184
+ - **Repository:** [More Information Needed]
185
+ - **Paper [optional]:** [More Information Needed]
186
+ - **Demo [optional]:** [More Information Needed]
187
+
188
+ ## Uses
189
+
190
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
191
+
192
+ ### Direct Use
193
+
194
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
195
+
196
+ [More Information Needed]
197
+
198
+ ### Downstream Use [optional]
199
+
200
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
201
+
202
+ [More Information Needed]
203
+
204
+ ### Out-of-Scope Use
205
+
206
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
207
+
208
+ [More Information Needed]
209
+
210
+ ## Bias, Risks, and Limitations
211
+
212
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
213
+
214
+ [More Information Needed]
215
+
216
+ ### Recommendations
217
+
218
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
219
+
220
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
221
+
222
+ ## How to Get Started with the Model
223
+
224
+ Use the code below to get started with the model.
225
+
226
+ [More Information Needed]
227
+
228
+ ## Training Details
229
+
230
+ ### Training Data
231
+
232
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
233
+
234
+ [More Information Needed]
235
+
236
+ ### Training Procedure
237
+
238
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
239
+
240
+ #### Preprocessing [optional]
241
+
242
+ [More Information Needed]
243
+
244
+
245
+ #### Training Hyperparameters
246
+
247
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
248
+
249
+ #### Speeds, Sizes, Times [optional]
250
+
251
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
252
+
253
+ [More Information Needed]
254
+
255
+ ## Evaluation
256
+
257
+ <!-- This section describes the evaluation protocols and provides the results. -->
258
+
259
+ ### Testing Data, Factors & Metrics
260
+
261
+ #### Testing Data
262
+
263
+ <!-- This should link to a Dataset Card if possible. -->
264
+
265
+ [More Information Needed]
266
+
267
+ #### Factors
268
+
269
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
270
+
271
+ [More Information Needed]
272
+
273
+ #### Metrics
274
+
275
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
276
+
277
+ [More Information Needed]
278
+
279
+ ### Results
280
+
281
+ [More Information Needed]
282
+
283
+ #### Summary
284
+
285
+
286
+
287
+ ## Model Examination [optional]
288
+
289
+ <!-- Relevant interpretability work for the model goes here -->
290
+
291
+ [More Information Needed]
292
+
293
+ ## Environmental Impact
294
+
295
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
296
+
297
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
298
+
299
+ - **Hardware Type:** [More Information Needed]
300
+ - **Hours used:** [More Information Needed]
301
+ - **Cloud Provider:** [More Information Needed]
302
+ - **Compute Region:** [More Information Needed]
303
+ - **Carbon Emitted:** [More Information Needed]
304
+
305
+ ## Technical Specifications [optional]
306
+
307
+ ### Model Architecture and Objective
308
+
309
+ [More Information Needed]
310
+
311
+ ### Compute Infrastructure
312
+
313
+ [More Information Needed]
314
+
315
+ #### Hardware
316
+
317
+ [More Information Needed]
318
+
319
+ #### Software
320
+
321
+ [More Information Needed]
322
+
323
+ ## Citation [optional]
324
+
325
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
326
+
327
+ **BibTeX:**
328
+
329
+ [More Information Needed]
330
+
331
+ **APA:**
332
+
333
+ [More Information Needed]
334
+
335
+ ## Glossary [optional]
336
+
337
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
338
+
339
+ [More Information Needed]
340
+
341
+ ## More Information [optional]
342
+
343
+ [More Information Needed]
344
+
345
+ ## Model Card Authors [optional]
346
+
347
+ [More Information Needed]
348
+
349
+ ## Model Card Contact
350
+
351
+ [More Information Needed]