--- datasets: - bigscience/xP3 - mc4 license: apache-2.0 language: - af - am - ar - az - be - bg - bn - ca - ceb - co - cs - cy - da - de - el - en - eo - es - et - eu - fa - fi - fil - fr - fy - ga - gd - gl - gu - ha - haw - hi - hmn - ht - hu - hy - ig - is - it - iw - ja - jv - ka - kk - km - kn - ko - ku - ky - la - lb - lo - lt - lv - mg - mi - mk - ml - mn - mr - ms - mt - my - ne - nl - 'no' - ny - pa - pl - ps - pt - ro - ru - sd - si - sk - sl - sm - sn - so - sq - sr - st - su - sv - sw - ta - te - tg - th - tr - uk - und - ur - uz - vi - xh - yi - yo - zh - zu pipeline_tag: text2text-generation widget: - text: 一个传奇的开端,一个不灭的神话,这不仅仅是一部电影,而是作为一个走进新时代的标签,永远彪炳史册。Would you rate the previous review as positive, neutral or negative? example_title: zh-en sentiment - text: 一个传奇的开端,一个不灭的神话,这不仅仅是一部电影,而是作为一个走进新时代的标签,永远彪炳史册。你认为这句话的立场是赞扬、中立还是批评? example_title: zh-zh sentiment - text: Suggest at least five related search terms to "Mạng neural nhân tạo". example_title: vi-en query - text: Proposez au moins cinq mots clés concernant «Réseau de neurones artificiels». example_title: fr-fr query - text: Explain in a sentence in Telugu what is backpropagation in neural networks. example_title: te-en qa - text: Why is the sky blue? example_title: en-en qa - text: 'Write a fairy tale about a troll saving a princess from a dangerous dragon. The fairy tale is a masterpiece that has achieved praise worldwide and its moral is "Heroes Come in All Shapes and Sizes". Story (in Spanish):' example_title: es-en fable - text: 'Write a fable about wood elves living in a forest that is suddenly invaded by ogres. The fable is a masterpiece that has achieved praise worldwide and its moral is "Violence is the last refuge of the incompetent". Fable (in Hindi):' example_title: hi-en fable tags: - llama-cpp - gguf-my-repo base_model: bigscience/mt0-large model-index: - name: mt0-large results: - task: type: Coreference resolution dataset: name: Winogrande XL (xl) type: winogrande config: xl split: validation revision: a80f460359d1e9a67c006011c94de42a8759430c metrics: - type: Accuracy value: 51.78 - task: type: Coreference resolution dataset: name: XWinograd (en) type: Muennighoff/xwinograd config: en split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 54.8 - task: type: Coreference resolution dataset: name: XWinograd (fr) type: Muennighoff/xwinograd config: fr split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 56.63 - task: type: Coreference resolution dataset: name: XWinograd (jp) type: Muennighoff/xwinograd config: jp split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 53.08 - task: type: Coreference resolution dataset: name: XWinograd (pt) type: Muennighoff/xwinograd config: pt split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 56.27 - task: type: Coreference resolution dataset: name: XWinograd (ru) type: Muennighoff/xwinograd config: ru split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 55.56 - task: type: Coreference resolution dataset: name: XWinograd (zh) type: Muennighoff/xwinograd config: zh split: test revision: 9dd5ea5505fad86b7bedad667955577815300cee metrics: - type: Accuracy value: 54.37 - task: type: Natural language inference dataset: name: ANLI (r1) type: anli config: r1 split: validation revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094 metrics: - type: Accuracy value: 33.3 - task: type: Natural language inference dataset: name: ANLI (r2) type: anli config: r2 split: validation revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094 metrics: - type: Accuracy value: 34.7 - task: type: Natural language inference dataset: name: ANLI (r3) type: anli config: r3 split: validation revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094 metrics: - type: Accuracy value: 34.75 - task: type: Natural language inference dataset: name: SuperGLUE (cb) type: super_glue config: cb split: validation revision: 9e12063561e7e6c79099feb6d5a493142584e9e2 metrics: - type: Accuracy value: 51.79 - task: type: Natural language inference dataset: name: SuperGLUE (rte) type: super_glue config: rte split: validation revision: 9e12063561e7e6c79099feb6d5a493142584e9e2 metrics: - type: Accuracy value: 64.26 - task: type: Natural language inference dataset: name: XNLI (ar) type: xnli config: ar split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 42.61 - task: type: Natural language inference dataset: name: XNLI (bg) type: xnli config: bg split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 43.94 - task: type: Natural language inference dataset: name: XNLI (de) type: xnli config: de split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 44.18 - task: type: Natural language inference dataset: name: XNLI (el) type: xnli config: el split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 43.94 - task: type: Natural language inference dataset: name: XNLI (en) type: xnli config: en split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 44.26 - task: type: Natural language inference dataset: name: XNLI (es) type: xnli config: es split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 45.34 - task: type: Natural language inference dataset: name: XNLI (fr) type: xnli config: fr split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 42.01 - task: type: Natural language inference dataset: name: XNLI (hi) type: xnli config: hi split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 41.89 - task: type: Natural language inference dataset: name: XNLI (ru) type: xnli config: ru split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 42.13 - task: type: Natural language inference dataset: name: XNLI (sw) type: xnli config: sw split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 40.08 - task: type: Natural language inference dataset: name: XNLI (th) type: xnli config: th split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 40.8 - task: type: Natural language inference dataset: name: XNLI (tr) type: xnli config: tr split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 41.29 - task: type: Natural language inference dataset: name: XNLI (ur) type: xnli config: ur split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 39.88 - task: type: Natural language inference dataset: name: XNLI (vi) type: xnli config: vi split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 41.81 - task: type: Natural language inference dataset: name: XNLI (zh) type: xnli config: zh split: validation revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16 metrics: - type: Accuracy value: 40.84 - task: type: Sentence completion dataset: name: StoryCloze (2016) type: story_cloze config: '2016' split: validation revision: e724c6f8cdf7c7a2fb229d862226e15b023ee4db metrics: - type: Accuracy value: 59.49 - task: type: Sentence completion dataset: name: SuperGLUE (copa) type: super_glue config: copa split: validation revision: 9e12063561e7e6c79099feb6d5a493142584e9e2 metrics: - type: Accuracy value: 65 - task: type: Sentence completion dataset: name: XCOPA (et) type: xcopa config: et split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 56 - task: type: Sentence completion dataset: name: XCOPA (ht) type: xcopa config: ht split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 62 - task: type: Sentence completion dataset: name: XCOPA (id) type: xcopa config: id split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 61 - task: type: Sentence completion dataset: name: XCOPA (it) type: xcopa config: it split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 63 - task: type: Sentence completion dataset: name: XCOPA (qu) type: xcopa config: qu split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 57 - task: type: Sentence completion dataset: name: XCOPA (sw) type: xcopa config: sw split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 54 - task: type: Sentence completion dataset: name: XCOPA (ta) type: xcopa config: ta split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 62 - task: type: Sentence completion dataset: name: XCOPA (th) type: xcopa config: th split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 57 - task: type: Sentence completion dataset: name: XCOPA (tr) type: xcopa config: tr split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 57 - task: type: Sentence completion dataset: name: XCOPA (vi) type: xcopa config: vi split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 63 - task: type: Sentence completion dataset: name: XCOPA (zh) type: xcopa config: zh split: validation revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187 metrics: - type: Accuracy value: 58 - task: type: Sentence completion dataset: name: XStoryCloze (ar) type: Muennighoff/xstory_cloze config: ar split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 56.59 - task: type: Sentence completion dataset: name: XStoryCloze (es) type: Muennighoff/xstory_cloze config: es split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 55.72 - task: type: Sentence completion dataset: name: XStoryCloze (eu) type: Muennighoff/xstory_cloze config: eu split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 52.61 - task: type: Sentence completion dataset: name: XStoryCloze (hi) type: Muennighoff/xstory_cloze config: hi split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 52.15 - task: type: Sentence completion dataset: name: XStoryCloze (id) type: Muennighoff/xstory_cloze config: id split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 54.67 - task: type: Sentence completion dataset: name: XStoryCloze (my) type: Muennighoff/xstory_cloze config: my split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 51.69 - task: type: Sentence completion dataset: name: XStoryCloze (ru) type: Muennighoff/xstory_cloze config: ru split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 53.74 - task: type: Sentence completion dataset: name: XStoryCloze (sw) type: Muennighoff/xstory_cloze config: sw split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 55.53 - task: type: Sentence completion dataset: name: XStoryCloze (te) type: Muennighoff/xstory_cloze config: te split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 57.18 - task: type: Sentence completion dataset: name: XStoryCloze (zh) type: Muennighoff/xstory_cloze config: zh split: validation revision: 8bb76e594b68147f1a430e86829d07189622b90d metrics: - type: Accuracy value: 59.5 --- # cstr/mt0-large-Q4_K_M-GGUF This model was converted to GGUF format from [`bigscience/mt0-large`](https://huggingface.co/bigscience/mt0-large) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space. Refer to the [original model card](https://huggingface.co/bigscience/mt0-large) for more details on the model. ## Use with llama.cpp Install llama.cpp through brew (works on Mac and Linux) ```bash brew install llama.cpp ``` Invoke the llama.cpp server or the CLI. ### CLI: ```bash llama-cli --hf-repo cstr/mt0-large-Q4_K_M-GGUF --hf-file mt0-large-q4_k_m.gguf -p "The meaning to life and the universe is" ``` ### Server: ```bash llama-server --hf-repo cstr/mt0-large-Q4_K_M-GGUF --hf-file mt0-large-q4_k_m.gguf -c 2048 ``` Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well. Step 1: Clone llama.cpp from GitHub. ``` git clone https://github.com/ggerganov/llama.cpp ``` Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux). ``` cd llama.cpp && LLAMA_CURL=1 make ``` Step 3: Run inference through the main binary. ``` ./llama-cli --hf-repo cstr/mt0-large-Q4_K_M-GGUF --hf-file mt0-large-q4_k_m.gguf -p "The meaning to life and the universe is" ``` or ``` ./llama-server --hf-repo cstr/mt0-large-Q4_K_M-GGUF --hf-file mt0-large-q4_k_m.gguf -c 2048 ```