This model is a merge of pre-trained language models, created with the DARE method using mergekit.

A fuller description of the model will be added soon.
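
For readers unfamiliar with DARE, the core idea is "drop and rescale": for each fine-tuned model, take the difference from the base weights (the delta), randomly zero out a fraction p of the delta entries, and scale the survivors by 1 / (1 - p) before adding the deltas back onto the base. The following is a minimal sketch of that idea on plain Python lists. It is not the mergekit implementation, and the function names, the drop rate of 0.5, and the plain sum of deltas are all assumptions made for illustration; the actual merge configuration for this model is not stated here.

```python
import random


def dare_delta(base, finetuned, drop_rate, rng):
    """Return the sparsified, rescaled delta for one flat list of weights."""
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    keep_scale = 1.0 / (1.0 - drop_rate)
    sparse = []
    for b, f in zip(base, finetuned):
        delta = f - b
        # Drop the delta with probability drop_rate, otherwise rescale it
        # so the expected value of the delta is unchanged.
        sparse.append(0.0 if rng.random() < drop_rate else delta * keep_scale)
    return sparse


def dare_merge(base, finetuned_models, drop_rate=0.5, seed=0):
    """Add the DARE-processed deltas of several models onto the base weights.

    Assumption: deltas are summed with weight 1 each (hypothetical choice).
    """
    rng = random.Random(seed)
    merged = list(base)
    for ft in finetuned_models:
        for i, d in enumerate(dare_delta(base, ft, drop_rate, rng)):
            merged[i] += d
    return merged


if __name__ == "__main__":
    base = [1.0, 2.0, 3.0, 4.0]
    model_a = [1.5, 2.0, 3.0, 4.5]
    model_b = [1.0, 3.0, 2.0, 4.0]
    print(dare_merge(base, [model_a, model_b], drop_rate=0.5))
```

With `drop_rate=0.0` nothing is dropped and the result reduces to adding the full deltas to the base, which is a quick way to sanity-check the sketch.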

Loading the Model

Use the following Python code to load the model:

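import torch
from transformers import MistralForCausalLM, AutoTokenizer

# The checkpoint is stored in BF16, so load it in that dtype.
# device_map="auto" requires the accelerate package.
model = MistralForCausalLM.from_pretrained(
    "vanillaOVO/supermario_v3", torch_dtype=torch.bfloat16, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("vanillaOVO/supermario_v3")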
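
Before loading, it can help to estimate how much memory the weights alone will need. The sketch below assumes the figures listed for this model (7.24B parameters stored in BF16, i.e. 2 bytes per parameter); the function name is hypothetical, and the estimate ignores activations, the KV cache, and framework overhead.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights in GiB (weights only)."""
    return num_params * bytes_per_param / (1024 ** 3)


if __name__ == "__main__":
    # 7.24B parameters in BF16 (2 bytes each)
    print(f"{weight_memory_gib(7.24e9):.2f} GiB")
```

Under these assumptions the weights take roughly 13.5 GiB, so a device with less memory would need offloading or a lower-precision load.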

Generating Text

To generate text, use the following Python code:

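text = "Large language models are "
# Move the inputs to the model's device; with device_map="auto" the model may be on a GPU.
inputs = tokenizer(text, return_tensors="pt").to(model.device)

outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))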

Model size: 7.24B params. Tensor type: BF16 (Safetensors).