MMoshtaghi/Pixtral-12B-2409-LoRAAdpt-General

@@ -4,8 +4,9 @@ tags:
 - text-generation-inference
 - transformers
 - unsloth
-- llava
 - trl
 license: apache-2.0
 language:
 - en
@@ -16,7 +17,58 @@ language:
 - **Developed by:** MMoshtaghi
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/pixtral-12b-2409-unsloth-bnb-4bit
-This llava model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 - text-generation-inference
 - transformers
 - unsloth
+- qwen2_vl
 - trl
+- qlora
 license: apache-2.0
 language:
 - en
 - **Developed by:** MMoshtaghi
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/pixtral-12b-2409-unsloth-bnb-4bit
+- **Finetuned on dataset:** [unsloth/llava-instruct-mix-vsft-mini](https://huggingface.co/datasets/unsloth/llava-instruct-mix-vsft-mini)
+- **PEFT method :** [Quantized LoRA](https://huggingface.co/papers/2305.14314)
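As a rough illustration of why a LoRA adapter stays small: LoRA freezes each adapted weight matrix (kept in 4-bit form under QLoRA) and trains only two low-rank factors, B (d_out x r) and A (r x d_in). The sketch below counts trainable parameters for a single matrix. The layer shape and rank are illustrative assumptions; this card does not state the adapter's rank or its target modules.

```python
def lora_trainable_params(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in the low-rank factors B (d_out x rank) and A (rank x d_in)."""
    return rank * (d_out + d_in)


def full_params(d_out: int, d_in: int) -> int:
    """Parameters in the frozen full-size weight matrix."""
    return d_out * d_in


# Hypothetical layer shape and rank, chosen only for illustration.
d_out, d_in, rank = 4096, 4096, 16
lora = lora_trainable_params(d_out, d_in, rank)
full = full_params(d_out, d_in)
print(f"LoRA: {lora:,} trainable vs {full:,} frozen ({lora / full:.2%})")
```

The ratio shrinks as the matrix grows, since the trainable count scales with `d_out + d_in` while the frozen count scales with `d_out * d_in`.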
+## Quick start
+```python
+# Unsloth recommends importing it before transformers so its patches apply.
+from unsloth import FastVisionModel
+from datasets import load_dataset
+from transformers import TextStreamer
+# Load the fine-tuned model in 4-bit. For vision models the second return
+# value is used below to process both the image and the text.
+model, tokenizer = FastVisionModel.from_pretrained(
+    model_name = "MMoshtaghi/Pixtral-12B-2409-LoRAAdpt-General",
+    load_in_4bit = True,
+)
+FastVisionModel.for_inference(model) # Enable for inference!
+# Take one sample image from the fine-tuning dataset.
+dataset = load_dataset("unsloth/llava-instruct-mix-vsft-mini", split = "train")
+image = dataset[2]["images"][0]
+instruction = "Is there something interesting about this image?"
+# The {"type": "image"} entry marks where the image goes in the prompt.
+messages = [
+    {"role": "user", "content": [
+        {"type": "image"},
+        {"type": "text", "text": instruction}
+    ]}
+]
+input_text = tokenizer.apply_chat_template(messages, add_generation_prompt = True)
+# The chat template already inserts special tokens, so do not add them again.
+inputs = tokenizer(
+    image,
+    input_text,
+    add_special_tokens = False,
+    return_tensors = "pt",
+).to("cuda")
+# Stream generated tokens to stdout as they are produced.
+text_streamer = TextStreamer(tokenizer, skip_prompt = True)
+_ = model.generate(**inputs, streamer = text_streamer, max_new_tokens = 64,
+                   use_cache = True, temperature = 1.5, min_p = 0.1)
+```
+### Framework versions
+- TRL: 0.13.0
+- Transformers: 4.47.1
+- Pytorch: 2.5.1+cu121
+- Datasets: 3.2.0
+- Tokenizers: 0.21.0
+- Unsloth: 2025.1.5
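To reproduce the environment, it can help to compare the installed packages against the versions listed above. The following is a minimal sketch using only the standard library. The mapping from the listed names to PyPI distribution names (for example, Pytorch to `torch`) is an assumption, and the helper names are hypothetical. A mismatch does not necessarily mean the model will fail to load.

```python
from importlib import metadata

# Versions listed in this model card, keyed by assumed PyPI distribution name.
EXPECTED = {
    "trl": "0.13.0",
    "transformers": "4.47.1",
    "torch": "2.5.1+cu121",
    "datasets": "3.2.0",
    "tokenizers": "0.21.0",
    "unsloth": "2025.1.5",
}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(expected, lookup=installed_version):
    """Map each package whose installed version differs to (expected, found)."""
    mismatches = {}
    for name, wanted in expected.items():
        found = lookup(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(EXPECTED).items():
        print(f"{name}: card lists {wanted}, found {found or 'not installed'}")
```

The `lookup` parameter exists only so the comparison can be exercised without the packages installed; in normal use the default is enough.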
+## Training procedure
+(A Weights & Biases login is required to view the run.)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/open_ai/huggingface/runs/8juqyo5h)
+## Citations
+This VLM was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.