starsofchance committed
Commit 0d66688 · verified · 1 Parent(s): 7e4d3fd

Add model card

Files changed (1):
  1. README.md (added, +306 -0)
---
base_model: mistralai/Ministral-8B-Instruct-2410
tags:
- unsloth
- lora
- qlora
- vulnerability-detection
- security
- code-analysis
- cybersecurity
- ultival
- peft
- adapter
language:
- en
license: apache-2.0
library_name: peft
pipeline_tag: text-generation
---

# UltiVal: Ministral-8B QLoRA Adapter for Vulnerability Detection

This is a **QLoRA adapter** fine-tuned from **Ministral-8B-Instruct-2410** for detecting security vulnerabilities in source code as part of the **UltiVal** project.

## 🚨 Important Note

This is a **LoRA adapter**, not a standalone model. You must load it together with the base model `mistralai/Ministral-8B-Instruct-2410`.

## 📋 Model Details

- **Base Model**: `mistralai/Ministral-8B-Instruct-2410`
- **Adapter Type**: QLoRA (4-bit Low-Rank Adaptation)
- **Training Framework**: Unsloth
- **Task**: Security vulnerability detection in source code
- **Model Size**: ~334MB (adapter only)
- **Context Length**: 2048 tokens
- **Languages**: Multi-language code analysis (Python, JavaScript, Java, C/C++, etc.)

## 🎯 Training Configuration

| Parameter | Value |
|-----------|-------|
| **Training Steps** | 6,000 (best checkpoint) |
| **Total Steps** | 6,184 |
| **Validation Loss** | 0.5840 (lowest, achieved at step 6,000) |
| **Final Training Loss** | 0.4081 |
| **Epochs** | 2 |
| **Learning Rate** | 2e-4 → 1.76e-7 (cosine schedule) |
| **Batch Size** | 8 effective (2 per device × 4 gradient-accumulation steps) |
| **Sequence Length** | 2048 tokens |
| **LoRA Rank** | 32 |
| **LoRA Alpha** | 32 |
| **LoRA Dropout** | 0.0 |
| **Weight Decay** | 0.01 |
| **Warmup Steps** | ~5% of total steps |

### Target Modules
```
q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
```

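As a rough illustration of how the table above maps onto a PEFT configuration, the equivalent `LoraConfig` would look approximately like this. This is a sketch, not the exact training script; the authoritative values live in `adapter_config.json` in this repository.

```python
from peft import LoraConfig

# Approximate reconstruction of the adapter settings from the table above;
# check adapter_config.json for the values actually shipped with the adapter.
lora_config = LoraConfig(
    r=32,              # LoRA rank
    lora_alpha=32,     # LoRA alpha (scaling)
    lora_dropout=0.0,  # no dropout on the LoRA layers
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ],
)
```
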
## 🔧 Usage

### Option 1: Using Unsloth (Recommended)

```python
from unsloth import FastLanguageModel
import torch

# Load the base model in 4-bit
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="mistralai/Ministral-8B-Instruct-2410",
    max_seq_length=2048,
    dtype=None,
    load_in_4bit=True,
)

# Add the LoRA configuration (must match the adapter: r=32, alpha=32)
model = FastLanguageModel.get_peft_model(
    model,
    r=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    lora_alpha=32,
    lora_dropout=0,
    bias="none",
    use_gradient_checkpointing="unsloth",
    random_state=3407,
)

# Load the trained weights into the "default" adapter created above
model.load_adapter("starsofchance/Mistral-Unsloth-QLoRA-adapter", adapter_name="default")

# Enable inference mode
FastLanguageModel.for_inference(model)
```

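Depending on your Unsloth version, you may be able to skip the manual `get_peft_model` step entirely: pointing `FastLanguageModel.from_pretrained` at the adapter repository itself typically resolves the base model from `adapter_config.json` and attaches the trained weights in one call. A minimal sketch, not verified against every Unsloth release:

```python
from unsloth import FastLanguageModel

# Load base model + this LoRA adapter in one step
# (Unsloth reads the base model name from adapter_config.json)
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="starsofchance/Mistral-Unsloth-QLoRA-adapter",
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)
```
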
### Option 2: Using Transformers + PEFT

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

# Load base model
base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Ministral-8B-Instruct-2410",
    torch_dtype=torch.float16,
    device_map="auto",
    load_in_4bit=True
)

tokenizer = AutoTokenizer.from_pretrained("mistralai/Ministral-8B-Instruct-2410")

# Load LoRA adapter
model = PeftModel.from_pretrained(base_model, "starsofchance/Mistral-Unsloth-QLoRA-adapter")
```

## 💻 Inference Example

```python
# Example: SQL Injection Detection
code_snippet = '''
def authenticate_user(username, password):
    query = "SELECT * FROM users WHERE username='" + username + "' AND password='" + password + "'"
    cursor.execute(query)
    return cursor.fetchone()
'''

messages = [
    {"role": "user", "content": f"Analyze this code for security vulnerabilities:\n\n{code_snippet}"}
]

# Tokenize and generate (greedy decoding)
input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt"
).to(model.device)

outputs = model.generate(
    input_ids,
    max_new_tokens=512,
    do_sample=False,
    pad_token_id=tokenizer.eos_token_id,
)

response = tokenizer.decode(outputs[0][len(input_ids[0]):], skip_special_tokens=True)
print(response)
```

### Expected Output
````
This code contains a critical SQL injection vulnerability. The user input (username and password)
is directly concatenated into the SQL query without any sanitization or parameterization.

**Vulnerability Type**: SQL Injection (CWE-89)
**Severity**: High
**Location**: Line 2, query construction

**How to exploit**: An attacker could input malicious SQL code like:
- Username: `admin' --`
- Password: `anything`

**Secure fix**: Use parameterized queries:
```python
def authenticate_user(username, password):
    query = "SELECT * FROM users WHERE username=? AND password=?"
    cursor.execute(query, (username, password))
    return cursor.fetchone()
```
````

## 🛡️ Supported Vulnerability Types

The model is trained to detect various security vulnerabilities including:

| Category | Examples |
|----------|----------|
| **Injection** | SQL Injection, Command Injection, LDAP Injection |
| **XSS** | Reflected XSS, Stored XSS, DOM-based XSS |
| **Authentication** | Weak passwords, Authentication bypass, Session management |
| **Authorization** | Privilege escalation, Access control issues |
| **Cryptography** | Weak encryption, Hardcoded keys, Improper random generation |
| **File Operations** | Path traversal, File inclusion, Unsafe deserialization |
| **Memory Safety** | Buffer overflow, Use after free, Memory leaks |
| **Web Security** | CSRF, SSRF, Insecure redirects |

## 📊 Performance Metrics

### Training Progress
- **Initial Loss**: 1.5544
- **Final Loss**: 0.4081
- **Best Validation Loss**: 0.5840 (step 6,000)
- **Training Duration**: ~15 hours
- **Convergence**: Stable convergence with cosine learning rate schedule

### Hardware Requirements
- **Training**: NVIDIA GPU with 4-bit quantization
- **Inference**: Can run on CPU or GPU (GPU recommended for speed)
- **Memory**: ~6GB GPU memory for inference with 4-bit quantization

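For reference, the ~6GB figure assumes the base model is loaded in 4-bit. A minimal loading sketch using an explicit `BitsAndBytesConfig` is shown below; the specific quantization settings (NF4, double quantization, fp16 compute) are illustrative defaults, not values taken from the training run.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Illustrative 4-bit quantization settings; not the exact training configuration.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.float16,
)

base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Ministral-8B-Instruct-2410",
    quantization_config=bnb_config,
    device_map="auto",
)
```
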
## 📁 Repository Structure

```
starsofchance/Mistral-Unsloth-QLoRA-adapter/
├── adapter_config.json         # LoRA configuration
├── adapter_model.safetensors   # Trained adapter weights (~334MB)
├── tokenizer.json              # Tokenizer configuration
├── tokenizer_config.json       # Tokenizer settings
├── special_tokens_map.json     # Special tokens mapping
└── README.md                   # This file
```

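If you want to confirm the adapter settings (rank, alpha, target modules, base model) without downloading any weights, a recent `peft` version can read `adapter_config.json` directly. A small sketch:

```python
from peft import PeftConfig

# Fetches only adapter_config.json from the Hub; no model weights are downloaded.
config = PeftConfig.from_pretrained("starsofchance/Mistral-Unsloth-QLoRA-adapter")
print(config.base_model_name_or_path)  # expected: mistralai/Ministral-8B-Instruct-2410
print(config.r, config.lora_alpha)     # expected: 32, 32
print(config.target_modules)
```
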
## ⚠️ Limitations

1. **Adapter Dependency**: Requires the base model to function
2. **Context Window**: Limited to 2048 tokens (a simple chunking sketch for longer files follows this list)
3. **Language Coverage**: Primarily trained on common programming languages
4. **False Positives**: May flag secure code patterns as potentially vulnerable
5. **Novel Vulnerabilities**: May not detect cutting-edge or highly obfuscated attacks
6. **Code Context**: Performance depends on having sufficient code context

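Because of the 2048-token window, long files have to be analyzed piece by piece. The helper below is a rough sketch, not part of the released tooling: it splits a file into overlapping line-based chunks sized with the tokenizer, leaving room for the prompt and the generated analysis. Each chunk can then be passed through the same `messages`/`generate` pattern shown in the inference example.

```python
def chunk_source(code: str, tokenizer, max_tokens: int = 1200, overlap_lines: int = 10):
    """Split source code into overlapping chunks that fit the context window.

    max_tokens is deliberately well below 2048 to leave room for the prompt and
    the generated analysis; both values are illustrative, not tuned.
    """
    lines = code.splitlines()
    chunks, start = [], 0
    while start < len(lines):
        end = start
        # Grow the chunk line by line until the token budget is reached
        while end < len(lines):
            candidate = "\n".join(lines[start:end + 1])
            if len(tokenizer(candidate)["input_ids"]) > max_tokens:
                break
            end += 1
        if end == start:  # a single line exceeds the budget; keep it anyway
            end = start + 1
        chunks.append("\n".join(lines[start:end]))
        if end >= len(lines):
            break
        start = max(end - overlap_lines, start + 1)  # overlap to preserve context
    return chunks
```
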
## 🔄 Integration Tips

### Batch Processing
```python
def analyze_multiple_files(code_files):
    results = []
    for file_path, code_content in code_files:
        # Analyze each file with the same chat-template / generate pattern as above
        messages = [{"role": "user", "content": f"Analyze for vulnerabilities:\n\n{code_content}"}]
        input_ids = tokenizer.apply_chat_template(
            messages, add_generation_prompt=True, return_tensors="pt"
        ).to(model.device)
        outputs = model.generate(input_ids, max_new_tokens=512, do_sample=False,
                                 pad_token_id=tokenizer.eos_token_id)
        response = tokenizer.decode(outputs[0][len(input_ids[0]):], skip_special_tokens=True)
        results.append({"file": file_path, "analysis": response})
    return results
```

### Custom Prompting
```python
# For specific vulnerability types
prompt = f"""
Focus on SQL injection vulnerabilities in this code:
{code_snippet}

Provide:
1. Vulnerability assessment (Yes/No)
2. Risk level (Low/Medium/High/Critical)
3. Specific location
4. Remediation steps
"""
```

## 📚 Training Data

The model was fine-tuned on a curated dataset featuring:
- **Real-world vulnerabilities** from CVE databases
- **Secure code patterns** for contrast learning
- **Multi-language examples** across different frameworks
- **Detailed explanations** with remediation guidance
- **Context-rich examples** showing vulnerabilities in realistic scenarios

## 🎓 Model Lineage

```
Ministral-8B-Instruct-2410 (Mistral AI)
                ↓
    QLoRA Fine-tuning (Unsloth)
                ↓
UltiVal Vulnerability Detection Adapter
```

## 📄 Citation

If you use this model in your research or applications, please cite:

```bibtex
@misc{ultival_mistral_lora_2025,
  title={UltiVal: Ministral-8B QLoRA Adapter for Vulnerability Detection},
  author={StarsOfChance},
  year={2025},
  publisher={Hugging Face},
  url={https://huggingface.co/starsofchance/Mistral-Unsloth-QLoRA-adapter}
}
```

## ⚖️ License

This adapter inherits the license from the base model `mistralai/Ministral-8B-Instruct-2410`. Please refer to the [base model's license](https://huggingface.co/mistralai/Ministral-8B-Instruct-2410) for specific terms and conditions.

## 🙏 Acknowledgments

- **Unsloth Team**: For the efficient LoRA fine-tuning framework
- **Mistral AI**: For the powerful Ministral-8B-Instruct-2410 base model
- **Hugging Face**: For the model hosting and PEFT library
- **UltiVal Project**: Part of ongoing research in automated vulnerability detection

## 📞 Contact & Support

- **Issues**: Report bugs or issues in the [model repository](https://huggingface.co/starsofchance/Mistral-Unsloth-QLoRA-adapter/discussions)
- **Updates**: Follow for model updates and improvements
- **Community**: Join discussions about vulnerability detection and code security

---

**🔒 Security Note**: This model is designed to assist in security analysis but should not be the sole method for vulnerability assessment. Always conduct comprehensive security reviews with multiple tools and expert analysis.