CodeBERT Vulnerability Detector (Multi-class)

C/C++ ์ฝ”๋“œ์˜ ์ทจ์•ฝ์ ์„ ํƒ์ง€ํ•˜๋Š” ๋‹ค์ค‘ ํด๋ž˜์Šค ๋ถ„๋ฅ˜ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.

๋ชจ๋ธ ์ •๋ณด

  • ๊ธฐ๋ฐ˜ ๋ชจ๋ธ: microsoft/codebert-base
  • ๋ถ„๋ฅ˜ ํด๋ž˜์Šค: 4๊ฐœ (CWE-79, CWE-89, CWE-119, ๊ธฐํƒ€)
  • ์ž…๋ ฅ: C/C++ ์†Œ์Šค ์ฝ”๋“œ ํ…์ŠคํŠธ

์‚ฌ์šฉ ๋ฐฉ๋ฒ•

from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# ๋ชจ๋ธ ๋กœ๋“œ
model_name = "eunJ/codebert_vulnerability_detector_multi"
tokenizer = AutoTokenizer.from_pretrained(model_name)  
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# ์ฝ”๋“œ ๋ถ„์„
code = '''
char buffer[100];
gets(buffer);  
'''

inputs = tokenizer(code, return_tensors="pt", max_length=512, truncation=True)
with torch.no_grad():
    outputs = model(**inputs)
    predictions = torch.softmax(outputs.logits, dim=-1)
    predicted_class = torch.argmax(predictions)

print(f"์˜ˆ์ธก ํด๋ž˜์Šค: {predicted_class.item()}")

ํด๋ž˜์Šค ๋ ˆ์ด๋ธ”

  • 0: CWE-79 (Cross-site Scripting)
  • 1: CWE-89 (SQL Injection)
  • 2: CWE-119 (Buffer Overflow)
  • 3: CWE-Other (๊ธฐํƒ€)
Downloads last month
-
Safetensors
Model size
125M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for eunJ/codebert_vulnerability_detector_multi

Finetuned
(103)
this model