imabedalghafer/arabic-ner-masking

This model is a fine-tuned version of a BERT model for Named Entity Recognition (NER) specifically designed to identify and mask sensitive information in Arabic text.

Model Description

This model can identify entities that should be masked for privacy protection, such as:

  • Personal names
  • Email addresses
  • Company names
  • Other sensitive information

Usage

from transformers import pipeline

# Load the model
ner_pipeline = pipeline("ner", model="imabedalghafer/arabic-ner-masking")

# Example usage
text = "ู…ุฑุญุจุง ูŠุง ุฃุญู…ุฏุŒ ูŠู…ูƒู†ูƒ ุฅุฑุณุงู„ ุงู„ุจุฑูŠุฏ ุฅู„ู‰ [email protected]"
results = ner_pipeline(text)
print(results)

Training Data

The model was fine-tuned on Arabic text data with manually annotated entities for masking.

Performance

The model achieves good performance on entity detection and masking tasks. See the evaluation metrics in the training logs.

Intended Use

This model is intended for:

  • Privacy protection in Arabic text
  • Automated data anonymization
  • Content filtering and moderation

Limitations

  • Performance may vary on out-of-domain text
  • May not capture all types of sensitive information
  • Requires post-processing for complete anonymization
Downloads last month
3
Safetensors
Model size
108M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support