imabedalghafer/arabic-ner-masking
This model is a fine-tuned version of a BERT model for Named Entity Recognition (NER) specifically designed to identify and mask sensitive information in Arabic text.
Model Description
This model can identify entities that should be masked for privacy protection, such as:
- Personal names
- Email addresses
- Company names
- Other sensitive information
Usage
from transformers import pipeline
# Load the model
ner_pipeline = pipeline("ner", model="imabedalghafer/arabic-ner-masking")
# Example usage
text = "ู
ุฑุญุจุง ูุง ุฃุญู
ุฏุ ูู
ููู ุฅุฑุณุงู ุงูุจุฑูุฏ ุฅูู [email protected]"
results = ner_pipeline(text)
print(results)
Training Data
The model was fine-tuned on Arabic text data with manually annotated entities for masking.
Performance
The model achieves good performance on entity detection and masking tasks. See the evaluation metrics in the training logs.
Intended Use
This model is intended for:
- Privacy protection in Arabic text
- Automated data anonymization
- Content filtering and moderation
Limitations
- Performance may vary on out-of-domain text
- May not capture all types of sensitive information
- Requires post-processing for complete anonymization
- Downloads last month
- 3
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support