H. Aldhaheri
aenawi
AI & ML interests
LLMs Agents
Organizations
None yet
Text2Image LLMs
LLMs
Spaces For Demos
Models-Support-Arabic
Speech-to-Speech
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 45.6k • • 21 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 1.39k • • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 23.1k • • 14 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 63 • 10
Neo4j-Cypher
Coding
DeepResearch Models
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 16.1k • 711 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.9k • 86 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 471k • • 416 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 88
Speech-To-Text
Papers - Researches
Arabic Datasets
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 1.3M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 3.05M • • 1.11k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 603k • • 127 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 24.4M • • 1.08k
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 136 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 75 -
Running279
Infinite Dataset Hub
♾279Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 24
Train-On-Datasets
Cybersecurity Models
Animation
DeepResearch Models
Text2Image LLMs
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 16.1k • 711 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.9k • 86 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 471k • • 416 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 88
LLMs
Speech-To-Text
Spaces For Demos
Papers - Researches
Models-Support-Arabic
Arabic Datasets
Speech-to-Speech
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 1.3M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 3.05M • • 1.11k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 603k • • 127 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 24.4M • • 1.08k
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 45.6k • • 21 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 1.39k • • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 23.1k • • 14 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 63 • 10
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 136 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 75 -
Running279
Infinite Dataset Hub
♾279Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 24
Neo4j-Cypher
Train-On-Datasets
Coding
Cybersecurity Models