Some RLHF experiments using GRPO and DPO.
Abdelaziz Bounhar PRO
BounharAbdelaziz


·
AI & ML interests
Deep Learning, Reinforcement Learning, AI Agents, Generative Modeling, NLP, Information Theory, Security of Machine Learning, ...etc
Recent Activity
liked
a dataset
4 days ago
agentica-org/DeepScaleR-Preview-Dataset
published
a model
12 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca
published
a model
12 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca
Organizations
Moroccan Darija Embeddings Models & Datasets
Sentence and word embedding models for Moroccan darija (ary)
-
BounharAbdelaziz/ModernBERT-Morocco-Sentence-Embeddings-v0.2-bs-32-lr-2e-05-ep-2-wp-0.05-gacc-1-gnm-1.0-v0.3
Sentence Similarity • 0.2B • Updated • 6 -
BounharAbdelaziz/Morocco-Darija-Sentence-Embedding-v0.1
Feature Extraction • 0.6B • Updated • 2 -
BounharAbdelaziz/XLM-RoBERTa-Morocco-bs-32-lr-2e-05-ep-2-wp-0.05-gacc-1-gnm-1.0-v0.3
0.6B • Updated • 13 -
atlasia/Morocco-Darija-Word-Embedding
Feature Extraction • Updated • 2
Moroccan Darija Datasets
A collection of all available datasets for pretraining LLMs
Arabic (MSA) Language Models & Datasets
Moroccan Darija LLMs
Language Models that speaks Moroccan darija (ary)
Moroccan Speech Models & Datasets
Moroccan darija STT
Translation Models & Datasets
English to Moroccan darija (ary) models
Arabic (MSA) Summarization Models & Datasets
A collection of models (and the dataset used to train them) that are trained for summarizing arabic text.
-
BounharAbdelaziz/MaYofid-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 6 -
BounharAbdelaziz/MaYofid-Falcon3-3B-Instruct
Text Generation • 3B • Updated • 21 -
BounharAbdelaziz/MaYofid-Qwen2.5-3B-Instruct-AWQ
0.7B • Updated • 5 -
BounharAbdelaziz/Arabic-Synthetic-Summarization-Dataset-Filtered
Viewer • Updated • 4.41k • 45 • 1
RLHF
Some RLHF experiments using GRPO and DPO.
Moroccan Darija LLMs
Language Models that speaks Moroccan darija (ary)
Moroccan Darija Embeddings Models & Datasets
Sentence and word embedding models for Moroccan darija (ary)
-
BounharAbdelaziz/ModernBERT-Morocco-Sentence-Embeddings-v0.2-bs-32-lr-2e-05-ep-2-wp-0.05-gacc-1-gnm-1.0-v0.3
Sentence Similarity • 0.2B • Updated • 6 -
BounharAbdelaziz/Morocco-Darija-Sentence-Embedding-v0.1
Feature Extraction • 0.6B • Updated • 2 -
BounharAbdelaziz/XLM-RoBERTa-Morocco-bs-32-lr-2e-05-ep-2-wp-0.05-gacc-1-gnm-1.0-v0.3
0.6B • Updated • 13 -
atlasia/Morocco-Darija-Word-Embedding
Feature Extraction • Updated • 2
Moroccan Speech Models & Datasets
Moroccan darija STT
Moroccan Darija Datasets
A collection of all available datasets for pretraining LLMs
Translation Models & Datasets
English to Moroccan darija (ary) models
Arabic (MSA) Language Models & Datasets
Arabic (MSA) Summarization Models & Datasets
A collection of models (and the dataset used to train them) that are trained for summarizing arabic text.
-
BounharAbdelaziz/MaYofid-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 6 -
BounharAbdelaziz/MaYofid-Falcon3-3B-Instruct
Text Generation • 3B • Updated • 21 -
BounharAbdelaziz/MaYofid-Qwen2.5-3B-Instruct-AWQ
0.7B • Updated • 5 -
BounharAbdelaziz/Arabic-Synthetic-Summarization-Dataset-Filtered
Viewer • Updated • 4.41k • 45 • 1