Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1930.4
TFLOPS
14
41
115
Abdelaziz Bounhar
PRO
BounharAbdelaziz
Follow
MoSBAIHI's profile picture
atlas00119's profile picture
amr-mohamed's profile picture
48 followers
·
50 following
http://abdelazizbounhar.com/
BounharAbdelaziz
abdelaziz-bounhar-a58910138
AI & ML interests
Deep Learning, Reinforcement Learning, AI Agents, Generative Modeling, NLP, Information Theory, Security of Machine Learning, ...etc
Recent Activity
liked
a dataset
4 days ago
agentica-org/DeepScaleR-Preview-Dataset
published
a model
12 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca
published
a model
12 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca
View all activity
Organizations
BounharAbdelaziz
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
4 days ago
agentica-org/DeepScaleR-Preview-Dataset
Viewer
•
Updated
Feb 10
•
40.3k
•
5.4k
•
135
published
2 models
12 days ago
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca
Text Generation
•
0.5B
•
Updated
12 days ago
•
12
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca
Text Generation
•
0.5B
•
Updated
12 days ago
•
10
updated
a collection
12 days ago
RLHF
Collection
Some RLHF experiments using GRPO and DPO.
•
3 items
•
Updated
12 days ago
updated
3 models
12 days ago
BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K
Text Generation
•
3B
•
Updated
12 days ago
•
9
BounharAbdelaziz/Qwen2.5-0.5B-DPO-French-Orca
Text Generation
•
0.5B
•
Updated
12 days ago
•
12
BounharAbdelaziz/Qwen2.5-0.5B-DPO-English-Orca
Text Generation
•
0.5B
•
Updated
12 days ago
•
10
published
a model
12 days ago
BounharAbdelaziz/Qwen2.5-3B-GRPO-Math-GSM8K
Text Generation
•
3B
•
Updated
12 days ago
•
9
updated
a model
12 days ago
BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old
Text Generation
•
3B
•
Updated
12 days ago
•
2
published
a model
13 days ago
BounharAbdelaziz/Qwen2.5-3B-GRPO-GSM8K-old
Text Generation
•
3B
•
Updated
12 days ago
•
2
liked
2 datasets
17 days ago
AIffl/french_orca_dpo_pairs
Viewer
•
Updated
May 26, 2024
•
12.7k
•
93
•
6
AIffl/french_hh_rlhf
Viewer
•
Updated
Jun 15, 2024
•
169k
•
141
•
4
liked
2 datasets
26 days ago
a-m-team/AM-Thinking-v1-Distilled
Preview
•
Updated
25 days ago
•
5.55k
•
33
a-m-team/AM-Qwen3-Distilled
Preview
•
Updated
May 22
•
2.73k
•
12
liked
a model
27 days ago
QCRI/Fanar-1-9B-Instruct
Text Generation
•
9B
•
Updated
Jun 5
•
5.63k
•
22
liked
a dataset
28 days ago
open-thoughts/OpenThoughts3-1.2M
Viewer
•
Updated
28 days ago
•
1.2M
•
21k
•
117
Load more