Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
Nguyễn Minh Phúc
DatPySci
Follow
0 followers
·
1 following
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a model
6 days ago
DatPySci/Qwen-2.5-7B-Simple-RL
published
a model
7 days ago
DatPySci/Qwen-2.5-7B-Simple-RL
published
a model
11 days ago
DatPySci/Llama-3.2-3B-sft-mixture
View all activity
Organizations
DatPySci
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
6 days ago
DatPySci/Qwen-2.5-7B-Simple-RL
Updated
6 days ago
published
a model
7 days ago
DatPySci/Qwen-2.5-7B-Simple-RL
Updated
6 days ago
published
a model
11 days ago
DatPySci/Llama-3.2-3B-sft-mixture
Text Generation
•
Updated
Feb 10
•
615
updated
a model
12 days ago
DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
12 days ago
•
2
updated
a model
17 days ago
DatPySci/DeepSeek-Qwen-1.5B-GRPO
Updated
17 days ago
•
2
published
2 models
17 days ago
DatPySci/DeepSeek-Qwen-1.5B-GRPO
Updated
17 days ago
•
2
DatPySci/Qwen-1.5B-Math-GRPO
Updated
17 days ago
published
a model
18 days ago
DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
12 days ago
•
2
updated
a dataset
3 months ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh
Viewer
•
Updated
Feb 10
•
140k
•
12
published
a dataset
3 months ago
DatPySci/Llama-3.1-8B-rm-anthropic-hh
Viewer
•
Updated
Feb 10
•
140k
•
12
updated
a dataset
3 months ago
DatPySci/Llama-3.1-8B-rm-tldr-pref
Viewer
•
Updated
Feb 10
•
177k
•
20
published
a dataset
3 months ago
DatPySci/Llama-3.1-8B-rm-tldr-pref
Viewer
•
Updated
Feb 10
•
177k
•
20
updated
a model
3 months ago
DatPySci/Llama-3.2-3B-sft-mixture
Text Generation
•
Updated
Feb 10
•
615
updated
a dataset
3 months ago
DatPySci/tldr_pythia-6.9b_pref
Viewer
•
Updated
Feb 6
•
94.9k
•
20
published
2 datasets
3 months ago
DatPySci/tldr_pythia-6.9b_pref
Viewer
•
Updated
Feb 6
•
94.9k
•
20
DatPySci/gpt2_dpo_tldr
Viewer
•
Updated
Dec 21, 2024
•
8k
•
50
updated
a dataset
4 months ago
DatPySci/tldr_synthetic_llama3_3b_32
Viewer
•
Updated
Jan 24
•
5.47k
•
14
published
a dataset
4 months ago
DatPySci/tldr_synthetic_llama3_3b_32
Viewer
•
Updated
Jan 24
•
5.47k
•
14
updated
a dataset
4 months ago
DatPySci/llama3_3b_sft_tldr_synthetic
Viewer
•
Updated
Jan 19
•
5.47k
•
21
published
a dataset
4 months ago
DatPySci/llama3_3b_sft_tldr_synthetic
Viewer
•
Updated
Jan 19
•
5.47k
•
21
Load more