Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
Nguyễn Minh Phúc
DatPySci
Follow
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a dataset
4 days ago
DatPySci/tldr_synthetic_llama3_3b_32
published
a dataset
4 days ago
DatPySci/tldr_synthetic_llama3_3b_32
updated
a dataset
8 days ago
DatPySci/llama3_3b_sft_tldr_synthetic
View all activity
Organizations
DatPySci
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
updated
a dataset
4 days ago
DatPySci/tldr_synthetic_llama3_3b_32
Viewer
•
Updated
4 days ago
•
5.47k
•
42
published
a dataset
4 days ago
DatPySci/tldr_synthetic_llama3_3b_32
Viewer
•
Updated
4 days ago
•
5.47k
•
42
updated
a dataset
8 days ago
DatPySci/llama3_3b_sft_tldr_synthetic
Viewer
•
Updated
8 days ago
•
5.47k
•
88
published
a dataset
8 days ago
DatPySci/llama3_3b_sft_tldr_synthetic
Viewer
•
Updated
8 days ago
•
5.47k
•
88
updated
3 datasets
18 days ago
DatPySci/weak_gpt2_large_dpo_hh
Viewer
•
Updated
18 days ago
•
8k
•
38
DatPySci/weak_gpt2_medium_dpo_hh
Viewer
•
Updated
18 days ago
•
8k
•
41
DatPySci/weak_gpt2_dpo_hh
Viewer
•
Updated
18 days ago
•
8k
•
41
updated
6 datasets
19 days ago
DatPySci/Llama-3.2-3B_refine_gpt2-large_tldr
Viewer
•
Updated
19 days ago
•
8k
•
71
DatPySci/Llama-3.2-3B_refine_gpt2-medium_tldr
Viewer
•
Updated
19 days ago
•
8k
•
73
DatPySci/Llama-3.2-3B_refine_gpt2_tldr
Viewer
•
Updated
19 days ago
•
8k
•
64
DatPySci/Llama-3.2-1B_refine_gpt2-large_tldr
Viewer
•
Updated
19 days ago
•
8k
•
39
DatPySci/Llama-3.2-1B_refine_gpt2-medium_tldr
Viewer
•
Updated
19 days ago
•
8k
•
39
DatPySci/Llama-3.2-1B_refine_gpt2_tldr
Viewer
•
Updated
19 days ago
•
8k
•
51
updated
6 datasets
23 days ago
DatPySci/hh_gpt2_large_w2s_feedback
Viewer
•
Updated
23 days ago
•
53.8k
•
37
DatPySci/hh_gpt2_medium_w2s_feedback
Viewer
•
Updated
23 days ago
•
53.8k
•
35
DatPySci/hh_gpt2_w2s_feedback
Viewer
•
Updated
23 days ago
•
53.8k
•
41
DatPySci/tldr_gpt2_large_w2s_feedback
Viewer
•
Updated
23 days ago
•
46.4k
•
39
DatPySci/tldr_gpt2_medium_w2s_feedback
Viewer
•
Updated
23 days ago
•
46.4k
•
37
DatPySci/tldr_gpt2_w2s_feedback
Viewer
•
Updated
23 days ago
•
46.4k
•
48
updated
a collection
25 days ago
Weak reward TL;DR
Collection
6 items
•
Updated
25 days ago
Load more