Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a model 12 days ago

DatPySci/PreRLFisher

updated a model 12 days ago

DatPySci/PreRLFisher

updated a model 12 days ago

DatPySci/PreRLFisher

View all activity

Organizations

Collections 1

models 92

datasets 60

DatPySci/Qwen2.5-Math-1.5B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 15

DatPySci/Qwen2.5-Math-7B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 8 • 1

DatPySci/Llama-3.2-3B-deepscaler

Viewer • Updated Sep 16, 2025 • 161k • 2

DatPySci/Llama-3.1-8B-rm-anthropic-hh

Viewer • Updated Feb 10, 2025 • 140k • 11

DatPySci/Llama-3.1-8B-rm-tldr-pref

Viewer • Updated Feb 10, 2025 • 177k • 7

DatPySci/tldr_pythia-6.9b_pref

Viewer • Updated Feb 6, 2025 • 94.9k • 37

DatPySci/tldr_synthetic_llama3_3b_32

Viewer • Updated Jan 24, 2025 • 5.47k • 5

DatPySci/llama3_3b_sft_tldr_synthetic

Viewer • Updated Jan 19, 2025 • 5.47k • 4

DatPySci/weak_gpt2_large_dpo_hh

Viewer • Updated Jan 9, 2025 • 8k • 13

DatPySci/weak_gpt2_medium_dpo_hh

Viewer • Updated Jan 9, 2025 • 8k • 11

View 60 datasets

Nguyễn Minh Phúc

AI & ML interests

Recent Activity

Organizations

Collections 1

DatPySci/sft_model_tldr_summarization

DatPySci/model_step_32400_tldr_summarization

DatPySci/model_step_72000_tldr_summarization

DatPySci/sft_model_tldr_summarization

DatPySci/model_step_32400_tldr_summarization

DatPySci/model_step_72000_tldr_summarization

models 92

DatPySci/PreRLFisher

DatPySci/RLDI

DatPySci/Qwen-2.5-7B-Simple-RL

DatPySci/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

DatPySci/DeepSeek-Qwen-1.5B-GRPO

DatPySci/Qwen-1.5B-Math-GRPO

DatPySci/Llama-3.2-3B-sft-mixture

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr

datasets 60

DatPySci/Qwen2.5-Math-1.5B-deepscaler

DatPySci/Qwen2.5-Math-7B-deepscaler

DatPySci/Llama-3.2-3B-deepscaler

DatPySci/Llama-3.1-8B-rm-anthropic-hh

DatPySci/Llama-3.1-8B-rm-tldr-pref

DatPySci/tldr_pythia-6.9b_pref

DatPySci/tldr_synthetic_llama3_3b_32

DatPySci/llama3_3b_sft_tldr_synthetic

DatPySci/weak_gpt2_large_dpo_hh

DatPySci/weak_gpt2_medium_dpo_hh

Nguyễn Minh Phúc

AI & ML interests

Recent Activity

Organizations

Collections 1

models 92 Sort: Recently updated

datasets 60 Sort: Recently updated

models 92

datasets 60