arxiv:2502.01618
Guangxuan Xu
gx-ai-architect
AI & ML interests
None yet
Recent Activity
authored
a paper
about 9 hours ago
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs
using Particle-Based Monte Carlo Methods
updated
a dataset
4 days ago
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format
published
a dataset
4 days ago
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format
Organizations
Papers
1
models
2
datasets
16
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k-old-r1-format
Viewer
•
Updated
•
39k
•
7
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl-40k
Viewer
•
Updated
•
39k
•
9
gx-ai-architect/numinamath-178k-phi4-bon-verified-dpo-trl
Viewer
•
Updated
•
39k
•
8
gx-ai-architect/official_half_rh_half_r1_prompt_60k
Viewer
•
Updated
•
62k
•
16
gx-ai-architect/official_dpo_rh_bo8_random_rej_balanced
Viewer
•
Updated
•
48.8k
•
8
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej_balanced_fixed
Viewer
•
Updated
•
59.4k
•
28
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej_balanced
Viewer
•
Updated
•
59.4k
•
20
gx-ai-architect/official_dpo_r1_prompt_bo8_random_rej
Viewer
•
Updated
•
50.9k
•
22
gx-ai-architect/trl_dpo_vanilla_bo8_random_rej
Viewer
•
Updated
•
59.1k
•
37
gx-ai-architect/dpo_vanilla_bo8_random_rej
Viewer
•
Updated
•
59.1k
•
17