PiCSAR: Probabilistic Confidence Selection And Ranking Paper • 2508.21787 • Published Aug 29, 2025 • 4
BaCaDI: Bayesian Causal Discovery with Unknown Interventions Paper • 2206.01665 • Published Jun 3, 2022 • 2
Self-Training Large Language Models for Tool-Use Without Demonstrations Paper • 2502.05867 • Published Feb 9, 2025
Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain Paper • 2307.03042 • Published Jul 6, 2023
Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them Paper • 2507.10616 • Published Jul 13, 2025 • 1