Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
vishaljoshi24
/
trl-4-dnd
Paused

App Files Files Community
Fetching metadata from the HF Docker repository...
trl-4-dnd / examples /scripts
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
vishaljoshi24's picture
vishaljoshi24
Initial Commit
a080fe0 2 days ago
  • evals
    Initial Commit 2 days ago
  • ppo
    Initial Commit 2 days ago
  • rloo
    Initial Commit 2 days ago
  • alignprop.py
    5.26 kB
    Initial Commit 2 days ago
  • bco.py
    5.98 kB
    Initial Commit 2 days ago
  • cpo.py
    3.58 kB
    Initial Commit 2 days ago
  • ddpo.py
    7.7 kB
    Initial Commit 2 days ago
  • dpo.py
    900 Bytes
    Initial Commit 2 days ago
  • dpo_online.py
    5.47 kB
    Initial Commit 2 days ago
  • dpo_vlm.py
    5.84 kB
    Initial Commit 2 days ago
  • gkd.py
    4.7 kB
    Initial Commit 2 days ago
  • grpo_vlm.py
    7.16 kB
    Initial Commit 2 days ago
  • gspo.py
    6.34 kB
    Initial Commit 2 days ago
  • gspo_vlm.py
    6.74 kB
    Initial Commit 2 days ago
  • kto.py
    3.78 kB
    Initial Commit 2 days ago
  • mpo_vlm.py
    4.49 kB
    Initial Commit 2 days ago
  • nash_md.py
    5.32 kB
    Initial Commit 2 days ago
  • orpo.py
    3.67 kB
    Initial Commit 2 days ago
  • prm.py
    4.46 kB
    Initial Commit 2 days ago
  • reward_modeling.py
    4.81 kB
    Initial Commit 2 days ago
  • sft.py
    900 Bytes
    Initial Commit 2 days ago
  • sft_gemma3.py
    2 kB
    Initial Commit 2 days ago
  • sft_gpt_oss.py
    3.33 kB
    Initial Commit 2 days ago
  • sft_video_llm.py
    8.45 kB
    Initial Commit 2 days ago
  • sft_vlm.py
    5.08 kB
    Initial Commit 2 days ago
  • sft_vlm_gemma3.py
    8.51 kB
    Initial Commit 2 days ago
  • sft_vlm_smol_vlm.py
    5.5 kB
    Initial Commit 2 days ago
  • xpo.py
    4.75 kB
    Initial Commit 2 days ago