wvaneaton committed on
Commit 3452b09 · verified · 1 Parent(s): 5ee7489

Update README.md

Files changed (1)
  1. README.md +6 -0
README.md CHANGED
@@ -1,6 +1,12 @@
  ---
  license: apache-2.0
  ---
+ # About the model
+ Predibase-T2T-32B-RFT is a 32-billion parameter transformer model fine-tuned with Reinforcement Fine-Tuning (RFT) on the Predibase platform. It is optimized for high-quality text generation with strong generalization across reasoning, instruction-following, and domain-specific tasks.
+
+ Unlike traditional supervised fine-tuning approaches, Predibase-T2T-32B-RFT leverages RFT to adapt model behavior interactively, optimizing for downstream task quality with minimal labeled data. This makes it a highly cost-efficient and performant alternative to proprietary LLMs.
+
+ The model is fine-tuned on a diverse set of reward functions, allowing it to dynamically adjust responses based on contextual understanding.
 
  # How to use this model
 
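The added README text says the model was fine-tuned on "a diverse set of reward functions" but does not show any. As a rough illustration of what that phrase can mean in reinforcement fine-tuning, here is a minimal sketch in which several small scoring functions are combined into one scalar reward. Everything in it is an assumption made for illustration: the function names, the `<answer>` tag convention, and the weights are hypothetical and are not taken from the model card or from the Predibase platform.

```python
import re
from typing import Callable, Dict

# Hypothetical reward functions for illustration only; none of these
# come from the model card or from Predibase.
RewardFn = Callable[[str, str], float]

ANSWER_PATTERN = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def format_reward(completion: str, reference: str) -> float:
    """Return 1.0 if the completion contains an <answer>...</answer> block."""
    return 1.0 if ANSWER_PATTERN.search(completion) else 0.0


def exact_match_reward(completion: str, reference: str) -> float:
    """Return 1.0 if the tagged answer equals the reference (whitespace-trimmed)."""
    match = ANSWER_PATTERN.search(completion)
    if match is None:
        return 0.0
    return 1.0 if match.group(1).strip() == reference.strip() else 0.0


REWARDS: Dict[str, RewardFn] = {
    "format": format_reward,
    "exact_match": exact_match_reward,
}


def combined_reward(completion: str, reference: str, weights: Dict[str, float]) -> float:
    """Weighted average of the named reward functions, in the range [0, 1]."""
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    score = sum(w * REWARDS[name](completion, reference) for name, w in weights.items())
    return score / total_weight
```

A call such as `combined_reward("<answer> 42 </answer>", "42", {"format": 1.0, "exact_match": 2.0})` scores a well-formatted, correct completion, while a completion that omits the tags scores zero on both components. In an actual RFT setup, a scalar like this would be fed to the policy-optimization step; that part is outside the scope of this sketch.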