marcuscedricridia commited on
Commit
0df2340
·
verified ·
1 Parent(s): 1178b4a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -122,6 +122,9 @@ This suggests that **reasoning and non-reasoning modes are not behaviorally isol
122
 
123
  A likely cause is shared parameter space. Despite mode control via `enable_thinking`, both modes tap into the same underlying weights and attention flows. The slight signal imbalance—reasoning traces being more structured or expressive—may have also contributed to stronger transfer from reasoning → non-reasoning than vice versa.
124
 
 
 
 
125
  ---
126
 
127
  ## Conclusion
 
122
 
123
  A likely cause is shared parameter space. Despite mode control via `enable_thinking`, both modes tap into the same underlying weights and attention flows. The slight signal imbalance—reasoning traces being more structured or expressive—may have also contributed to stronger transfer from reasoning → non-reasoning than vice versa.
124
 
125
+ Yes, Qwen3 lets your model have 2 personalities (via enable_thinking or / commands), but they’re not cleanly separated. Fine-tuning one mode affects the other. Even 4 epochs showed behavioral bleed. With strong finetuning (2–4 epochs), you can mostly separate behaviors. Mode control works, just not perfectly. Not perfect, but good enough to steer two distinct modes with care.
126
+
127
+
128
  ---
129
 
130
  ## Conclusion