shubhrapandit's picture
Upload model files
f409e55
raw
history blame contribute delete
143 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
QuantizationModifier:
ignore: ['re:.*lm_head']
targets: [Linear]
scheme: FP8_DYNAMIC