This PR addresses a bug that prevented flash attention 2 (FA2) from running with granite-speech-8b using HF transformers. The same bug was not present in the 2b version.

Upon closer inspection, the line `"_attn_implementation_autoset": true` was not present in config.json (though it was present in the 2b version). After adding this line, FA2 is functional again.
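For context, a minimal sketch of how FA2 is typically requested when loading a checkpoint like this with transformers; the Auto classes and checkpoint id below are assumptions for illustration, not taken from this thread:

```python
import torch
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

# Assumed checkpoint id, for illustration only.
model_id = "ibm-granite/granite-speech-3.3-8b"

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForSpeechSeq2Seq.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,               # FA2 requires fp16/bf16 weights
    attn_implementation="flash_attention_2",  # fails without the config fix described above
    device_map="auto",
)
```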


Done in the latest revision, thank you!

gsaon changed pull request status to closed
