Fix for FlashAttention RuntimeError & Triton Multi GPU fix.
#17
by
Satandon1999
- opened
Fix based on the discussion here: https://huggingface.co/microsoft/Phi-3-small-8k-instruct/discussions/11
Satandon1999
changed pull request title from
Update positional_embedding.py
to Fix for FlashAttention RuntimeError
Satandon1999
changed pull request title from
Fix for FlashAttention RuntimeError
to Fix for FlashAttention RuntimeError & Triton Multi GPU fix.
@damajercakms . Please review. Thanks.