`provider_options` for running on GPU

#3
by cecheta - opened

The GPU variant of the microsoft/Phi-3.5-mini-instruct-onnx model was recently updated, but `provider_options` in `genai_config.json` is now set to `[]`. When running the model with the `onnxruntime-genai-cuda` package, it now appears to run on the CPU, even on a device that has a GPU.

The `provider_options` have been intentionally left blank because the GPU ONNX model can run on both CUDA and DirectML. You can add the execution provider information back to your downloaded `genai_config.json` file, or you can set it at runtime (note that `args.provider` can be `cpu`, `cuda`, or `dml`).
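For the first option, a sketch of what the restored section of `genai_config.json` might look like, assuming the CUDA provider and the nesting (`model` → `decoder` → `session_options`) used in recent GenAI configs — check your downloaded file for the exact surrounding structure:

```json
{
  "model": {
    "decoder": {
      "session_options": {
        "provider_options": [
          {
            "cuda": {}
          }
        ]
      }
    }
  }
}
```

Replacing `"cuda"` with `"dml"` would select DirectML instead. The runtime alternative avoids editing the file at all, which is why leaving the field empty keeps one model folder usable from both packages.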

kvaishnavi changed discussion status to closed
