Google org

The model runs on the CPU rather than the GPU because device_map="auto" was not passed to Gemma3ForConditionalGeneration.from_pretrained. Adding this parameter lets the model weights be placed on the available GPU automatically.
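For reference, a minimal sketch of what the change could look like. The helper names `gemma3_load_kwargs` and `load_gemma3` are hypothetical and not part of this repo, and the sketch assumes the `transformers` and `accelerate` packages are installed (`device_map="auto"` depends on `accelerate`).

```python
def gemma3_load_kwargs(device_map="auto"):
    """Keyword arguments for from_pretrained (hypothetical helper).

    device_map="auto" asks the loader to place weights on the available
    accelerator(s) instead of leaving everything on the CPU.
    """
    return {"device_map": device_map}


def load_gemma3(model_id):
    """Load a Gemma 3 checkpoint with automatic device placement (hypothetical helper)."""
    # Imported lazily so gemma3_load_kwargs can be used without transformers installed.
    from transformers import Gemma3ForConditionalGeneration

    model = Gemma3ForConditionalGeneration.from_pretrained(
        model_id, **gemma3_load_kwargs()
    )
    return model.eval()
```

Calling `load_gemma3` with a checkpoint ID (for example `"google/gemma-3-4b-it"`, which is an assumption here, not something stated in this PR) and then inspecting `getattr(model, "hf_device_map", None)` should show where the weights ended up.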

