import gradio as gr gr.load( "models/sarvamai/sarvam-m", provider="hf-inference", ).launch()