Explore LLM performance across hardware
Convert and PR models to Safetensors
Generate chat responses using FalconMamba-7b model