Text Generation
Transformers
English
llama
Inference Endpoints

Commit History

Add 4-bits model & quantize config
7f128ee

BurnThePage commited on