Text Generation
Transformers
PyTorch
llama
text-generation-inference
Inference Endpoints

Commit History

Maximum sequence length for a Llama 2 model is 4096
d3850bc

TheBloke commited on

Update README.md
a043ea3

WizardLM commited on

Update README.md
05f8dc6

WizardLM commited on

Update README.md
85a4cbc

WizardLM commited on

Update README.md
1d8f11e

WizardLM commited on

Update README.md
97759fa

WizardLM commited on

Update README.md
8ecb6cc

WizardLM commited on

Update README.md
14a8aed

WizardLM commited on

Update README.md
24c046e

WizardLM commited on

Update README.md
64b2a3d

WizardLM commited on

Update README.md
c43ff4d

WizardLM commited on

Update README.md
7cdebc2

WizardLM commited on

Update README.md
a6e6a97

WizardLM commited on

70B V1.0
37558d7

WizardLM commited on

initial commit
1d9e3af

WizardLM commited on