BUG Chat template doesn't respect `add_generation_prompt` flag from transformers tokenizer
1
#44 opened 4 months ago
by
ilu000
How to use the ASR on Llama 3.1
1
#43 opened 4 months ago
by
andrygasy
Tokenizer 'apply_chat_template' issue
1
#42 opened 4 months ago
by
Ksgk-fy
Function Calling Evaluation bench Nexus (0-shot)
#41 opened 4 months ago
by
WateBear
Error: json: cannot unmarshal array into Go struct field Params.eos_token_id of type int
2
#40 opened 4 months ago
by
SadeghPouriyan
ValueError: Pipeline with tokenizer without pad_token cannot do batching. You can try to set it with `pipe.tokenizer.pad_token_id = model.config.eos_token_id`.
4
#39 opened 4 months ago
by
jsemrau
Run this on CPU and use tool calling
1
#38 opened 4 months ago
by
J22
!!Access Problem
11
#37 opened 4 months ago
by
fengzi258
Llama-3.1-8B generates way too long answers!
2
#36 opened 4 months ago
by
ayyylemao
Tokenizer error and/or 'rope_scaling' problem
5
#35 opened 4 months ago
by
fazayjo
Deployment to Inference Endpoints
6
#34 opened 4 months ago
by
stmackcat
Best practice for tool calling with meta-llama/Meta-Llama-3.1-8B-Instruct
1
#33 opened 4 months ago
by
zzclynn
The model often enters infinite generation loops
13
#32 opened 4 months ago
by
sszymczyk
Unable to load 4-bit quantized variant with llama.cpp
#31 opened 4 months ago
by
sunnykusawa
Garbage output?
10
#30 opened 4 months ago
by
danielus
Question about chat template and fine-tuning
3
#23 opened 4 months ago
by
tblattner
Issues loading model with oobabooga text-generation-webui
5
#20 opened 4 months ago
by
Kenji776
Which tokenizer should I use for Llama 3.1 8B?
2
#19 opened 4 months ago
by
calebl
The sample code on the model card page is not right
#18 opened 4 months ago
by
kmtao
My alternative quantizations.
7
#16 opened 4 months ago
by
ZeroWw
ValueError: `rope_scaling` must be a dictionary with two fields
45
#15 opened 4 months ago
by
jsemrau
Independently Benchmarked Humaneval and Evalplus scores
2
#13 opened 4 months ago
by
VaibhavSahai
DO NOT MERGE v2 make sure vllm and transformers work
#12 opened 4 months ago
by
ArthurZ
DO NOT MERGE test for vllm
2
#11 opened 4 months ago
by
ArthurZ
Please do not include original PTH files.
4
#10 opened 4 months ago
by
Qubitium
Utterly based
1
#9 opened 4 months ago
by
llama-anon