Llama3 8B Instruct: generation repeats a word or sequence
#163 opened 9 months ago
by
gulululu
RuntimeError: The size of tensor a (3840) must match the size of tensor b (2) at non-singleton dimension 1
#162 opened 9 months ago
by
anumafzal94

Update README.md
#160 opened 9 months ago
by
sarohask
Can we use this model in Tensorflow?
#159 opened 9 months ago
by
Rockramsri
Request: DOI
#158 opened 9 months ago
by
SharanB
Llama3 Not Running
3
#157 opened 9 months ago
by
Mattb0124
🚩 Report: Ethical issue(s)
#155 opened 10 months ago
by
yanan128
Converting into 4-bit or 8-bit weights from tf/flax weights is currently not supported
#154 opened 10 months ago
by
Noah0704
Performance Discrepancy Between Local Llama 3 Deployment and Hugging Face Demo
2
#152 opened 10 months ago
by
depsemt
Is there an updated version of Llama-3-8B-Instruct in the works?
#150 opened 10 months ago
by
Joseph717171
Llama-3-Instruct with Langchain keeps talking to itself
5
11
#147 opened 10 months ago
by
fahim9778

About llama3 tokenizer
#146 opened 10 months ago
by
Yingshu
Need urgent help with connecting with model, was working previously
2
#145 opened 10 months ago
by
amit15gupta
Will there be an updated version of Llama-3-8B-Instruct?
#144 opened 10 months ago
by
Joseph717171
Cannot load tokenizer: Exception: data did not match any variant of untagged enum PostProcessorWrapper at line 2395 column 3
4
#143 opened 10 months ago
by
huangleiabcde
Fine-tuned Llama3 doesn't stop generating tokens when it should.
3
#142 opened 10 months ago
by
keyuisai
Error while deserializing header: HeaderTooLarge
1
#141 opened 10 months ago
by
kdavid109
Using Llama offline with transformers is slow and the response repeats itself
#139 opened 11 months ago
by
AyoxRay
Request to access the repo
#138 opened 11 months ago
by
anatris

Meta-Llama-3-8B-Instruct: "max_new_tokens" is not working for /v1/chat/completions
1
#137 opened 11 months ago
by
kuswe
Add co2_eq_emissions to README so that the HF API exposes this info to users.
#136 opened 11 months ago
by
selenecodes
Despite latest transformers version '4.41.1', I get the error: The model 'LlamaModel' is not supported for text-generation
#134 opened 11 months ago
by
guptaansit
Changing maximum input length
#133 opened 11 months ago
by
shipWr3ck
How to introduce stop_strings in llama3?
#132 opened 11 months ago
by
Srinjoy
*** OSError: meta-llama/Meta-Llama-3-8B-Instruct does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.
3
#131 opened 11 months ago
by
akjagadish
use inference api
6
#127 opened 11 months ago
by
Miralaeeop1
How to stop Llama from generating follow-up questions and excessive conversation with langchain
1
#125 opened 11 months ago
by
pelumifagbemi
Compatibility with llama-cpp-python and Ollama
#124 opened 11 months ago
by
liashchynskyi
Cloning 'Meta-Llama-3-8B-Instruct' taking excessively long time
#121 opened 11 months ago
by
guptaansit
Discrepancy between Base and Instruct model eos_token.
1
#119 opened 11 months ago
by
richardlian
System prompt during Fine Tuning
#117 opened 11 months ago
by
matteoperiani

Local configuration with LlamaCpp
#116 opened 11 months ago
by
alessandervs
Login does not work on fine-tuned model
1
2
#114 opened 11 months ago
by
Dave-theGr8
Original Directory
#113 opened 11 months ago
by
raunakkumar
Can it be used with JavaScript? If yes, how can I assign a role and some custom parameters?
#111 opened 11 months ago
by
vdggds
Request: DOI
#110 opened 11 months ago
by
SushantGautam

tokenizer.model can't be loaded by SentencePiece: "RuntimeError: Internal: could not parse ModelProto from tokenizer.model"
6
8
#109 opened 11 months ago
by
ericx134
PrivateGPT settings
#108 opened 11 months ago
by
kickinai
Possible stopping issues?
4
#106 opened 12 months ago
by
Dampfinchen
Tokenizer Chat Template
4
#103 opened 12 months ago
by
Sm1Ling
It's a great job!
#102 opened 12 months ago
by
MallemJerry
Why is there no pad token?
25
#101 opened 12 months ago
by
Imran1

Slow API Calls in Transformers
3
4
#98 opened 12 months ago
by
Zifeng01code

Llama3 8b related model has bursting issues when it generates emotion-related words
2
#95 opened 12 months ago
by
HyperBlaze
Where can I get a config.json file for Meta-Llama-3-8B-Instruct?
2
#93 opened 12 months ago
by
gdb94086

Local Mobile Device Hosting
1
#91 opened 12 months ago
by
suguspnk
How should names & fewshot examples be encoded?
7
#89 opened 12 months ago
by
theobjectivedad

Please provide some guidance or documentation on tool usage.
2
#88 opened 12 months ago
by
lcahill
AttributeError: type object 'AttentionMaskConverter' has no attribute '_ignore_causal_mask_sdpa'
#87 opened 12 months ago
by
gy19