Can't load model with SentenceTransformers 3.0.1 AttributeError: 'LatentAttentionConfig' object has no attribute '_attn_implementation_internal'
8
#50 opened 3 months ago
by
jswarner85
Can we get a simple example please using "huggingfaceembeddings" from the Langchain library or whatever class is the correct one?
#48 opened 3 months ago
by
ctranslate2-4you
Update feature for NVEmbedConfig class
1
#45 opened 4 months ago
by
lukelv
Batch_size
#44 opened 4 months ago
by
lukelv
replicate experimental results on the MTEB dataset
1
#42 opened 4 months ago
by
lzq2021
Code trying to download model from huggingface instead of using Locally Downloaded Model
4
#41 opened 4 months ago
by
sharedJackpot
Model Loading Error
3
#40 opened 4 months ago
by
kcsham
Supporting Flash Attention 2.0
#39 opened 4 months ago
by
Cdemir
'MistralModel' object has no attribute 'encode'
1
#38 opened 4 months ago
by
dadada
Has the tokenizer of the base model(Mistral-7B-v0.1) been retrained?
#37 opened 5 months ago
by
LH0521
How did you trained your LatentAttentionLayer?
1
#36 opened 5 months ago
by
juneonetwothree
Why do we need to hardcode self._attn_implementation = "eager"
1
#35 opened 5 months ago
by
shantanuagarwal
Error to load model with HuggingFace API
1
#34 opened 5 months ago
by
deleted
Regarding max seq length
1
#33 opened 5 months ago
by
sandeep456
How to fine-tune this model?
#32 opened 5 months ago
by
caochengchen
error with module datasets
2
#31 opened 5 months ago
by
claraadam
Distant resource does not have a Content-Length
#30 opened 5 months ago
by
caochengchen
Best instructions for clustering and semantic similarity
2
#29 opened 5 months ago
by
rmilliere
Dataloader multiprocessing error
1
#28 opened 5 months ago
by
Atsunori
Fixing "KeyError: 'NVEmbedConfig'"
9
#27 opened 5 months ago
by
Th3l
Error using multi-gpu support
5
#26 opened 5 months ago
by
bobwhiterabbit
Access to model nvidia/NV-Embed-v1 is restricted. You must be authenticated to access it
6
#25 opened 5 months ago
by
yijiu
Matryoshka Embedding
1
#24 opened 5 months ago
by
XingyanZhang
nvidia/NV-Embed-v1 is not the path to a directory containing a file named config.json.
3
#23 opened 5 months ago
by
XuehangCang
Finetuning guidelines
#21 opened 5 months ago
by
mali404
How much VRAM is needed to run this model? Like for the bare minimum length etc?
3
#20 opened 5 months ago
by
smpa239
Ollama Version
1
#19 opened 5 months ago
by
yangwang825
Weights are in FP16 (loaded in FP32) but paper mentions BF16
#17 opened 5 months ago
by
AdrienC
ONNX version
1
#16 opened 5 months ago
by
michaelfeil
Sentence Transformer compatibility
4
#15 opened 5 months ago
by
michaelfeil
Please provide a 8bit quantified version
#14 opened 5 months ago
by
fukai
How to use for AutoModelForSequenceClassification?
#13 opened 5 months ago
by
deshwalmahesh
Possible to implement `_no_split_modules` attribute?
1
#12 opened 5 months ago
by
ronnybehrens
missing citation
3
#11 opened 5 months ago
by
SeanLee97
Multi-Lingual?
2
#10 opened 5 months ago
by
dejanseo
Getting "KeyError" when loading model
5
#8 opened 5 months ago
by
tsakaiba
TypeError: MistralDecoderLayer.forward() got an unexpected keyword argument 'is_causal'
3
#7 opened 5 months ago
by
yxzwayne
Is this model active?
1
#5 opened 5 months ago
by
gsnic
Sharing training data & reproducing training
1
#4 opened 6 months ago
by
xhluca