missing files
#1
by
igprod
- opened
Hi HumbleWorth team,
Thanks a lot for releasing the TLD embedding model and the domain-MLM checkpoint.
We’re integrating your models in a domain appraisal pipeline
(Kaggle/Colab + GPU), and we’d like to load humbleworth/tld-embedding
as a
drop-in SentenceTransformers model when possible.
Currently, the repo provides model.safetensors
and tokenizer files, but it lacksmodules.json
and config_sentence_transformers.json
(404 on the Hub).
Also, the config has model_type: "dual_task_tld"
, which suggests a custom
architecture. Could you please provide one of the following?
- A SentenceTransformers export (i.e.,
modules.json
+config_sentence_transformers.json
)
so we can load viaSentenceTransformer(<local_dir>)
, or - The minimal custom class / loader code for
"dual_task_tld"
in Transformers,
and the expected MASK token / special tokens behavior to replicate your inference.
Repo links:
- TLD embeddings: https://huggingface.co/humbleworth/tld-embedding
Many thanks for your great work!