daven3/molmoe-step-16500-w-router
Tags: Safetensors, olmoe
License: mit
1 contributor
History: 2 commits
Latest commit: daven3, "ckpt for step 16500 with router" (4c81226), 25 days ago
File                              Size       Storage  Scan    Last commit                      Updated
.gitattributes                    1.52 kB    -        Safe    initial commit                   25 days ago
README.md                         24 Bytes   -        Safe    initial commit                   25 days ago
config.json                       889 Bytes  -        -       ckpt for step 16500 with router  25 days ago
generation_config.json            120 Bytes  -        -       ckpt for step 16500 with router  25 days ago
latest                            16 Bytes   -        -       ckpt for step 16500 with router  25 days ago
model-00001-of-00006.safetensors  5 GB       LFS      -       ckpt for step 16500 with router  25 days ago
model-00002-of-00006.safetensors  5 GB       LFS      -       ckpt for step 16500 with router  25 days ago
model-00003-of-00006.safetensors  5 GB       LFS      -       ckpt for step 16500 with router  25 days ago
model-00004-of-00006.safetensors  5 GB       LFS      -       ckpt for step 16500 with router  25 days ago
model-00005-of-00006.safetensors  5 GB       LFS      -       ckpt for step 16500 with router  25 days ago
model-00006-of-00006.safetensors  1.74 GB    LFS      -       ckpt for step 16500 with router  25 days ago
model.safetensors.index.json      564 kB     -        Safe    ckpt for step 16500 with router  25 days ago
scheduler.pt                      1.06 kB    LFS      pickle  ckpt for step 16500 with router  25 days ago
special_tokens_map.json           293 Bytes  -        Safe    ckpt for step 16500 with router  25 days ago
tokenizer.json                    3.57 MB    -        Safe    ckpt for step 16500 with router  25 days ago
tokenizer_config.json             5.9 kB     -        -       ckpt for step 16500 with router  25 days ago
trainer_state.json                2.91 MB    -        -       ckpt for step 16500 with router  25 days ago
training_args.bin                 7.48 kB    LFS      pickle  ckpt for step 16500 with router  25 days ago
zero_to_fp32.py                   33.3 kB    -        Safe    ckpt for step 16500 with router  25 days ago

scheduler.pt (pickle): No problematic imports detected

training_args.bin (pickle): Detected Pickle imports (14)
"transformers.integrations.deepspeed.HfDeepSpeedConfig",
"llamafactory.hparams.training_args.TrainingArguments",
"transformers.trainer_pt_utils.AcceleratorConfig",
"transformers.training_args.OptimizerNames",
"transformers.trainer_utils.SchedulerType",
"accelerate.utils.dataclasses.DeepSpeedPlugin",
"transformers.trainer_utils.IntervalStrategy",
"torch.bfloat16",
"accelerate.utils.dataclasses.DistributedType",
"torch.device",
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
"accelerate.state.PartialState",
"transformers.trainer_utils.HubStrategy",
"transformers.trainer_utils.SaveStrategy"
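
The "Detected Pickle imports" reported for training_args.bin are the module-level names the pickle stream would import when loaded. A minimal sketch of how such a list could be produced without unpickling anything, using only the standard-library pickletools module, is shown here. It is not the Hub's actual scanner; the function name list_pickle_imports is hypothetical, and the STACK_GLOBAL handling is a heuristic that assumes the module and attribute names are the two most recently pushed strings.

```python
import collections
import pickle
import pickletools

# Opcodes that push a text string, and opcodes that write/read the pickle memo.
STRING_OPS = {"UNICODE", "BINUNICODE", "SHORT_BINUNICODE", "BINUNICODE8",
              "STRING", "BINSTRING", "SHORT_BINSTRING"}
PUT_OPS = {"PUT", "BINPUT", "LONG_BINPUT"}
GET_OPS = {"GET", "BINGET", "LONG_BINGET"}


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: it only walks the opcodes and never executes them.
    """
    found = set()
    strings = []   # strings pushed so far, in order
    memo = {}      # memo index -> string (or None for non-string values)
    prev = None    # string pushed by the immediately preceding opcode, if any
    for opcode, arg, _pos in pickletools.genops(data):
        name = opcode.name
        pushed = None
        if name in ("GLOBAL", "INST"):
            # Older protocols: arg is "module name" separated by one space.
            module, attr = arg.split(" ", 1)
            found.add(f"{module}.{attr}")
        elif name == "STACK_GLOBAL":
            # Protocol 4+: assume the last two pushed strings are module, name.
            if len(strings) >= 2:
                found.add(f"{strings[-2]}.{strings[-1]}")
        elif name in STRING_OPS:
            pushed = arg
        elif name == "MEMOIZE":
            memo[len(memo)] = prev
        elif name in PUT_OPS:
            memo[arg] = prev
        elif name in GET_OPS:
            pushed = memo.get(arg)  # a re-used string, or None
        if pushed is not None:
            strings.append(pushed)
        prev = pushed
    return found


# Demo on a harmless in-memory pickle rather than a downloaded file.
demo = pickle.dumps([collections.OrderedDict(a=1), collections.Counter()],
                    protocol=4)
print(sorted(list_pickle_imports(demo)))
```

Files written by recent versions of torch.save are usually zip archives, so for a file such as training_args.bin the pickle stream would likely need to be read out of the archive (for example with zipfile) before being passed to this function; that layout is an assumption here and is not confirmed by the listing.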