Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
daven3
/
molmoe-step-16500-w-router
like
0
Safetensors
olmoe
License:
mit
Model card
Files
Files and versions
Community
main
molmoe-step-16500-w-router
1 contributor
History:
2 commits
daven3
ckpt for step 16500 with router
4c81226
16 days ago
.gitattributes
Safe
1.52 kB
initial commit
16 days ago
README.md
Safe
24 Bytes
initial commit
16 days ago
config.json
889 Bytes
ckpt for step 16500 with router
16 days ago
generation_config.json
120 Bytes
ckpt for step 16500 with router
16 days ago
latest
Safe
16 Bytes
ckpt for step 16500 with router
16 days ago
model-00001-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
16 days ago
model-00002-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
16 days ago
model-00003-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
16 days ago
model-00004-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
16 days ago
model-00005-of-00006.safetensors
5 GB
LFS
ckpt for step 16500 with router
16 days ago
model-00006-of-00006.safetensors
1.74 GB
LFS
ckpt for step 16500 with router
16 days ago
model.safetensors.index.json
564 kB
ckpt for step 16500 with router
16 days ago
scheduler.pt
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
1.06 kB
LFS
ckpt for step 16500 with router
16 days ago
special_tokens_map.json
Safe
293 Bytes
ckpt for step 16500 with router
16 days ago
tokenizer.json
Safe
3.57 MB
ckpt for step 16500 with router
16 days ago
tokenizer_config.json
5.9 kB
ckpt for step 16500 with router
16 days ago
trainer_state.json
2.91 MB
ckpt for step 16500 with router
16 days ago
training_args.bin
pickle
Detected Pickle imports (14)
"transformers.integrations.deepspeed.HfDeepSpeedConfig"
,
"llamafactory.hparams.training_args.TrainingArguments"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.SchedulerType"
,
"accelerate.utils.dataclasses.DeepSpeedPlugin"
,
"transformers.trainer_utils.IntervalStrategy"
,
"torch.bfloat16"
,
"accelerate.utils.dataclasses.DistributedType"
,
"torch.device"
,
"transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SaveStrategy"
How to fix it?
7.48 kB
LFS
ckpt for step 16500 with router
16 days ago
zero_to_fp32.py
Safe
33.3 kB
ckpt for step 16500 with router
16 days ago