cj453/dense_reward_trainer_final_opt__NumTrainEpochs5_SaveStrategiesno_reward_modeling_anthropic_hh

Generated from Trainer

1 contributor

History: 2 commits

Latest commit: End of training (1681b41, verified, 11 months ago)

File                              Size       LFS  Last commit
.gitattributes                    1.52 kB         initial commit, 11 months ago
README.md                         9.97 kB         End of training, 11 months ago
config.json                       841 Bytes       End of training, 11 months ago
merges.txt                        456 kB          End of training, 11 months ago
model-00001-of-00002.safetensors  4.99 GB    LFS  End of training, 11 months ago
model-00002-of-00002.safetensors  680 MB     LFS  End of training, 11 months ago
model.safetensors.index.json      33.9 kB         End of training, 11 months ago
special_tokens_map.json           548 Bytes       End of training, 11 months ago
tokenizer.json                    2.11 MB         End of training, 11 months ago
tokenizer_config.json             669 Bytes       End of training, 11 months ago
training_args.bin                 4.92 kB    LFS  End of training, 11 months ago
vocab.json                        798 kB          End of training, 11 months ago

training_args.bin: Detected Pickle imports (8)
- trl.trainer.reward_config.RewardConfig
- accelerate.state.PartialState
- transformers.trainer_utils.SchedulerType
- transformers.trainer_utils.HubStrategy
- transformers.training_args.OptimizerNames
- torch.device
- accelerate.utils.dataclasses.DistributedType
- transformers.trainer_utils.IntervalStrategy
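
The two model-0000N-of-00002.safetensors shards in the listing are tied together by model.safetensors.index.json, which in the usual sharded-checkpoint layout holds a "metadata" block and a "weight_map" from tensor name to shard file. The following is a minimal sketch of how such an index could be summarized, not a description of this repository's actual contents: the function name summarize_index and every tensor name and size in the example are hypothetical.

```python
import json
from collections import Counter


def summarize_index(index: dict) -> dict:
    """Count tensors per shard in a sharded-checkpoint index dictionary."""
    # "weight_map" maps each tensor name to the shard file that stores it.
    weight_map = index.get("weight_map", {})
    per_shard = Counter(weight_map.values())
    return {
        # "total_size" is the byte count recorded in the index metadata, if any.
        "total_size": index.get("metadata", {}).get("total_size"),
        "tensors_per_shard": dict(per_shard),
    }


# Hypothetical miniature index: tensor names and total_size are invented
# for illustration and are NOT taken from this repository.
example_index = {
    "metadata": {"total_size": 1024},
    "weight_map": {
        "layer.0.weight": "model-00001-of-00002.safetensors",
        "layer.0.bias": "model-00001-of-00002.safetensors",
        "layer.1.weight": "model-00002-of-00002.safetensors",
    },
}

summary = summarize_index(example_index)
print(json.dumps(summary, indent=2))
```

To apply this to the real file, one would load it with json.load after downloading model.safetensors.index.json and pass the resulting dictionary to the function.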