isemmanuelolowe/BerKANT_171M

Kolmogorov-Arnold Network

1 contributor

History: 27 commits

Latest commit: "Update README.md" by isemmanuelolowe (dfc9313, verified), 6 months ago

| File | Size | Last commit | Updated |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | 6 months ago |
| KAN.py | 11.1 kB | Upload 5 files | 6 months ago |
| README.md | 754 Bytes | Update README.md | 6 months ago |
| berkant_layers.py | 47.6 kB | Upload 5 files | 6 months ago |
| berkant_padding.py | 6.26 kB | Upload 5 files | 6 months ago |
| config.json | 913 Bytes | Upload BerKANTForMaskedLM | 6 months ago |
| configuration_berkant.py | 1.02 kB | Upload 5 files | 6 months ago |
| flash_attn_triton.py | 42.7 kB | Upload 5 files | 6 months ago |
| generation_config.json | 95 Bytes | Upload BerKANTForMaskedLM | 6 months ago |
| model.safetensors | 686 MB (LFS) | Upload BerKANTForMaskedLM | 6 months ago |
| special_tokens_map.json | 125 Bytes | Training in progress, step 100 | 6 months ago |
| tokenizer.json | 712 kB | Training in progress, step 100 | 6 months ago |
| tokenizer_config.json | 1.19 kB | Training in progress, step 100 | 6 months ago |
| training_args.bin | 5.05 kB (LFS) | Training in progress, step 1000 | 6 months ago |
| vocab.txt | 232 kB | Training in progress, step 100 | 6 months ago |

Detected pickle imports in training_args.bin (9):
- transformers.trainer_utils.SchedulerType
- torch.device
- accelerate.utils.dataclasses.DistributedType
- transformers.training_args.TrainingArguments
- transformers.trainer_utils.HubStrategy
- transformers.training_args.OptimizerNames
- transformers.trainer_utils.IntervalStrategy
- transformers.trainer_pt_utils.AcceleratorConfig
- accelerate.state.PartialState
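
The pickle imports flagged for training_args.bin can, in principle, be listed locally without executing the file. The following is a minimal sketch using only the Python standard library. It is not the scanner the Hub uses, and `list_pickle_imports` is a hypothetical helper name. It assumes it is handed a raw pickle byte stream. Files written by recent versions of `torch.save` are normally zip archives, so the embedded pickle stream would have to be extracted first, and that step is not shown. The handling of `STACK_GLOBAL` is a heuristic that takes the two most recently seen string arguments, which can miss names the pickler reuses through its memo.

```python
import io
import pickle
import pickletools
from collections import OrderedDict
from typing import List


def list_pickle_imports(data: bytes) -> List[str]:
    """List the globals a pickle stream references, without unpickling it."""
    imports = []
    strings = []  # string arguments seen so far, consulted for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", separated by a space.
            module, name = arg.split(" ", 1)
            imports.append(module + "." + name)
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name were pushed as strings beforehand.
            # Heuristic: assume they are the two most recent string arguments.
            imports.append(strings[-2] + "." + strings[-1])
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Demonstration on an in-memory pickle rather than on a file from the repo.
payload = pickle.dumps(OrderedDict(), protocol=4)
found = list_pickle_imports(payload)
print(found)
```

Because `pickletools.genops` only decodes opcodes, nothing in the stream is imported or called while it is scanned.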