Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
dongyh
/
FANformer-1B
like
3
Text Generation
Transformers
Safetensors
allenai/dolma
English
hf_olmo
custom_code
arxiv:
2502.21309
arxiv:
2410.02675
License:
mit
Model card
Files
Files and versions
Community
Train
Use this model
55c82d2
FANformer-1B
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
dongyh
Upload 15 files
55c82d2
verified
28 days ago
.gitattributes
Safe
1.52 kB
initial commit
28 days ago
README.md
Safe
5.17 kB
add tokenizer
28 days ago
beam_search.py
Safe
46.6 kB
Upload 15 files
28 days ago
checkpoint.py
88.2 kB
Upload 15 files
28 days ago
config.json
1.5 kB
Upload 15 files
28 days ago
config.py
41.7 kB
Upload 15 files
28 days ago
configuration_olmo.py
2.07 kB
first commit
28 days ago
exceptions.py
Safe
838 Bytes
Upload 15 files
28 days ago
generation_config.json
Safe
115 Bytes
first commit
28 days ago
initialization.py
597 Bytes
Upload 15 files
28 days ago
model.py
81.7 kB
Upload 15 files
28 days ago
model.safetensors
4.91 GB
LFS
first commit
28 days ago
modeling_fan.py
11.2 kB
Upload 15 files
28 days ago
optim.py
47.1 kB
Upload 15 files
28 days ago
safetensors_util.py
Safe
2.45 kB
Upload 15 files
28 days ago
special_tokens_map.json
Safe
293 Bytes
add tokenizer
28 days ago
tokenizer.json
Safe
3.57 MB
add tokenizer
28 days ago
tokenizer_config.json
Safe
5.4 kB
add tokenizer
28 days ago
torch_util.py
4.75 kB
Upload 15 files
28 days ago
train.py
59.2 kB
Upload 15 files
28 days ago
util.py
33.6 kB
Upload 15 files
28 days ago
version.py
407 Bytes
Upload 15 files
28 days ago