Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
dongyh
/
FANformer-1B
like
2
Text Generation
Transformers
Safetensors
allenai/dolma
English
hf_olmo
custom_code
arxiv:
2502.21309
arxiv:
2410.02675
License:
mit
Model card
Files
Files and versions
Community
Train
Use this model
main
FANformer-1B
1 contributor
History:
17 commits
dongyh
Update README.md
ebd97cc
verified
7 days ago
.gitattributes
Safe
1.52 kB
initial commit
17 days ago
README.md
5.89 kB
Update README.md
7 days ago
__init__.py
314 Bytes
Upload 2 files
17 days ago
aliases.py
Safe
109 Bytes
Upload 2 files
17 days ago
beam_search.py
Safe
46.6 kB
Upload 15 files
17 days ago
checkpoint.py
88.2 kB
Upload 15 files
17 days ago
config.json
1.5 kB
Upload 15 files
17 days ago
config.py
41.7 kB
Upload 15 files
17 days ago
configuration_olmo.py
2.07 kB
first commit
17 days ago
exceptions.py
Safe
838 Bytes
Upload 15 files
17 days ago
generation_config.json
Safe
115 Bytes
first commit
17 days ago
initialization.py
597 Bytes
Upload 15 files
17 days ago
model.py
81.7 kB
Upload 15 files
17 days ago
model.safetensors
4.91 GB
LFS
first commit
17 days ago
modeling_fan.py
11.2 kB
Upload 15 files
17 days ago
optim.py
47.1 kB
Upload 15 files
17 days ago
safetensors_util.py
Safe
2.45 kB
Upload 15 files
17 days ago
special_tokens_map.json
Safe
293 Bytes
add tokenizer
17 days ago
tokenizer.json
Safe
3.57 MB
add tokenizer
17 days ago
tokenizer_config.json
Safe
5.4 kB
add tokenizer
17 days ago
torch_util.py
4.75 kB
Upload 15 files
17 days ago
train.py
59.2 kB
Upload 15 files
17 days ago
util.py
33.6 kB
Upload 15 files
17 days ago
version.py
407 Bytes
Upload 15 files
17 days ago