Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
dongyh
/
FANformer-1B
like
3
Text Generation
Transformers
Safetensors
allenai/dolma
English
hf_olmo
custom_code
arxiv:
2502.21309
arxiv:
2410.02675
License:
mit
Model card
Files
Files and versions
Community
1
Train
Use this model
main
FANformer-1B
Ctrl+K
Ctrl+K
1 contributor
History:
17 commits
dongyh
Update README.md
ebd97cc
verified
about 1 month ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 month ago
README.md
5.89 kB
Update README.md
about 1 month ago
__init__.py
Safe
314 Bytes
Upload 2 files
about 1 month ago
aliases.py
Safe
109 Bytes
Upload 2 files
about 1 month ago
beam_search.py
Safe
46.6 kB
Upload 15 files
about 1 month ago
checkpoint.py
Safe
88.2 kB
Upload 15 files
about 1 month ago
config.json
Safe
1.5 kB
Upload 15 files
about 1 month ago
config.py
Safe
41.7 kB
Upload 15 files
about 1 month ago
configuration_olmo.py
Safe
2.07 kB
first commit
about 1 month ago
exceptions.py
Safe
838 Bytes
Upload 15 files
about 1 month ago
generation_config.json
Safe
115 Bytes
first commit
about 1 month ago
initialization.py
Safe
597 Bytes
Upload 15 files
about 1 month ago
model.py
Safe
81.7 kB
Upload 15 files
about 1 month ago
model.safetensors
Safe
4.91 GB
LFS
first commit
about 1 month ago
modeling_fan.py
Safe
11.2 kB
Upload 15 files
about 1 month ago
optim.py
Safe
47.1 kB
Upload 15 files
about 1 month ago
safetensors_util.py
Safe
2.45 kB
Upload 15 files
about 1 month ago
special_tokens_map.json
Safe
293 Bytes
add tokenizer
about 1 month ago
tokenizer.json
Safe
3.57 MB
add tokenizer
about 1 month ago
tokenizer_config.json
Safe
5.4 kB
add tokenizer
about 1 month ago
torch_util.py
Safe
4.75 kB
Upload 15 files
about 1 month ago
train.py
Safe
59.2 kB
Upload 15 files
about 1 month ago
util.py
Safe
33.6 kB
Upload 15 files
about 1 month ago
version.py
Safe
407 Bytes
Upload 15 files
about 1 month ago