dongyh/FANformer-1B
Text Generation
Transformers
Safetensors
allenai/dolma
English
hf_olmo
custom_code
arxiv: 2502.21309
arxiv: 2410.02675
License: mit
1 contributor
History: 5 commits
dongyh: Upload 2 files (545d83f, verified), about 2 months ago
File                      Scan   Size            Last commit       Updated
.gitattributes            Safe   1.52 kB         initial commit    about 2 months ago
README.md                 Safe   5.17 kB         add tokenizer     about 2 months ago
__init__.py               Safe   314 Bytes       Upload 2 files    about 2 months ago
aliases.py                Safe   109 Bytes       Upload 2 files    about 2 months ago
beam_search.py            Safe   46.6 kB         Upload 15 files   about 2 months ago
checkpoint.py             Safe   88.2 kB         Upload 15 files   about 2 months ago
config.json               Safe   1.5 kB          Upload 15 files   about 2 months ago
config.py                 Safe   41.7 kB         Upload 15 files   about 2 months ago
configuration_olmo.py     Safe   2.07 kB         first commit      about 2 months ago
exceptions.py             Safe   838 Bytes       Upload 15 files   about 2 months ago
generation_config.json    Safe   115 Bytes       first commit      about 2 months ago
initialization.py         Safe   597 Bytes       Upload 15 files   about 2 months ago
model.py                  Safe   81.7 kB         Upload 15 files   about 2 months ago
model.safetensors         Safe   4.91 GB (LFS)   first commit      about 2 months ago
modeling_fan.py           Safe   11.2 kB         Upload 15 files   about 2 months ago
optim.py                  Safe   47.1 kB         Upload 15 files   about 2 months ago
safetensors_util.py       Safe   2.45 kB         Upload 15 files   about 2 months ago
special_tokens_map.json   Safe   293 Bytes       add tokenizer     about 2 months ago
tokenizer.json            Safe   3.57 MB         add tokenizer     about 2 months ago
tokenizer_config.json     Safe   5.4 kB          add tokenizer     about 2 months ago
torch_util.py             Safe   4.75 kB         Upload 15 files   about 2 months ago
train.py                  Safe   59.2 kB         Upload 15 files   about 2 months ago
util.py                   Safe   33.6 kB         Upload 15 files   about 2 months ago
version.py                Safe   407 Bytes       Upload 15 files   about 2 months ago