OpenNLPLab/TransNormerLLM2-3B-300B
Tags: Text Generation, Transformers, PyTorch, English, Chinese, TransNormerLLM, custom_code
arXiv: 2307.14995, 2210.10340
License: apache-2.0
Branch: main
1 contributor, History: 10 commits
Latest commit: OpenNLPLab, "Upgrade to lightning att2" (5ba41e2, verified), 11 months ago
| Name | Scan | Size | Last commit | Updated |
|---|---|---|---|---|
| images | | | Upload lightning-leopard.jpg | about 1 year ago |
| .gitattributes | Safe | 1.52 kB | initial commit | about 1 year ago |
| Community License for TransNormerLLM Model.pdf | Safe | 263 kB | Add license | about 1 year ago |
| README.md | Safe | 9.89 kB | Update README.md | about 1 year ago |
| TransNormerLLM模型社区许可协议.pdf | Safe | 294 kB | Add license | about 1 year ago |
| config.json | Safe | 926 Bytes | Fix 3B config error | 12 months ago |
| configuration_transnormer.py | Safe | 2.27 kB | Publish 3B2-300B | about 1 year ago |
| generation_config.json | Safe | 164 Bytes | Publish 3B2-300B | about 1 year ago |
| lightning_attention.py | Safe | 15.3 kB | Publish 3B2-300B | about 1 year ago |
| lightning_attention2.py | Safe | 15.3 kB | Upgrade to lightning att2 | 11 months ago |
| modeling_transnormer.py | Safe | 34.6 kB | Upgrade to lightning att2 | 11 months ago |
| norm.py | Safe | 1.27 kB | Publish 3B2-300B | about 1 year ago |
| pytorch_model-00001-of-00003.bin | Safe (pickle) | 1.97 GB (LFS) | Publish 3B2-300B | about 1 year ago |
| pytorch_model-00002-of-00003.bin | Safe (pickle) | 1.97 GB (LFS) | Publish 3B2-300B | about 1 year ago |
| pytorch_model-00003-of-00003.bin | Safe (pickle) | 1.88 GB (LFS) | Publish 3B2-300B | about 1 year ago |
| pytorch_model.bin.index.json | Safe | 13.8 kB | Publish 3B2-300B | about 1 year ago |
| special_tokens_map.json | Safe | 410 Bytes | Publish 3B2-300B | about 1 year ago |
| srmsnorm_triton.py | Safe | 5.76 kB | Publish 3B2-300B | about 1 year ago |
| tokenization_baichuan.py | Safe | 9.57 kB | Publish 3B2-300B | about 1 year ago |
| tokenizer.model | Safe | 1.14 MB (LFS) | Publish 3B2-300B | about 1 year ago |
| tokenizer_config.json | Safe | 819 Bytes | Publish 3B2-300B | about 1 year ago |
| utils.py | Safe | 4.39 kB | Publish 3B2-300B | about 1 year ago |

Detected pickle imports (3) in each of the three pytorch_model-*.bin shards: "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.BFloat16Storage"