ltg/norbert4-large
Language Technology Group (University of Oslo)
PyTorch
Safetensors
4 languages
BERT
GPT-BERT
NorBERT
Norwegian
encoder
decoder
custom_code
License: apache-2.0
main
norbert4-large
Commit History
make FlashAttention logic more robust
6f3b747
verified
davda54
committed on
15 days ago
Upload model.safetensors with huggingface_hub
dea935c
verified
davda54
committed on
25 days ago
fix
fcc9a7d
verified
davda54
committed on
Jul 11
Update config.json
0fd79f5
verified
davda54
committed on
Jun 26
removed SDPA
d30e2d1
verified
davda54
committed on
Jun 17
Update modeling_gptbert.py
232b117
verified
davda54
committed on
Jun 17
fixed SDPA for older PyTorch versions
85a2222
verified
davda54
committed on
Jun 17
Delete model.safetensors
9bfd509
verified
davda54
committed on
Jun 10
Adding `safetensors` variant of this model (#12)
ded2911
verified
davda54
committed on
Jun 10
Upload model_performance.png
80408d4
verified
davda54
committed on
Jun 10
Update README.md
eb060da
verified
davda54
committed on
Jun 10
Update config.json
04f813f
verified
davda54
committed on
Jun 10
fix
e419f0f
verified
davda54
committed on
Jun 5
Update modeling_gptbert.py
8b27574
verified
davda54
committed on
Jun 5
Update modeling_gptbert.py
042ceea
verified
davda54
committed on
Jun 5
fix
30fd0e7
verified
davda54
committed on
Jun 5
fix
39265fc
verified
davda54
committed on
Jun 5
fix
87e0acb
verified
davda54
committed on
Jun 5
fix attention mask
16b2b5e
verified
davda54
committed on
Jun 5
fix
6cbbc37
verified
davda54
committed on
Jun 5
fix
37fc2ed
verified
davda54
committed on
Jun 5
Update modeling_gptbert.py
60e56bf
verified
davda54
committed on
Jun 5
Update configuration_gptbert.py
ecb7a88
verified
davda54
committed on
Jun 5
Update modeling_gptbert.py
7741265
verified
davda54
committed on
Jun 5
Update config.json
4ba37ea
verified
davda54
committed on
Jun 5
Update config.json
5b29864
verified
davda54
committed on
Jun 5
fixes and optimizations
3bd5fb5
verified
davda54
committed on
Jun 5
Fix import error
e071182
verified
davda54
committed on
Jun 5
Update modeling_gptbert.py
268e841
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
2d600a4
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
9e04fd3
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
45e963e
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
91e4060
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
a36b0d0
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
4abdbe5
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
f5214f6
verified
lgcharpe
committed on
Jun 4
Checking for errors in the CastedLinear
3e3d609
verified
lgcharpe
committed on
Jun 4
Update modeling_gptbert.py
73e5534
verified
lgcharpe
committed on
Jun 4
More testing
74cf691
verified
lgcharpe
committed on
Jun 4
Add checking for NaN
64bbd9e
verified
lgcharpe
committed on
Jun 4
Removing added print
757ab94
verified
lgcharpe
committed on
Jun 4
Testing output
98c5a3b
verified
lgcharpe
committed on
Jun 4
fix FlashAttention
64f341a
verified
davda54
committed on
Jun 3
test
c96534d
verified
davda54
committed on
Jun 3
test
6a98774
verified
davda54
committed on
Jun 3
fix attention masks
e228931
verified
davda54
committed on
Jun 3
fix import error
0fd4c6d
verified
davda54
committed on
Jun 3
Fix error
7ee002c
verified
davda54
committed on
Jun 3
Add FlashAttention + unpadding support
4d4d26d
verified
davda54
committed on
May 28
Update modeling_gptbert.py
28e46d7
verified
lgcharpe
committed on
May 27