GLM-4-32B-Base-32K is an enhanced version of [THUDM's GLM-4-32B-Base-0414](https://huggingface.co/THUDM/GLM-4-32B-Base-0414).
This model was developed as a proof-of-concept to validate that a merging-centric approach to context extension can be successfully applied to larger-scale models. The techniques employed resulted in an approximate 5% overall improvement on standard base model benchmarks while significantly improving 32k recall.
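The card does not publish the actual merge recipe. Purely as an illustration of what a merging-centric context-extension setup can look like, here is a sketch of a mergekit SLERP configuration; the donor model name, layer range, and interpolation factor below are placeholders, not the recipe used for this model:

```yaml
# Hypothetical mergekit SLERP recipe -- NOT the published recipe for this model.
# The donor model, layer_range, and t value are placeholders for illustration.
merge_method: slerp
base_model: THUDM/GLM-4-32B-Base-0414
slices:
  - sources:
      - model: THUDM/GLM-4-32B-Base-0414
        layer_range: [0, 64]   # adjust to the model's actual layer count
      - model: example-org/glm-4-32b-long-context-donor   # hypothetical long-context donor
        layer_range: [0, 64]
parameters:
  t: 0.5          # interpolation factor between base and donor
dtype: bfloat16
```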
More details can be found in our blog post [here](https://www.arcee.ai/blog/extending-afm-4-5b-to-64k-context-length), where we applied this work to our upcoming AFM 4.5B.
## Model Details
- Architecture Base: [THUDM/GLM-4-32B-Base-0414](https://huggingface.co/THUDM/GLM-4-32B-Base-0414)
- Parameter Count: 32B