Commit 27534c0 by prithivMLmods (verified) · 1 Parent(s): cb3e849

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -105,6 +105,6 @@ print(response)
 
 ## References
 
-1. Saxton, D., Grefenstette, E., Hill, F., & Blunsom, P. (2019). Analysing Mathematical Reasoning Abilities of Neural Models. arXiv:1904.01557. [https://arxiv.org/pdf/1904.01557](https://arxiv.org/pdf/1904.01557)
+1. Analysing Mathematical Reasoning Abilities of Neural Models. arXiv:1904.01557. [https://arxiv.org/pdf/1904.01557](https://arxiv.org/pdf/1904.01557)
 
-2. Chen, X., Zheng, S., & Liu, Z. (2023). YaRN: Efficient Context Window Extension of Large Language Models. arXiv:2309.00071. [https://arxiv.org/pdf/2309.00071](https://arxiv.org/pdf/2309.00071)
+2. YaRN: Efficient Context Window Extension of Large Language Models. arXiv:2309.00071. [https://arxiv.org/pdf/2309.00071](https://arxiv.org/pdf/2309.00071)