Rombos-LLM-V2.5-Qwen-32b 4.0 BPW exl2
4 bpw quant of https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b
Scores 63.2 on Aider benchmarks!
Rombos-LLM-V2.5-Qwen-32b
Rombos-LLM-V2.5-Qwen-32b is a continues finetuned version of Qwen2.5-32B. I noticed recently that the Qwen team did not learn from my methods of continuous finetuning, the great benefits, and no downsides of it. So I took it upon myself to merge the instruct model with the base model myself using the Ties merge method
This version of the model shows higher performance than the original instruct and base models.
Quants: (Coming soon)
GGUF: https://huggingface.co/bartowski/Replete-LLM-V2.5-Qwen-32b-GGUF
EXL2:
Benchmarks: (Coming soon)
- Downloads last month
- 4
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.