SpectraSuite's collections (6)

TriLMs-Unpacked
TriLMs unpacked to FP16, compatible with any implementation that supports the LLaMA architecture in Hugging Face's Transformers format.
QuantLMs-8Bit-Unpacked
QuantLMs unpacked to FP16, compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMA.
QuantLMs-6Bit-Unpacked
QuantLMs unpacked to FP16, compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMA.
QuantLMs-4Bit-Unpacked
QuantLMs unpacked to FP16, compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMA.
QuantLMs-3Bit-Unpacked
QuantLMs unpacked to FP16, compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMA.
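
Because the unpacked checkpoints are described as LLaMA-compatible FP16 models, loading one should need nothing beyond the generic Transformers API. The following is a minimal sketch under that assumption, not an official recipe: `looks_like_llama` and `load_unpacked` are hypothetical helper names, and the check on `model_type == "llama"` assumes the checkpoints ship a standard LLaMA config.

```python
def looks_like_llama(config: dict) -> bool:
    """Return True if a Transformers config dict declares the LLaMA model type."""
    # Assumption: the unpacked checkpoints use the standard "llama" model_type.
    return config.get("model_type") == "llama"


def load_unpacked(repo_id: str):
    """Load an unpacked checkpoint in FP16 through the generic Auto classes."""
    # Imported lazily so the config helper above works without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # Request half precision, matching the FP16 format the collections describe.
    model = AutoModelForCausalLM.from_pretrained(repo_id, torch_dtype=torch.float16)
    return tokenizer, model
```

To use it, pass the id of a model taken from one of the collections above to `load_unpacked`, then confirm the architecture with `looks_like_llama(model.config.to_dict())`. No model id is given here because the listing names only the collections, not the individual repositories.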