25 3 20

NIkita Balakin

Kotokin

AI & ML interests

None yet

Recent Activity

upvoted a collection 13 days ago

TheDrummer_Valkyrie-49B-v1 - EXL3

liked a model 2 months ago

Tarek07/Legion-V2.1-LLaMa-70B

reacted to csabakecskemeti's post with 👍 3 months ago

-UPDATED- 4bit inference is working! The blogpost is updated with code snippet and requirements.txt https://devquasar.com/uncategorized/all-about-amd-and-rocm/ -UPDATED- I've played around with an MI100 and ROCm and collected my experience in a blogpost: https://devquasar.com/uncategorized/all-about-amd-and-rocm/ Unfortunately I've could not make inference or training work with model loaded in 8bit or use BnB, but did everything else and documented my findings.

View all activity

Organizations

None yet

Kotokin's activity

upvoted a collection 13 days ago

TheDrummer_Valkyrie-49B-v1 - EXL3

Collection

Quants by ArtusDev • 16 items • Updated 12 days ago • 2

liked a model 2 months ago

Tarek07/Legion-V2.1-LLaMa-70B

Text Generation • Updated 15 days ago • 1.24k • • 20

reacted to csabakecskemeti's post with 👍 3 months ago

Post

1978

-UPDATED-
4bit inference is working! The blogpost is updated with code snippet and requirements.txt
https://devquasar.com/uncategorized/all-about-amd-and-rocm/
-UPDATED-
I've played around with an MI100 and ROCm and collected my experience in a blogpost:
https://devquasar.com/uncategorized/all-about-amd-and-rocm/
Unfortunately I've could not make inference or training work with model loaded in 8bit or use BnB, but did everything else and documented my findings.

4 replies

replied to csabakecskemeti's post 3 months ago

How many tokens per second do you receive? I didn't see this on the blog.

liked a model 5 months ago

Konnect1221/The-Inception-Presets-Methception-LLamaception-Qwenception

Updated Feb 3 • 110

updated a model 5 months ago

Kotokin/EVA-UNIT-01_EVA-LLaMA-3.33-70B-v0.1-exl2-5bpw

Text Generation • Updated Dec 21, 2024 • 10

New activity in ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3 6 months ago

The question of training the model.

#2 opened 6 months ago by

Kotokin

New activity in MikeRoz/mistralai_Mistral-Large-Instruct-2411-3.0bpw-h6-exl2 6 months ago

Request for a 3.5 bpw

#1 opened 6 months ago by

Kotokin

liked a model 6 months ago

MikeRoz/mistralai_Mistral-Large-Instruct-2411-4.0bpw-h6-exl2

Updated Nov 19, 2024 • 10 • 3

New activity in mistralai/Mistral-Large-Instruct-2411 6 months ago

Where config.json?

#1 opened 6 months ago by

TheDrummer

liked a model 7 months ago

TheDrummer/Behemoth-123B-v1.1

Updated Oct 26, 2024 • 97 • 23

reacted to BlinkDL's post with 👀 8 months ago

Post

5665

RWKV-7 "Goose" preview rc2 => Peak RNN architecture?😃Will try to squeeze more performance for the final release. Preview code & model: https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v7