LB

lbathen
ยท

AI & ML interests

None yet

Recent Activity

Organizations

IBM Granite's profile picture

lbathen's activity

New activity in mgoin/Nemotron-4-340B-Instruct-hf 8 months ago

Reward model also possible?

1
#1 opened 9 months ago by
noamgat
New activity in nvidia/Nemotron-4-340B-Base 9 months ago

Hf safetensors version

9
#3 opened 11 months ago by
ehartford
New activity in nvidia/Nemotron-4-340B-Reward 9 months ago

Convertion to HF

3
#7 opened 9 months ago by
lbathen
New activity in mistralai/Mistral-Nemo-Instruct-2407 9 months ago

NAN when training

1
#29 opened 9 months ago by
nthehai01
New activity in nvidia/Mistral-NeMo-12B-Instruct 9 months ago
New activity in nvidia/Nemotron-4-340B-Reward 9 months ago
New activity in mistralai/Mistral-Nemo-Instruct-2407 9 months ago

NeMo Format

2
#9 opened 9 months ago by
lbathen
New activity in mistralai/Mixtral-8x22B-v0.1 10 months ago

Use V1 tokenizer instead

7
#10 opened 10 months ago by
Rocketknight1

vocab size mismatch

4
#9 opened 10 months ago by
mradermacher