Mayank Mishra

mayank-mishra

AI & ML interests

Large Language Models, Distributed Training and Inference

Organizations

IBM, BigCode, Ontocord's M*DEL, Blog-explorers, Aurora-M, IBM Granite

mayank-mishra's activity

New activity in ibm-granite/granite-3.1-8b-instruct 6 days ago
New activity in ibm-granite/granite-3.0-2b-instruct 2 months ago

add base model metadata

#3 opened 2 months ago by
davanstrien
New activity in ibm-granite/granite-3.0-8b-instruct 2 months ago

add base model metadata

#5 opened 2 months ago by
davanstrien
New activity in ibm-granite/granite-3.0-1b-a400m-instruct 2 months ago

Add base model metadata

#2 opened 2 months ago by
davanstrien
New activity in ibm/PowerMoE-3b 3 months ago
New activity in cfahlgren1/model-release-heatmap 5 months ago

Add IBM

3
#5 opened 5 months ago by
mayank-mishra
New activity in ibm-granite/granite-8b-code-instruct-128k 5 months ago

Fix: link to 128k paper

1
#1 opened 5 months ago by
timrbula
New activity in meta-llama/Llama-3.1-405B 5 months ago

405B or 410B ?

2
#8 opened 5 months ago by
alielfilali01
New activity in ibm-granite/granite-8b-code-instruct-4k 7 months ago

Input context length

3
#6 opened 7 months ago by
dyoung

Official quants?

3
#2 opened 8 months ago by
joshuaturner
New activity in ibm-granite/granite-3b-code-base-2k 7 months ago

Release GGUF models?

3
#5 opened 8 months ago by
CosmicSound
New activity in ibm-granite/granite-3b-code-base-2k 7 months ago

Licensing

6
#4 opened 8 months ago by
tonylek