Shail Shah
shail-2512
LLMs
Coder
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • 80.1k downloads • 1.91k likes
- Qwen/Qwen2.5-Coder-7B-Instruct
  Text Generation • 8B • 186k downloads • 520 likes
- unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF
  33B • 3.58k downloads • 70 likes
- deepseek-ai/DeepSeek-Coder-V2-Instruct
  Text Generation • 236B • 15.7k downloads • 647 likes
Image Generation
3D
Speech Recognition
- nvidia/canary-1b
  Automatic Speech Recognition • 17.7k downloads • 439 likes
- facebook/seamless-m4t-v2-large
  Automatic Speech Recognition • 2B • 47.2k downloads • 868 likes
- nyrahealth/CrisperWhisper
  Automatic Speech Recognition • 2B • 42.7k downloads • 303 likes
- openai/whisper-large-v3-turbo
  Automatic Speech Recognition • 0.8B • 3.22M downloads • 2.52k likes
Reranking Models
ALMs (Audio Language Models)
TTS
Reasoning (LRMs)
VLMs
- HuggingFaceTB/SmolVLM-Instruct
  Image-Text-to-Text • 2B • 97.3k downloads • 526 likes
- microsoft/OmniParser
  Image-Text-to-Text • 1.21k downloads • 1.68k likes
- vidore/colsmolvlm-v0.1
  Visual Document Retrieval • 1.4k downloads • 52 likes
- meta-llama/Llama-3.2-11B-Vision-Instruct
  Image-Text-to-Text • 11B • 371k downloads • 1.49k likes
Video Generation
Dataset to fine-tune Embeddings
Embedding Models
MultiModal (Any-to-Any)