Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM
Apps with no match
Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers
Inference Providers with no match
Fireworks
Novita
Nebius AI
Together AI
Cerebras
Featherless AI
Nscale
fal
Hyperbolic
SambaNova
Groq
Replicate
Cohere
HF Inference API
Misc
Reset Misc
Inference Endpoints
custom_code
audio-text-to-text
text-generation-inference
4-bit precision
Eval Results
Misc with no match
Merge
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
100
Full-text search
Edit filters
Sort: Trending
Active filters:
audio-text-to-text
Clear all
google/gemma-3n-E4B-it
Image-Text-to-Text
•
Updated
about 6 hours ago
•
5.55k
•
157
google/gemma-3n-E2B-it
Image-Text-to-Text
•
Updated
about 6 hours ago
•
3.48k
•
52
google/gemma-3n-E4B
Image-Text-to-Text
•
Updated
1 day ago
•
433
•
29
google/gemma-3n-E2B
Image-Text-to-Text
•
Updated
1 day ago
•
578
•
20
onnx-community/gemma-3n-E2B-it-ONNX
Image-Text-to-Text
•
Updated
about 23 hours ago
•
36
•
8
amaai-lab/SonicVerse
Audio-Text-to-Text
•
Updated
8 days ago
•
288
•
12
mlx-community/gemma-3n-E4B-bf16
Image-Text-to-Text
•
Updated
about 20 hours ago
•
271
•
4
nvidia/audio-flamingo-2
Audio-Text-to-Text
•
Updated
1 day ago
•
37
stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text
•
Updated
Feb 17
•
115
•
445
mlx-community/gemma-3n-E2B-bf16
Image-Text-to-Text
•
Updated
about 20 hours ago
•
350
•
2
Qwen/Qwen2-Audio-7B
Audio-Text-to-Text
•
Updated
Nov 20, 2024
•
79.5k
•
124
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
•
Updated
Jan 12
•
112k
•
461
fixie-ai/ultravox-v0_4
Audio-Text-to-Text
•
Updated
May 6
•
1.65k
•
49
NexaAIDev/OmniAudio-2.6B
Audio-Text-to-Text
•
Updated
Dec 13, 2024
•
431
•
268
fixie-ai/ultravox-v0_5-llama-3_1-8b
Audio-Text-to-Text
•
Updated
May 6
•
16k
•
26
fixie-ai/ultravox-v0_5-llama-3_2-1b
Audio-Text-to-Text
•
Updated
May 6
•
167k
•
44
nvidia/audio-flamingo-2-0.5B
Audio-Text-to-Text
•
Updated
1 day ago
•
8
mlx-community/gemma-3n-E4B-it-bf16
Image-Text-to-Text
•
Updated
about 20 hours ago
•
356
•
1
muranAI/gemma-3n-e4b-it-4bit
Image-Text-to-Text
•
Updated
about 6 hours ago
•
1
muranAI/gemma-3n-e4b-it-fp16
Image-Text-to-Text
•
Updated
about 5 hours ago
•
1
Qwen/Qwen-Audio
Text Generation
•
Updated
Nov 20, 2024
•
5.05k
•
122
Qwen/Qwen-Audio-Chat
Text Generation
•
Updated
Jan 12
•
3.88k
•
88
fixie-ai/ultravox-v0_2
Audio-Text-to-Text
•
Updated
May 6
•
335
•
51
fixie-ai/ultravox-v0_3
Audio-Text-to-Text
•
Updated
May 6
•
622
•
17
sarvamai/shuka-1
Audio-Text-to-Text
•
Updated
Mar 14
•
1.29k
•
67
Menlo/Ichigo-llama3.1-s-base-v0.3
Audio-Text-to-Text
•
Updated
Nov 19, 2024
•
80
•
4
fixie-ai/ultravox-v0_4-llama-3_1-70b
Audio-Text-to-Text
•
Updated
May 6
•
46
•
4
Menlo/Ichigo-llama3.1-s-instruct-v0.3-phase-2
Audio-Text-to-Text
•
Updated
Nov 19, 2024
•
17
•
5
Menlo/Ichigo-llama3.1-s-instruct-v0.3-phase-3
Audio-Text-to-Text
•
Updated
Nov 19, 2024
•
22
•
35
Menlo/mini-Ichigo-llama3.2-3B-s-base
Audio-Text-to-Text
•
Updated
Nov 19, 2024
•
17
•
1
Previous
1
2
3
4
Next