Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN-7B
like
33
Automatic Speech Recognition
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
License:
apache-2.0
Model card
Files
Files and versions
Community
main
SALMONN-7B
/
resource
/
response_demo
2 contributors
History:
1 commit
tangchangli
chore: init repo
7cf7820
10 months ago
aac.png
13 kB
chore: init repo
10 months ago
aed.png
18.6 kB
chore: init repo
10 months ago
asr.png
13.8 kB
chore: init repo
10 months ago
emo.png
11.4 kB
chore: init repo
10 months ago
jsac.png
21 kB
chore: init repo
10 months ago
lyrics.png
40.7 kB
chore: init repo
10 months ago
mc.png
28.8 kB
chore: init repo
10 months ago
memo.png
32.3 kB
chore: init repo
10 months ago
pr.png
14.8 kB
chore: init repo
10 months ago
sac.png
29.1 kB
chore: init repo
10 months ago
sq.png
22.5 kB
chore: init repo
10 months ago
sr.png
15.9 kB
chore: init repo
10 months ago
story.png
71.1 kB
chore: init repo
10 months ago
title.png
27.3 kB
chore: init repo
10 months ago