Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Qwen
/
Qwen2-Audio-7B-Instruct

Audio-Text-to-Text
Transformers
Safetensors
English
qwen2_audio
text2text-generation
chat
audio
Model card Files Files and versions
xet
Community
21
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

how to make qwen2-audio outptu asr result with timestamp ?

#20 opened 9 days ago by
vBaiCai

Compared to using audio to text and the qwen2 -7b model, does this model have any unique advantages?

1
#18 opened 2 months ago by
yinjun113

fine-tune this model, how to construct labels

1
#17 opened 3 months ago by
unsofhiest

Prueba

2
#15 opened 3 months ago by
Devin2310

Store chat template in its own file

1
#13 opened 4 months ago by
RaushanTurganbay

tuning on other languages

1
#12 opened 5 months ago by
makeAmericaGreatAgain

Move input_ids to cuda device

1
#9 opened 7 months ago by
freddyaboulton

Does the Qwen2-Audio-7B-Instruct model support Function Call?

#8 opened 7 months ago by
Berlin906

What's the maximum size of audio file?

1
#7 opened 8 months ago by
gaoxt1983

When will be able to provide a 4, 8bit quantized version?

1
#5 opened 9 months ago by
fukai

TTS support?

5
#4 opened 10 months ago by
yukiarimo

Context size or maximum length of the audio

2
#3 opened 10 months ago by
esab

🍭 Best Practice for Fine-Tuning of Qwen2-Audio

#2 opened 10 months ago by
study-hjt
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs