Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

meta-llama
/
Llama-3.2-90B-Vision-Instruct

Image-Text-to-Text
Transformers
Safetensors
PyTorch
mllama
facebook
meta
llama
llama-3
conversational
text-generation-inference
Model card Files Files and versions Community
32
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

🔥🔥🔥中文测评视频

#32 opened about 1 month ago by
leo009

Issues on noisy images

#30 opened 3 months ago by
elenapop

ValueError: Cross attention layer can't find neither `cross_attn_states` nor cached values for key/values!

1
#29 opened 4 months ago by
jhn9803

How to use model across multiple GPUs

1
#28 opened 5 months ago by
aswad546

The model does not support having a different number of images per batch?

1
#27 opened 5 months ago by
h1manshu

🚩 Report

#25 opened 6 months ago by
weizhengsuper

Request: DOI

1
#24 opened 7 months ago by
Madhuu77

"Your request to access this repo has been rejected by the repo's authors."

14
1
#19 opened 8 months ago by
Loie

Fine-tune Llama Vision models with TRL 🚀

6
2
#18 opened 8 months ago by
lewtun

Extracting language model only

3
#17 opened 8 months ago by
mariboo

Add widget examples

1
#16 opened 8 months ago by
mishig

Training Data

2
#15 opened 8 months ago by
JohnnieB
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs