Contributed by:

  • Shashwath P
  • Shashank Ashok
  • Akilan Yohendiran

Total downloads all time - 2106

Model Card for Model ID

The following model is an experimental fine tuned model of the IDEFIC 9B version, for medical Visual Question Answering. It uses a dataset combined from SLAKE and VQARAD. Check the following repository for the notebooks of training,merging and inference. https://github.com/Shashwathp/Idefic_medical_vqa

Model Description

This is the model card of a 馃 transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: [@Shashwath01,@Akill19,@Shashank91097 ]
  • Model type: [Multimodal, Visual Question Answering]
  • Language(s) (NLP): [English]
  • License: [Apache - 2.0]
  • Finetuned from model [optional]: [IDEFIC 9B]

Dataset

https://huggingface.co/datasets/Shashwath01/VQARAD_SLAKE

Model Sources

How to Get Started with the Model

Check the below link to get started with inferencing. https://github.com/Shashwathp/Idefic_medical_vqa/blob/main/inference.ipynb

Downloads last month
53
Safetensors
Model size
4.73B params
Tensor type
F32
FP16
U8
Inference API
Inference API (serverless) does not yet support transformers models for this pipeline type.

Spaces using Shashwath01/Idefic_medical_VQA_merged_4bit 3