Update README.md
README.md
CHANGED
@@ -29,6 +29,8 @@ co2_eq_emissions:

This model is a multimodal classifier that combines text and image inputs to detect potential bias in content. It uses a BERT-based text encoder and a ResNet-34 image encoder, which are fused for classification. A contrastive learning approach was used during training, leveraging CLIP embeddings as guidance to align the text and image representations.

+This model is based on [FND-CLIP](https://arxiv.org/pdf/2205.14304), proposed by Zhou et al. (2022).
+
## Model Details

- **Text Encoder**: BERT (`bert-base-uncased`)
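As a point of orientation for the architecture described in this hunk, here is a minimal PyTorch sketch of a BERT + ResNet-34 fusion classifier of that kind. The class name, projection sizes, and classifier head are illustrative assumptions rather than the repository's actual code, and the CLIP-guided contrastive objective used during training is left out:

```python
# Illustrative sketch only: names and dimensions are assumptions, not the repo's code.
import torch
import torch.nn as nn
from torchvision.models import resnet34
from transformers import BertModel

class TextImageFusionClassifier(nn.Module):
    def __init__(self, num_classes: int = 2, fusion_dim: int = 256):
        super().__init__()
        # Text branch: BERT, as named in "Model Details".
        self.text_encoder = BertModel.from_pretrained("bert-base-uncased")
        # Image branch: ResNet-34 with its ImageNet classification head removed.
        backbone = resnet34(weights=None)
        self.image_encoder = nn.Sequential(*list(backbone.children())[:-1])
        # Project both modalities to a shared size before fusing.
        self.text_proj = nn.Linear(768, fusion_dim)
        self.image_proj = nn.Linear(512, fusion_dim)
        self.classifier = nn.Linear(2 * fusion_dim, num_classes)

    def forward(self, input_ids, attention_mask, pixel_values):
        text_feat = self.text_encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).pooler_output                                          # (B, 768)
        img_feat = self.image_encoder(pixel_values).flatten(1)   # (B, 512)
        fused = torch.cat(
            [self.text_proj(text_feat), self.image_proj(img_feat)], dim=-1
        )                                                        # (B, 2 * fusion_dim)
        return self.classifier(fused)                            # (B, num_classes)
```

Plain concatenation keeps the fusion step simple here; the repository's model may combine the two branches differently.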
@@ -152,6 +154,7 @@ def load_model():
    return model
```

+## How to Run the Model

```python
import torch
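The run snippet is cut off by the diff context after `import torch`. As a rough idea of the inference flow a model like this needs, here is a usage sketch that pairs with the fusion-classifier sketch earlier; the tokenizer choice, image preprocessing, and example inputs are assumptions, not the card's own example:

```python
# Usage sketch paired with the TextImageFusionClassifier sketch above.
# Tokenizer, preprocessing, and the example inputs are assumptions, not the card's code.
import torch
from PIL import Image
from torchvision import transforms
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

model = TextImageFusionClassifier()
model.eval()

# One text/image pair to screen for potential bias.
encoded = tokenizer(
    "Example headline to screen for bias",
    return_tensors="pt", padding=True, truncation=True,
)
pixels = preprocess(Image.open("example.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    logits = model(encoded["input_ids"], encoded["attention_mask"], pixels)
print(logits.softmax(dim=-1))
```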