Improve CoMPaSS-FLUX.1 Model Card: Add pipeline_tag, library_name, and format links
Browse filesThis PR enhances the model card for CoMPaSS-FLUX.1 by:
- **Adding `pipeline_tag: text-to-image` and `library_name: diffusers`** to the metadata. This ensures the model is correctly categorized on the Hugging Face Hub and enables the automated "Use in Diffusers" widget, as the model explicitly states "Framework: Diffusers" in its description.
- **Improving the visibility of key links** (Project Page, Code, arXiv paper) by moving them to prominent inline links at the top of the model card.
- **Integrating the paper title** "CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models" into the main heading and the model description for better context.
- **Removing a redundant heading** and updating the "Using the Model" link for overall clarity and consistency.
These changes will make the model more discoverable and easier to use for the community.
@@ -1,4 +1,10 @@
|
|
1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
2 |
tags:
|
3 |
- text-to-image
|
4 |
- lora
|
@@ -14,26 +20,18 @@ widget:
|
|
14 |
- text: a photo of a horse to the left of a bottle
|
15 |
output:
|
16 |
url: images/horse-left-bottle.jpg
|
17 |
-
base_model: black-forest-labs/FLUX.1-dev
|
18 |
-
instance_prompt: null
|
19 |
-
license: other
|
20 |
-
license_name: compass-lora-weights-nc-license
|
21 |
-
license_link: LICENSE
|
22 |
---
|
23 |
-
# CoMPaSS-FLUX.1
|
24 |
|
25 |
-
|
26 |
-
\[[code]\]
|
27 |
-
\[[arXiv]\]
|
28 |
|
29 |
-
|
30 |
|
31 |
-
|
32 |
|
33 |
-
|
34 |
|
35 |
A LoRA adapter that enhances spatial understanding capabilities of the FLUX.1 text-to-image
|
36 |
-
diffusion model. This model demonstrates significant improvements in generating images with specific
|
37 |
spatial relationships between objects.
|
38 |
|
39 |
## Model Details
|
@@ -62,7 +60,7 @@ spatial relationships between objects.
|
|
62 |
|
63 |
## Using the Model
|
64 |
|
65 |
-
See our [GitHub repository]
|
66 |
|
67 |
### Effective Prompting
|
68 |
|
@@ -124,7 +122,4 @@ Weights for this model are available in Safetensors format.
|
|
124 |
|
125 |
[Download](/blurgy/CoMPaSS-FLUX.1/tree/main) them in the Files & versions tab.
|
126 |
|
127 |
-
[./LICENSE]: <./LICENSE>
|
128 |
-
[Project page]: <https://compass.blurgy.xyz>
|
129 |
-
[code]: <https://github.com/blurgyy/CoMPaSS>
|
130 |
-
[arXiv]: <https://arxiv.org/abs/2412.13195>
|
|
|
1 |
---
|
2 |
+
base_model: black-forest-labs/FLUX.1-dev
|
3 |
+
license: other
|
4 |
+
license_name: compass-lora-weights-nc-license
|
5 |
+
license_link: LICENSE
|
6 |
+
pipeline_tag: text-to-image
|
7 |
+
library_name: diffusers
|
8 |
tags:
|
9 |
- text-to-image
|
10 |
- lora
|
|
|
20 |
- text: a photo of a horse to the left of a bottle
|
21 |
output:
|
22 |
url: images/horse-left-bottle.jpg
|
|
|
|
|
|
|
|
|
|
|
23 |
---
|
|
|
24 |
|
25 |
+
# CoMPaSS-FLUX.1: Enhancing Spatial Understanding in Text-to-Image Diffusion Models
|
|
|
|
|
26 |
|
27 |
+
[Project Page](https://compass.blurgy.xyz) | [Code](https://github.com/blurgyy/CoMPaSS) | [arXiv](https://arxiv.org/abs/2412.13195)
|
28 |
|
29 |
+
<Gallery />
|
30 |
|
31 |
+
## Model description
|
32 |
|
33 |
A LoRA adapter that enhances spatial understanding capabilities of the FLUX.1 text-to-image
|
34 |
+
diffusion model. This model, presented in [CoMPaSS: Enhancing Spatial Understanding in Text-to-Image Diffusion Models](https://arxiv.org/abs/2412.13195), demonstrates significant improvements in generating images with specific
|
35 |
spatial relationships between objects.
|
36 |
|
37 |
## Model Details
|
|
|
60 |
|
61 |
## Using the Model
|
62 |
|
63 |
+
See our [GitHub repository](https://github.com/blurgyy/CoMPaSS) to get started.
|
64 |
|
65 |
### Effective Prompting
|
66 |
|
|
|
122 |
|
123 |
[Download](/blurgy/CoMPaSS-FLUX.1/tree/main) them in the Files & versions tab.
|
124 |
|
125 |
+
[./LICENSE]: <./LICENSE>
|
|
|
|
|
|