liyangbing commited on
Commit
627e508
·
verified ·
1 Parent(s): ba7328c
Files changed (1) hide show
  1. index.html +181 -27
index.html CHANGED
@@ -3,23 +3,22 @@
3
  <head>
4
  <meta charset="utf-8" />
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
6
- <title>QwenImage - Advanced Text-to-Image Generation by Alibaba Tongyi Lab</title>
7
- <meta name="description" content="QwenImage: Open and Advanced Text-to-Image Generative Model by Alibaba Tongyi Lab. Create stunning images from text prompts with high-quality rendering, artistic style control, and exceptional detail." />
8
- <meta name="keywords" content="QwenImage, Alibaba, Tongyi Lab, Text-to-Image, AI Models, Prompt Engineering, WaveSpeedAI, Image Generation, AI Art, Generative AI, Image Synthesis" />
9
 
10
  <!-- Open Graph / Social Media Meta Tags -->
11
- <meta property="og:title" content="QwenImage - Advanced Text-to-Image Generation by Alibaba Tongyi Lab" />
12
- <meta property="og:description" content="Transform your text into stunning images with QwenImage, the advanced text-to-image model developed by Alibaba Tongyi Lab" />
13
  <meta property="og:type" content="website" />
14
- <meta property="og:url" content="https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image" />
15
 
16
  <!-- Additional Meta Information -->
17
- <meta name="author" content="Alibaba Tongyi Lab" />
18
  <meta name="robots" content="index, follow" />
19
- <link rel="canonical" href="https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image" />
20
 
21
  <link rel="stylesheet" href="style.css" />
22
- <meta http-equiv="refresh" content="30;url=https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image" />
23
  </head>
24
  <body>
25
  <nav class="top-nav">
@@ -37,40 +36,195 @@
37
  <div class="container">
38
  <div class="content">
39
  <div class="logo-section">
40
- <h1>QwenImage</h1>
41
- <p class="subtitle">By Alibaba Tongyi Lab</p>
42
  </div>
43
 
44
  <div class="announcement-section">
45
- <p class="announcement">QwenImage is now available on WaveSpeedAI!</p>
46
  <div class="divider"></div>
47
- <p class="description">Open and Advanced Text-to-Image Generative Model</p>
48
  </div>
 
 
 
 
 
 
 
 
 
 
 
 
49
 
50
  <div class="features-section">
51
  <div class="feature">
52
- <h3>🚀 Powerful Text-to-Image Conversion</h3>
53
- <p>Advanced architecture that transforms text descriptions into high-quality, detailed images with exceptional understanding of complex prompts</p>
54
  </div>
 
55
  <div class="feature">
56
- <h3>🎯 Precise Prompt Understanding</h3>
57
- <p>Meticulously trained to understand nuanced text prompts, allowing for detailed control over artistic styles, lighting, composition, and visual elements</p>
58
  </div>
59
  <div class="feature">
60
- <h3>🌟 Creative Text Interpretation</h3>
61
- <p>Trained on millions of text-image pairs, capable of interpreting creative descriptions and generating images that accurately reflect complex concepts and ideas</p>
62
  </div>
63
  </div>
64
 
65
- <div class="redirect-section">
66
- <p class="redirect-text">Redirecting to QwenImage Text-to-Image on WaveSpeedAI in 30 seconds...</p>
67
- <div class="progress-bar">
68
- <div class="progress"></div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
  </div>
70
- <a href="https://wavespeed.ai/models/wavespeed-ai/qwen-image/text-to-image" class="cta-button" target="_blank" rel="noopener noreferrer">Visit Now →</a>
71
- <p class="huggingface-link">
72
- Also available on <a href="https://huggingface.co/Qwen" target="_blank" rel="noopener noreferrer">Hugging Face</a>
73
- </p>
74
  </div>
75
  </div>
76
  </div>
 
3
  <head>
4
  <meta charset="utf-8" />
5
  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
6
+ <title>Qwen-Image - Advanced Text-to-Image Generation by Alibaba Cloud</title>
7
+ <meta name="description" content="Qwen-Image: Part of the Qwen (Tongyi Qianwen) model series by Alibaba Cloud. A powerful text-to-image generative model that creates stunning images from text prompts with high-quality rendering, artistic style control, and exceptional detail." />
8
+ <meta name="keywords" content="Qwen-Image, Qwen, Tongyi Qianwen, Alibaba Cloud, Text-to-Image, AI Models, Prompt Engineering, Image Generation, AI Art, Generative AI, Image Synthesis, Multimodal AI" />
9
 
10
  <!-- Open Graph / Social Media Meta Tags -->
11
+ <meta property="og:title" content="Qwen-Image - Advanced Text-to-Image Generation by Alibaba Cloud" />
12
+ <meta property="og:description" content="Transform your text into stunning images with Qwen-Image, part of the Tongyi Qianwen model series developed by Alibaba Cloud" />
13
  <meta property="og:type" content="website" />
14
+ <meta property="og:url" content="https://huggingface.co/Qwen/Qwen-Image" />
15
 
16
  <!-- Additional Meta Information -->
17
+ <meta name="author" content="Alibaba Cloud Qwen Team" />
18
  <meta name="robots" content="index, follow" />
19
+ <link rel="canonical" href="https://huggingface.co/Qwen/Qwen-Image" />
20
 
21
  <link rel="stylesheet" href="style.css" />
 
22
  </head>
23
  <body>
24
  <nav class="top-nav">
 
36
  <div class="container">
37
  <div class="content">
38
  <div class="logo-section">
39
+ <h1>Qwen-Image</h1>
40
+ <p class="subtitle">By Alibaba Cloud Qwen Team</p>
41
  </div>
42
 
43
  <div class="announcement-section">
44
+ <p class="announcement">Qwen-Image is now available!</p>
45
  <div class="divider"></div>
46
+ <p class="description">Open-source Advanced Text-to-Image Generative Model</p>
47
  </div>
48
+
49
+ <div class="hero-image">
50
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/merge3.jpg" alt="Qwen-Image Examples" class="full-width-img">
51
+ </div>
52
+
53
+ <section class="intro-section">
54
+ <h2>Introduction</h2>
55
+ <p>We are thrilled to release Qwen-Image, an image generation foundation model in the Qwen series that achieves significant advances in complex text rendering and precise image editing. Experiments show strong general capabilities in both image generation and editing, with exceptional performance in text rendering, especially for Chinese.</p>
56
+ <div class="benchmark-image">
57
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/bench.png" alt="Qwen-Image Benchmark" class="full-width-img">
58
+ </div>
59
+ </section>
60
 
61
  <div class="features-section">
62
  <div class="feature">
63
+ <h3>🚀 Multimodal AI Capabilities</h3>
64
+ <p>Part of the Qwen (Tongyi Qianwen) model series, offering powerful text-to-image generation with exceptional understanding of complex prompts</p>
65
  </div>
66
+
67
  <div class="feature">
68
+ <h3>🌟 Open Source Innovation</h3>
69
+ <p>Part of Alibaba's commitment to open-source AI development, allowing researchers and developers to build upon and extend its capabilities</p>
70
  </div>
71
  <div class="feature">
72
+ <h3>🔍 Comprehensive Model Family</h3>
73
+ <p>Works alongside other Qwen models for text, vision, and multimodal applications, providing a complete ecosystem for AI development</p>
74
  </div>
75
  </div>
76
 
77
+ <section class="quickstart-section">
78
+ <h2>Quick Start</h2>
79
+ <p>Choose your preferred Qwen image model:</p>
80
+
81
+ <h3>Option 1: Using the latest Qwen VLo model</h3>
82
+ <p>The new Qwen VLo model supports both text-to-image and image-to-image generation with progressive generation feature.</p>
83
+ <div class="code-block">
84
+ <pre><code>pip install dashscope>=1.20.7</code></pre>
85
+ </div>
86
+ <div class="code-block">
87
+ <pre><code>import dashscope
88
+ from dashscope import ImageSynthesis
89
+
90
+ # Set your API key
91
+ dashscope.api_key = "YOUR_API_KEY"
92
+
93
+ # Text-to-image generation
94
+ response = ImageSynthesis.call(
95
+ model='qwen-vlo',
96
+ prompt='A coffee shop entrance features a chalkboard sign reading "Qwen Coffee 😊 $2 per cup"',
97
+ negative_prompt='blurry, low quality',
98
+ n=1, # Number of images to generate
99
+ size='1024*1024', # Image size
100
+ steps=50 # Diffusion steps
101
+ )
102
+
103
+ # Save the generated image
104
+ if response.status_code == 200:
105
+ with open('qwen_vlo_result.png', 'wb') as f:
106
+ f.write(response.output.images[0].image)
107
+ print('Image saved successfully!')
108
+ else:
109
+ print(f'Failed to generate image: {response.message}')</code></pre>
110
+ </div>
111
+
112
+ <h3>Option 2: Using Qwen-Image with diffusers</h3>
113
+ <p>Install the latest version of diffusers</p>
114
+ <div class="code-block">
115
+ <pre><code>pip install git+https://github.com/huggingface/diffusers</code></pre>
116
+ </div>
117
+ <p>The following contains a code snippet illustrating how to use the model to generate images based on text prompts:</p>
118
+ <div class="code-block">
119
+ <pre><code>from diffusers import DiffusionPipeline
120
+ import torch
121
+
122
+ model_name = "Qwen/Qwen-Image"
123
+
124
+ # Load the pipeline
125
+ if torch.cuda.is_available():
126
+ torch_dtype = torch.bfloat16
127
+ device = "cuda"
128
+ else:
129
+ torch_dtype = torch.float32
130
+ device = "cpu"
131
+
132
+ pipe = DiffusionPipeline.from_pretrained(model_name, torch_dtype=torch_dtype)
133
+ pipe = pipe.to(device)
134
+
135
+ positive_magic = {
136
+ "en": "Ultra HD, 4K, cinematic composition.", # for english prompt
137
+ "zh": "超清,4K,电影级构图" # for chinese prompt
138
+ }
139
+
140
+ # Generate image
141
+ prompt = '''A coffee shop entrance features a chalkboard sign reading "Qwen Coffee 😊 $2 per cup," with a neon light beside it displaying "通义千问". Next to it hangs a poster showing a beautiful Chinese woman, and beneath the poster is written "π≈3.1415926-53589793-23846264-33832795-02384197". Ultra HD, 4K, cinematic composition'''
142
+
143
+ negative_prompt = " "
144
+
145
+ # Generate with different aspect ratios
146
+ aspect_ratios = {
147
+ "1:1": (1328, 1328),
148
+ "16:9": (1664, 928),
149
+ "9:16": (928, 1664),
150
+ "4:3": (1472, 1140),
151
+ "3:4": (1140, 1472)
152
+ }
153
+
154
+ width, height = aspect_ratios["16:9"]
155
+
156
+ image = pipe(
157
+ prompt=prompt + positive_magic["en"],
158
+ negative_prompt=negative_prompt,
159
+ width=width,
160
+ height=height,
161
+ num_inference_steps=50,
162
+ true_cfg_scale=4.0,
163
+ generator=torch.Generator(device="cuda").manual_seed(42)
164
+ ).images[0]
165
+
166
+ image.save("example.png")</code></pre>
167
+ </div>
168
+ </section>
169
+
170
+ <section class="showcase-section">
171
+ <h2>Show Cases</h2>
172
+
173
+ <div class="showcase-item-full">
174
+ <div class="showcase-description-full">
175
+ <h3>Superior Text Rendering</h3>
176
+ <p>One of its standout capabilities is high-fidelity text rendering across diverse images. Whether it's alphabetic languages like English or logographic scripts like Chinese, Qwen-Image preserves typographic details, layout coherence, and contextual harmony with stunning accuracy. Text isn't just overlaid—it's seamlessly integrated into the visual fabric.</p>
177
+ </div>
178
+ <div class="showcase-image-full">
179
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/s1.jpg" alt="Text Rendering Example" class="showcase-img-full">
180
+ </div>
181
+ </div>
182
+
183
+ <div class="showcase-item-full">
184
+ <div class="showcase-description-full">
185
+ <h3>Artistic Style Support</h3>
186
+ <p>Beyond text, Qwen-Image excels at general image generation with support for a wide range of artistic styles. From photorealistic scenes to impressionist paintings, from anime aesthetics to minimalist design, the model adapts fluidly to creative prompts, making it a versatile tool for artists, designers, and storytellers.</p>
187
+ </div>
188
+ <div class="showcase-image-full">
189
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/s2.jpg" alt="Artistic Styles Example" class="showcase-img-full">
190
+ </div>
191
+ </div>
192
+
193
+ <div class="showcase-item-full">
194
+ <div class="showcase-description-full">
195
+ <h3>Advanced Image Editing</h3>
196
+ <p>When it comes to image editing, Qwen-Image goes far beyond simple adjustments. It enables advanced operations such as style transfer, object insertion or removal, detail enhancement, text editing within images, and even human pose manipulation—all with intuitive input and coherent output. This level of control brings professional-grade editing within reach of everyday users.</p>
197
+ </div>
198
+ <div class="showcase-image-full">
199
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/s3.jpg" alt="Image Editing Example" class="showcase-img-full">
200
+ </div>
201
+ </div>
202
+
203
+ <div class="showcase-item-full">
204
+ <div class="showcase-description-full">
205
+ <h3>Image Understanding</h3>
206
+ <p>But Qwen-Image doesn't just create or edit—it understands. It supports a suite of image understanding tasks, including object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and super-resolution. These capabilities, while technically distinct, can all be seen as specialized forms of intelligent image editing, powered by deep visual comprehension.</p>
207
+ </div>
208
+ <div class="showcase-image-full">
209
+ <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Image/s4.jpg" alt="Image Understanding Example" class="showcase-img-full">
210
+ </div>
211
+ </div>
212
+
213
+ <div class="showcase-conclusion">
214
+ <p>Together, these features make Qwen-Image not just a tool for generating pretty pictures, but a comprehensive foundation model for intelligent visual creation and manipulation—where language, layout, and imagery converge.</p>
215
+ </div>
216
+ </section>
217
+
218
+ <div class="resource-links-section">
219
+ <h2>Resources</h2>
220
+ <div class="resource-links">
221
+ <a href="https://huggingface.co/Qwen/Qwen-Image" target="_blank" rel="noopener noreferrer" class="resource-link">Qwen-Image on Hugging Face</a>
222
+ <a href="https://github.com/QwenLM/Qwen" target="_blank" rel="noopener noreferrer" class="resource-link">Qwen GitHub</a>
223
+ <a href="https://www.alibabacloud.com/en/solutions/generative-ai/qwen" target="_blank" rel="noopener noreferrer" class="resource-link">Alibaba Cloud Qwen</a>
224
+ <a href="https://modelscope.cn/models/qwen/Qwen-Image" target="_blank" rel="noopener noreferrer" class="resource-link">ModelScope</a>
225
+ <a href="https://help.aliyun.com/zh/dashscope/developer-reference/qwen-vlo-quick-start" target="_blank" rel="noopener noreferrer" class="resource-link">Qwen VLo Documentation</a>
226
+ <a href="https://www.alibabacloud.com/help/en/model-studio/vision/" target="_blank" rel="noopener noreferrer" class="resource-link">Qwen-VL Documentation</a>
227
  </div>
 
 
 
 
228
  </div>
229
  </div>
230
  </div>