Upload folder using huggingface_hub

Browse files

Files changed (4) hide show

README.md +55 -0
README_English.md +53 -0
config.json +36 -0
diffusion_pytorch_model.safetensors +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,55 @@

+---
+license: cc-by-nc-4.0
+language:
+- en
+base_model:
+- stabilityai/stable-diffusion-3.5-medium
+---
+# MANGA109 Pose HAの漫画画像で学習したText-Image-to-Image
+このリポジトリは、[MANGA109 Pose tools](https://github.com/kuri-lab/MANGA109-Pose-tools)の画像生成モデルです。画像生成モデルに入力する条件画像は、上記URLのレポジトリで作成してください。
+## 学習パラメータ
+|引数 | 値 |
+| ---- | ---- |
+|resolution | 512 |
+|train batch size | 4 |
+|learning rate | 1e-05 |
+|mixed precision | fp16 |
+|max train steps | 200,000 |
+## 学習データセット
+MANGA109 Pose HA をtraining set，validation set，test set を8:1:1に分割したデータセット
+## 作成者の環境
+  - GPU：H100NVL（1枚）
+  - CUDA：12.4
+  - PyTorch：2.6.0+cu124
+  - diffusers:0.33.0.dev0
+## 計算時間
+H100(NVL)94GB の1 つのGPU を用いて88 時間
+1 学習ステップあたり1.58 秒
+## License
+本リポジトリは、
+[Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) ](https://creativecommons.org/licenses/by-nc/4.0/deed.en)に基づいてライセンスされています。
+## 引用
+このリポジトリを研究で使用する場合は，次の Bibtex エントリを使用して引用することを検討してください．
+```
+@article{okada2025manga109pose,
+  title={MANGA109 に姿勢情報を追加したデータセットの構築による姿勢を制御した漫画キャラクター画像生成},
+  author={岡田 湧路 and 北川 峻 and 渡邉 謙吾 and 稲葉 通将 and 橋本 敦史 and 栗原 聡},
+  journal={人工知能学会全国大会論文集},
+  volume={JSAI2025},
+  pages={2O1GS1005-2O1GS1005}
+  year={2025}
+}
+```
+## 更新履歴
+* 2025/04/25: [公開]
+*

README_English.md ADDED Viewed

	@@ -0,0 +1,53 @@

+---
+license: cc-by-nc-4.0
+language:
+- en
+base_model:
+- stabilityai/stable-diffusion-3.5-medium
+---
+# Text-Image-to-Image Trained on MANGA109 Pose HA Manga Images
+This repository contains an image generation model trained using manga images from [MANGA109 Pose tools](https://github.com/kuri-lab/MANGA109-Pose-tools).
+Please create the conditional input images using the repository linked above.
+## Training Parameters
+|Argument | Value |
+| ---- | ---- |
+|resolution | 512 |
+|train batch size | 4 |
+|learning rate | 1e-05 |
+|mixed precision | fp16 |
+|max train steps | 200,000 |
+## Training Dataset
+The MANGA109 Pose HA dataset was split into training, validation, and test sets in an 8:1:1 ratio.
+## Author's Environment
+  - GPU：H100NVL (1 unit)
+  - CUDA：12.4
+  - PyTorch：2.6.0+cu124
+  - diffusers: 0.33.0.dev0
+## Computation Time
+Training was conducted on a single H100 NVL GPU (94GB) and took 88 hours.
+Each training step took approximately 1.58 seconds.
+## License
+This repository is licensed under the [Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) ](https://creativecommons.org/licenses/by-nc/4.0/deed.en).
+## Citation
+If you use this repository in your research, please consider citing it using the following BibTeX entry:
+```
+@article{okada2025manga109pose,
+  title={MANGA109 に姿勢情報を追加したデータセットの構築による姿勢を制御した漫画キャラクター画像生成},
+  author={岡田 湧路 and 北川 峻 and 渡邉 謙吾 and 稲葉 通将 and 橋本 敦史 and 栗原 聡},
+  journal={人工知能学会全国大会論文集},
+  volume={JSAI2025},
+  pages={2O1GS1005-2O1GS1005}
+  year={2025}
+}
+```
+Update History
+* 2025/04/25: [Public Release]

config.json ADDED Viewed

	@@ -0,0 +1,36 @@

+{
+  "_class_name": "SD3ControlNetModel",
+  "_diffusers_version": "0.32.1",
+  "_name_or_path": "stabilityai/stable-diffusion-3.5-medium",
+  "attention_head_dim": 64,
+  "caption_projection_dim": 1536,
+  "dual_attention_layers": [
+    0,
+    1,
+    2,
+    3,
+    4,
+    5,
+    6,
+    7,
+    8,
+    9,
+    10,
+    11,
+    12
+  ],
+  "extra_conditioning_channels": 0,
+  "force_zeros_for_pooled_projection": true,
+  "in_channels": 16,
+  "joint_attention_dim": 4096,
+  "num_attention_heads": 24,
+  "num_layers": 12,
+  "out_channels": 16,
+  "patch_size": 2,
+  "pooled_projection_dim": 2048,
+  "pos_embed_max_size": 384,
+  "pos_embed_type": "sincos",
+  "qk_norm": "rms_norm",
+  "sample_size": 128,
+  "use_pos_embed": true
+}

diffusion_pytorch_model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:45bee064703ab9be38fb816c2f9fddadb08c2a30920f686a6fc15b8d09c2cc83
+size 5950710432