SmilingWolf
/

wd-swinv2-tagger-v3

Model card Files Files and versions Community

wd-swinv2-tagger-v3 / README.md

SmilingWolf's picture

Amend the JAX model config file: add image size.

9e6d6a7 over 1 year ago

|

1.34 kB

	---
	license: apache-2.0
	library_name: timm
	---
	# WD SwinV2 Tagger v3

	Supports ratings, characters and general tags.

	Trained using https://github.com/SmilingWolf/JAX-CV.
	TPUs used for training kindly provided by the [TRC program](https://sites.research.google/trc/about/).

	## Dataset
	Last image id: 7220105
	Trained on Danbooru images with IDs modulo 0000-0899.
	Validated on images with IDs modulo 0950-0999.
	Images with less than 10 general tags were filtered out.
	Tags with less than 600 images were filtered out.

	## Validation results
	`P=R: threshold = 0.2521, F1 = 0.4411`

	## What's new
	Model v1.1/Dataset v3:
	Amended the JAX model config file: add image size.
	No change to the trained weights.

	Model v1.0/Dataset v3:
	More training images, more and up-to-date tags (up to 2024-02-28).
	Now `timm` compatible! Load it up and give it a spin using the canonical one-liner!
	ONNX model is compatible with code developed for the v2 series of models.
	The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
	Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.

	## Final words
	Subject to change and updates.
	Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.