HistoGPT is a vision language model that generates highly accurate pathology reports from gigapixel whole slide images. The model takes multiple tissue sections from a patient as input and generates a highly accurate pathology report that includes disease classification, tumor subtype prediction, tumor thickness estimation, and other important clinical information. Most importantly, HistoGPT is fully interpretable, as every word or phrase in the output text can be visualized in the original image.
The full code of our project is available on GitHub. At the moment only the model weights are hosted here on Hugging Face.