Tehran University of Medical Sciences

Science Communicator Platform

Share By
Pathologist-Like Explainable Ai for Interpretable Gleason Grading in Prostate Cancer Publisher Pubmed



Mittmann G ; Laiouarpedari S ; Mehrtens HA ; Haggenmuller S ; Bucher TC ; Chanda T ; Gaisa NT ; Wagner M ; Klamminger GG ; Rau TT ; Neppl C ; Comperat EM ; Gocht A ; Haemmerle M Show All Authors
Authors
  1. Mittmann G
  2. Laiouarpedari S
  3. Mehrtens HA
  4. Haggenmuller S
  5. Bucher TC
  6. Chanda T
  7. Gaisa NT
  8. Wagner M
  9. Klamminger GG
  10. Rau TT
  11. Neppl C
  12. Comperat EM
  13. Gocht A
  14. Haemmerle M
  15. Rupp NJ
  16. Westhoff J
  17. Krucken I
  18. Seidl M
  19. Schurch CM
  20. Bauer M
  21. Solass W
  22. Tam YC
  23. Weber F
  24. Grobholz R
  25. Augustyniak J
  26. Kalinski T
  27. Horner C
  28. Mertz KD
  29. Doring C
  30. Erbersdobler A
  31. Deubler G
  32. Bremmer F
  33. Sommer U
  34. Brodhun M
  35. Griffin J
  36. Lenon MSL
  37. Trpkov K
  38. Cheng L
  39. Chen F
  40. Levi A
  41. Cai G
  42. Nguyen TQ
  43. Amin A
  44. Cimadamore A
  45. Shabaik A
  46. Manucha V
  47. Ahmad N
  48. Messias N
  49. Sanguedolce F
  50. Taheri D
  51. Baraban E
  52. Jia L
  53. Shah RB
  54. Siadat F
  55. Swarbrick N
  56. Park K
  57. Hassan O
  58. Sakhaie S
  59. Downes MR
  60. Miyamoto H
  61. Williamson SR
  62. Hollandletz T
  63. Wies C
  64. Schneider CV
  65. Kather JN
  66. Tolkach Y
  67. Brinker TJ

Source: Nature Communications Published:2025


Abstract

The aggressiveness of prostate cancer is primarily assessed from histopathological data using the Gleason scoring system. Conventional artificial intelligence (AI) approaches can predict Gleason scores, but often lack explainability, which may limit clinical acceptance. Here, we present an alternative, inherently explainable AI that circumvents the need for post-hoc explainability methods. The model was trained on 1,015 tissue microarray core images, annotated with detailed pattern descriptions by 54 international pathologists following standardized guidelines. It uses pathologist-defined terminology and was trained using soft labels to capture data uncertainty. This approach enables robust Gleason pattern segmentation despite high interobserver variability. The model achieved comparable or superior performance to direct Gleason pattern segmentation (Dice score: 0.713±0.003 vs. 0.691±0.010) while providing interpretable outputs. We release this dataset to encourage further research on segmentation in medical tasks with high subjectivity and to deepen insights into pathologists’ reasoning. © 2025 Elsevier B.V., All rights reserved.