Tehran University of Medical Sciences

Science Communicator Platform

Diagnostic Performance of ChatGPT in Tibial Plateau Fracture in Knee X-Ray

Summary: Can AI diagnose fractures? Study finds ChatGPT-4o matches physicians in diagnosing tibial plateau fractures. #AIinMedicine #TibialFractures

Authors: Mohammadi M1; Parviz S2; Parvaz P3; Pirmoradi MM1; Afzalimoghaddam M1,4; Mirfazaelian H4

Source: Emergency Radiology. Published: 2025


Abstract

Purpose: Tibial plateau fractures are relatively common and require accurate diagnosis. Chat Generative Pre-Trained Transformer (ChatGPT) has emerged as a tool to support medical diagnosis. This study investigates the accuracy of this tool in diagnosing tibial plateau fractures.

Methods: A secondary analysis was performed on 111 knee radiographs from emergency department patients, 29 of which had fractures confirmed by computed tomography (CT). The X-rays were reviewed by a board-certified emergency physician (EP) and a radiologist and then analyzed by ChatGPT-4 and ChatGPT-4o. Diagnostic performances were compared using the area under the receiver operating characteristic curve (AUC). Sensitivity, specificity, and likelihood ratios were also calculated.

Results: Sensitivity and negative likelihood ratio were 58.6% (95% CI: 38.9–76.4%) and 0.4 (95% CI: 0.3–0.7) for the EP, 72.4% (95% CI: 52.7–87.2%) and 0.3 (95% CI: 0.2–0.6) for the radiologist, 27.5% (95% CI: 12.7–47.2%) and 0.7 (95% CI: 0.6–0.9) for ChatGPT-4, and 55.1% (95% CI: 35.6–73.5%) and 0.4 (95% CI: 0.3–0.7) for ChatGPT-4o. Specificity and positive likelihood ratio were 85.3% (95% CI: 75.8–92.2%) and 4.0 (95% CI: 2.1–7.3) for the EP, 76.8% (95% CI: 66.2–85.4%) and 3.1 (95% CI: 1.9–4.9) for the radiologist, 95.1% (95% CI: 87.9–98.6%) and 5.6 (95% CI: 1.8–17.3) for ChatGPT-4, and 93.9% (95% CI: 86.3–97.9%) and 9.0 (95% CI: 3.6–22.4) for ChatGPT-4o. The AUC was 0.72 (95% CI: 0.6–0.8) for the EP, 0.75 (95% CI: 0.6–0.8) for the radiologist, 0.61 (95% CI: 0.4–0.7) for ChatGPT-4, and 0.74 (95% CI: 0.6–0.8) for ChatGPT-4o. The EP and the radiologist significantly outperformed ChatGPT-4 (P = 0.02 and 0.01, respectively), whereas there was no significant difference among the EP, the radiologist, and ChatGPT-4o.

Conclusion: ChatGPT-4o matched the physicians' performance and had the highest specificity. Like the physicians, however, the ChatGPT models were not suitable for ruling out fractures. © The Author(s), under exclusive licence to American Society of Emergency Radiology (ASER) 2024.
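For readers unfamiliar with likelihood ratios: they follow directly from sensitivity and specificity via LR+ = sensitivity / (1 − specificity) and LR− = (1 − sensitivity) / specificity. The minimal Python sketch below (illustrative only, not the authors' analysis code) applies these standard definitions to the point estimates reported above for ChatGPT-4o; because the published inputs are rounded, the recomputed LR− can differ from the published value in the last digit.

def likelihood_ratios(sensitivity: float, specificity: float) -> tuple[float, float]:
    """Return (LR+, LR-) for a binary diagnostic test.

    LR+ = sensitivity / (1 - specificity): how much a positive call
    raises the odds of fracture.
    LR- = (1 - sensitivity) / specificity: how much a negative call
    lowers them.
    """
    return sensitivity / (1.0 - specificity), (1.0 - sensitivity) / specificity

# Point estimates for ChatGPT-4o as reported in the abstract.
lr_pos, lr_neg = likelihood_ratios(sensitivity=0.551, specificity=0.939)
print(f"LR+ = {lr_pos:.1f}")   # ~9.0, matching the reported value
print(f"LR- = {lr_neg:.2f}")   # ~0.48; rounding of the inputs explains the drift from the published 0.4

An LR− of roughly 0.4–0.5 only modestly lowers the post-test odds of fracture, which is why the abstract concludes that neither the physicians nor the ChatGPT models were suitable for ruling fractures out.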