ChatGPT 4.0 Falls Short in Diagnosing Orbital Floor Fractures
Key Highlights
- ChatGPT 4.0 showed statistically significant differences compared with an oral and maxillofacial surgeon when diagnosing orbital floor fractures.
- AI identified orbital floor fractures in only 19 of 30 cases (63.3%), a performance that was neither reliable nor accurate.
- Findings suggest orbital floor fracture diagnosis should remain surgeon-driven, with AI results requiring clinical verification.
In a retrospective cohort study presented at the 107th American Association of Oral and Maxillofacial Surgeons Annual Meeting, Scientific Sessions and Exhibition, researchers evaluated whether ChatGPT 4.0 could serve as an accurate and reliable diagnostic tool for orbital floor fractures. Despite its potential, results showed that the artificial intelligence (AI) system’s performance differed significantly from that of a specialist surgeon, arguing against its immediate clinical utility.
The emergence of large language models, including ChatGPT, has created excitement about their application in radiology and surgical fields due to their pattern-recognition capabilities. However, there remains a lack of robust evidence supporting their use in clinical diagnosis. Oral and maxillofacial surgeons are particularly interested in AI’s potential to improve diagnostic efficiency in acute trauma settings, prompting this investigation.
The study was performed on 30 cases of orbital floor fractures selected from a trauma database. Computed tomography scans (sagittal and coronal views) were reviewed, and ChatGPT’s diagnostic responses were compared with those of an oral and maxillofacial surgeon, who served as the gold standard. Descriptive analysis and a Student’s t-test were applied to determine statistical significance between groups.
The study population had a mean age of 68.4 years, with 66.7% male and 33.3% female patients. ChatGPT identified orbital floor fractures in 19 of 30 cases (63.3%) and reported no fracture in 11 of 30 cases (36.7%). Statistical analysis demonstrated a significant difference between AI and surgeon responses (P = .03). These findings indicate that ChatGPT failed to identify more than one-third of confirmed fractures, a miss rate inconsistent with reliable diagnostic performance.
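The reported figures can be reproduced in a short sketch. This is a hypothetical reconstruction, not the authors' analysis code: the per-case data are invented to match the reported counts (the surgeon gold standard confirmed a fracture in all 30 cases), and the paired Student's t-test shown here is one plausible reading of the comparison the study describes.

```python
# Illustrative reconstruction of the study's comparison (assumed data,
# built only to match the reported counts of 19 hits in 30 cases).
import math
from statistics import mean, stdev

surgeon = [1] * 30              # gold standard: fracture present in every case
chatgpt = [1] * 19 + [0] * 11   # AI called a fracture in 19 of 30 cases

# Agreement with the gold standard (diagnostic accuracy in this cohort).
accuracy = sum(a == s for a, s in zip(chatgpt, surgeon)) / len(surgeon)

# Paired Student's t-test on per-case differences (AI minus surgeon).
diffs = [a - s for a, s in zip(chatgpt, surgeon)]
n = len(diffs)
t_stat = mean(diffs) / (stdev(diffs) / math.sqrt(n))

print(f"accuracy = {accuracy:.1%}, t = {t_stat:.2f}")
```

The negative t statistic reflects that ChatGPT under-called fractures relative to the surgeon; the exact P value depends on the test the authors actually ran, which the abstract does not fully specify.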
“Orbital floor fractures still require diagnosis by an oral and maxillofacial surgeon, ensuring that surgeons can verify AI-generated responses before using the tool in clinical settings,” the study authors concluded. “Further studies are needed to challenge the clinical applications of this software, particularly given its increasing use among oral and maxillofacial surgeons.”
Reference:
Ferrer JC, Caicedo AJH, Peña-Ruiz AO, Bermudez F. ChatGPT(4.0) has the capacity to diagnose orbital floor fractures: Fiction or reality? Presented at: American Association of Oral and Maxillofacial Surgeons Annual Meeting; September 15-20, 2025; Washington, DC. https://aaoms-annual-meeting-2025.eventscribe.net/.
