News|Articles|June 11, 2025

GPT-4 AI Model Outperforms Traditional Tools in Predicting Cutaneous Squamous Cell Carcinoma Outcomes

Cutaneous squamous cell carcinoma (cSCC) is the second most common form of skin cancer. While most cases are treatable, a small number can become serious and spread, leading to worse outcomes.

A risk prediction tool for cutaneous squamous cell carcinoma (cSCC) built with the GPT-4 large language artificial intelligence model performed better than current systems at identifying patients more likely to have poor outcomes, according to a new study published in JAMA Dermatology.

CSCC is the second most common form of skin cancer. While most cases are treatable, a small number can become serious and spread, leading to worse outcomes.

Accurately identifying which tumors are more dangerous is important for deciding how to treat patients, the report shared.

Existing tools or models, such as the AJCC8 and BWH staging systems, group tumors by certain traits, but they tend to miss important risk factors and can group very different tumors together, making it harder to predict who might do poorly.

Many factors increase the risk of developing cSCC, including immunosuppression, chronic wounds, fair skin, male gender, older age, certain genetic conditions, ultraviolet (UV) radiation exposure and a history of prior squamous cell carcinoma, according to the National Institutes of Health.

In 2012, the estimated incidence was 140 cases per 100,000 American men and 50 per 100,000 women.

To address these limitations, researchers searched PubMed, Embase and the Cochrane Library for studies from 1999 through the end of 2023.

After applying strict criteria, 10 studies that linked risk factors to serious outcomes such as recurrence, spread or death were selected.

These studies were used to inform a large AI model, GPT-4, called AIRIS through a process called retrieval-augmented generation (RAG).

The AI created a new scoring system to predict which cSCC tumors are more dangerous.

AIRIS was tested using tumor data from NYU Langone Health and Mayo Clinic.

The dataset included 2,379 biopsy-proven cSCC cases with full clinical information.

The AI model’s predictions were compared to AJCC8 and BWH systems using statistical tests.

Researchers measured how well AIRIS could predict poor outcomes using standard metrics like sensitivity, specificity and AUC. AIRIS was also tested for consistency and ability to separate high- and low-risk cases.

It was found that AIRIS outperformed BWH and AJCC8 in a number of key areas for predicting poor outcomes in patients with cSCC.

In low-risk groups, AIRIS showed fewer poor outcomes: 50.9% for local recurrence (LR), 26.3% for nodal metastasis (NM), 17.5% for distant metastasis (DM) and 27.8% for disease-specific death (DSD).

In comparison, BWH and AJCC8 systems had nearly twice as many poor outcomes in their low-risk groups, indicating there were less consistent results.

AIRIS also showed further progression, overall.

For high-risk AIRIS classes, the poor outcome rates increased significantly: LR (49.1%), NM (73.7%), DM (82.5%) and DSD (72.2%).

As far as diagnostic performance, AIRIS had higher sensitivity for all outcomes—ranging from 49.1% to 82.5%—but slightly lower compared to BWH and AJCC8.

Although overall accuracy was lower, AIRIS demonstrated stronger predictive power, with AUC values of 0.69 (LR), 0.81 (NM), 0.85 (DM), and 0.80 (DSD)—all higher than the traditional systems.

While much data was collected, the study did have several strengths.

For example, reviewed over 2,000 primary tumors to validate AIRIS. AIRIS included important patient risk factors such as immunosuppression, lymphovascular invasion and in-transit metastasis, which are often missing from traditional staging systems, authors of the study noted.

This helped AIRIS better predict poor outcomes and showed improved sensitivity and risk discrimination compared to current standards.

However, limitations include the relatively low event rate of poor outcomes in cSCC which cab make validation challenging.

In addition, large language models such as GPT rely on probable predictions and can have biases based on their training data and inputs.

While RAG helps ground the model in reliable literature, AI-generated outputs still require careful validation, authors suggest.

Future improvements are recommended to include weighting immunosuppression categories and integrating multimodal data including imaging or gene profiles to personalize risk predictions further.

Get the latest industry news, event updates, and more from Managed healthcare Executive.

Subscribe Now!

GPT-4 AI Model Outperforms Traditional Tools in Predicting Cutaneous Squamous Cell Carcinoma Outcomes

Newsletter

Related Content

AbbVie submits vitiligo indication applications for Rinvoq

Understanding the facial effects of GLP-1s with plastic surgeon, Konstantin Vasyukevich, M.D.

FAQ: A message to health systems and payers: chronic itch is more than a skin problem

Probiotics show benefits for atopic dermatitis, but research gaps remain for other skin conditions

Adding NB-UVB phototherapy to ruxolitinib cream speeds vitiligo repigmentation process

Latest CME

Community Oncology Connections™: Optimizing SCLC Treatment Strategies and Managing Adverse Events Across Disease Stages | South Carolina

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | Kansas

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | Wyoming and Montana

Personalized Management in NSCLC: Strategies for Early Detection, Molecular Testing, and Targeted Therapies | New Mexico

Community Oncology Connections™: Optimizing SCLC Treatment Strategies and Managing Adverse Events Across Disease Stages | North Carolina

A Breath of Strength: Managing Cancer Associated LEMS and Lung Cancer as One

Huntington Beach Symposia

Striking the Right Nerve: Managing Cancer Associated LEMS in Lung Cancer Patients

3rd Annual Hawaii Lung: A Multidisciplinary Case-Based Conference

Community Practice Connections™: Incorporating Recent Updates in the Treatment of Metastatic ALK-Positive NSCLC

Virtual Testing Board: Digging Deeper on Your Testing Reports to Elevate Patient Outcomes in Advanced Non–Small Cell Lung Cancer

Mastering Advances in Managing Unresectable and Metastatic NSCLC—Immunotherapy, Targeted Therapies, and Emerging Strategies

Cases & Conversations™: Expert Perspectives on Leveraging Recent Advances to Transform SCLC Treatment

22nd Annual Winter Lung Cancer Conference®

Show Me Your Care Plan!™ Insights for Oncology Nurses on Comprehensive SCLC Treatment and Care Strategies

Show Me Your Care Plan!™ Insights for Oncology Nurses on Comprehensive SCLC Treatment and Care Strategies

(CME Credit) Advancing Outcomes in Limited-Stage Small Cell Lung Cancer: From Evidence to Practice

Medical Crossfire®: Expert Perspectives on Targeting c-Met Overexpression and 𝘔𝘌𝘛 Genomic Alterations in NSCLC – Unveiling the Complexities of 𝘔𝘌𝘛 Dysregulation

PER Tumor Board®: Applying Recent Advances to Transform the Treatment Paradigm in SCLC—Expert Perspectives on New Approvals and Emerging Strategies

Tumor Board: Expert Insights on Managing Classical 𝘌𝘎𝘍𝘙 Mutations, 𝘌𝘎𝘍𝘙 Exon 20 Insertions, and Atypical 𝘌𝘎𝘍𝘙 Mutations in Metastatic NSCLC

Medical Crossfire®: DLL3-Driven Innovations in Small Cell Lung Cancer – How Do Experts Apply Pivotal Advances to Practice?

Medical Crossfire®: The Precision Path for HER2 and TROP2-Targeted Treatments in Non–Small Cell Lung Cancer

Trending on Managed Healthcare Executive

7 things you need to know about the PBM reforms signed into law this week

TrumpRx launches; some experts question its long-term value

PBM reform. It has finally happened

What exactly is managed care today?

FDA issues draft guidance on MRD and complete response as primary endpoints for accelerated multiple myeloma drug approvals