June 10, 2026

Experienced dermatologists outperform AI in real-world skin cancer diagnosis

Key Takeaways

Multiclass accuracy rose with dermoscopy experience, peaking at 74.2% for >10-year experts, versus 72.2% for PanDerm image-only and 56.7% for a first-generation CNN.
Binary classification favored AI: the image-only model reached a balanced accuracy of 0.82, versus 0.65 for human readers, driven by specificity (94% benign clearance, rising to 97% for the multimodal version), while clinicians traded specificity for sensitivity to avoid missed malignancies.
Incorporating clinical photos and metadata reduced PanDerm multiclass accuracy to 66.3%, consistent with distribution shift between training imagery and more complex test-set clinical images.
Missed malignant cases clustered at acral sites, implicating underrepresentation of acral melanoma in public training datasets and reinforcing generalizability risks across skin phototypes and populations.
Optimal deployment emphasizes human–AI complementarity: decision support and education for novices, and triage/second-read workflows for experts to mitigate fatigue-related diagnostic error.

Artificial intelligence models matched mid-career dermatologists but trailed seasoned experts in diagnosing skin lesions across real-world cases.

Artificial intelligence (AI) has repeatedly been shown to match or beat dermatologists at reading skin lesions—but mostly under tightly controlled conditions. A new study published in JAMA Dermatology tested how that performance holds up against the broader mix of cases seen in the clinic and found that seasoned specialists still came out ahead.

“AI systems demonstrate strong potential as diagnostic support tools, particularly for early-career clinicians,” the authors wrote. “Despite overconfident mainstream narratives about achieving clinical excellence through AI-based technology alone, it remains crucial to continue training primary care physicians to recognize skin lesions, especially cancers, and to continue educating dermatologists toward expertise. This training is important not only in dermoscopy but also in the use of AI, including how it works and its limitations.”

The researchers, led by Julien Anriot, M.D., and Luc Thomas, M.D., Ph.D., of Claude Bernard University Lyon 1 in France, compared three AI systems against 652 physicians ranging from readers with less than 1 year of experience to readers with more than 10 years of experience. Drawing on 1,117 standardized cases that paired clinical and dermoscopic images with patient history and demographics, the team ran 1,092 human test iterations and intentionally included rare and atypical tumors that often challenge clinicians.

On the primary measure, accuracy across nine diagnostic categories, experts with more than 10 years of experience led at 74.2%. The strongest AI tool, the image-only version of the foundation model PanDerm, reached 72.2%. That was enough to outperform dermatologists with less than 1 year of experience (59.1%) and statistically match clinicians with 3 to 10 years of experience. A first-generation convolutional neural network had the lowest accuracy at 56.7%, trailing every group of human readers.

For the simpler benign-versus-malignant question, the image-only model had the highest balanced accuracy at 0.82, compared with 0.65 for humans, driven largely by specificity. The model correctly cleared benign lesions 94% of the time, and a version that also incorporated clinical photos and metadata hit 97%. Human readers were more cautious, accepting lower specificity to avoid missing cancers, and the most experienced clinicians retained the best sensitivity.
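For readers less familiar with the metric, balanced accuracy in a two-class setting is conventionally the average of sensitivity and specificity, so a reader can score well on it largely by clearing benign lesions correctly. The short Python sketch below illustrates that arithmetic. The counts are hypothetical, chosen only so that specificity comes out at 94%; they are not the study's data, and the function names are illustrative rather than anything the authors used.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # share of malignant lesions correctly flagged
    specificity = tn / (tn + fp)  # share of benign lesions correctly cleared
    return sensitivity, specificity


def balanced_accuracy(sensitivity, specificity):
    """Conventional two-class balanced accuracy: the mean of the two rates."""
    return (sensitivity + specificity) / 2


# Hypothetical counts (NOT from the study): 100 malignant and 100 benign lesions.
sens, spec = sensitivity_specificity(tp=70, fn=30, tn=94, fp=6)
score = balanced_accuracy(sens, spec)

print(f"sensitivity={sens:.2f} specificity={spec:.2f} balanced accuracy={score:.2f}")
```

With these made-up counts, a reader that clears 94% of benign lesions but catches 70% of malignant ones still scores about 0.82, which is why a high balanced accuracy does not by itself imply the best sensitivity.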

“The study confirmed the expected association between dermoscopy experience and diagnostic performance and quantified the gap between training levels,” the authors explained. “Thus, performance depended on the metric considered: the unimodal model achieved the highest binary balanced accuracy, largely associated with higher specificity, whereas the most experienced readers retained the highest multiclass diagnostic accuracy and the best sensitivity.”

One result surprised the investigators: adding clinical context made PanDerm less accurate instead of more accurate. The multimodal version scored 66.3% on multiclass accuracy, below the image-only configuration.

“Unlike human readers who benefit from clinical context, the AI system did not gain accuracy from additional data,” the authors wrote. “A likely explanation is a distribution shift between the close-up clinical images used for the unimodal and multimodal model training and the more distant, complex clinical images presented in the test set. Among the malignant lesions missed by both configurations, an apparent preponderance of acral localizations was noted. This could reflect underrepresentation of acral melanoma in publicly available training datasets.”

The authors caution that the results have limits. Readers were predominantly French, the patient population was largely of European origin, and darker skin phototypes were underrepresented. This generalizability concern mirrors broader questions about how AI performs across diverse populations.

That the multimodal model fell short reinforces a key point: AI progress hinges on how well data is integrated, not simply how much of it there is.

“The future likely lies in collaboration between humans and machines to optimize diagnostic performance. For novice practitioners, AI could serve as a safety net and educational tool. For experts, it could provide an efficient triage modality and a systematic second reading, particularly useful for reducing errors caused by fatigue or inattention.”
