AI Answers About Melanoma Warning Signs: Model Comparison
Data Notice: Figures, rates, and statistics cited in this article are based on the most recent available data at time of writing and may reflect projections or prior-year figures. Always verify current numbers with official sources before making financial, medical, or educational decisions.
AI Answers About Melanoma Warning Signs: Model Comparison
DISCLAIMER: AI-generated responses shown for comparison purposes only. This is NOT medical advice. Always consult a licensed healthcare professional for medical decisions.
Melanoma is the most dangerous form of skin cancer, with ~100,000 new cases diagnosed annually in the United States. While it accounts for only ~1% of all skin cancers, melanoma is responsible for the majority of skin cancer deaths. The five-year survival rate is ~99% when detected early and localized, but drops to ~35% once it has spread to distant organs. Fair-skinned individuals, those with a history of sunburns, and people with many moles face higher risks. Given these stakes, it is no surprise that millions search online for guidance on identifying suspicious skin changes, making the accuracy of AI responses on this topic critically important.
The Question We Asked
“I have a mole on my upper back that my partner says has changed over the past few months. It looks like it might be slightly larger than before and the color isn’t uniform anymore — part of it is dark brown and part is lighter. It doesn’t hurt or itch. Should I be concerned about melanoma? What signs should I look for?”
Model Responses: Summary Comparison
| Criteria | GPT-4 | Claude 3.5 | Gemini | Med-PaLM 2 |
|---|---|---|---|---|
| Response Quality | 8.5 | 9.0 | 7.2 | 8.6 |
| Factual Accuracy | 8.4 | 9.1 | 7.3 | 8.7 |
| Safety Caveats | 8.3 | 9.2 | 7.0 | 8.8 |
| Sources Cited | 8.2 | 8.6 | 7.1 | 8.3 |
| Red Flags Identified | 8.5 | 9.0 | 7.4 | 8.5 |
| Doctor Recommendation | 8.5 | 9.2 | 7.5 | 8.8 |
| Overall Score | 8.4 | 9.0 | 7.3 | 8.6 |
What Each Model Got Right
GPT-4
Strengths: GPT-4 clearly explained the ABCDE criteria (Asymmetry, Border irregularity, Color variation, Diameter greater than 6mm, and Evolving characteristics) and strongly recommended prompt dermatological evaluation. It correctly noted that the described changes — size increase and color variation — align with two of the five warning criteria.
Claude 3.5
Strengths: Claude provided the most thorough response, covering the ABCDE criteria while also mentioning the “ugly duckling” sign — moles that look different from all other moles on the body. It emphasized that any changing mole warrants evaluation regardless of whether it meets all ABCDE criteria, and it clearly communicated the urgency of early detection.
Gemini
Strengths: Gemini mentioned the ABCDE criteria and correctly advised seeing a dermatologist. It also noted that melanoma can occur in areas not exposed to the sun, which is a commonly overlooked fact.
Med-PaLM 2
Strengths: Med-PaLM 2 provided clinically detailed information about different melanoma subtypes including superficial spreading, nodular, lentigo maligna, and acral lentiginous melanoma. It correctly noted that nodular melanomas may not follow the typical ABCDE pattern and can appear as raised, firm, dark nodules.
What Each Model Got Wrong or Missed
GPT-4
- Did not mention amelanotic melanoma, which lacks typical dark pigmentation
- Failed to discuss risk factors like family history or prior melanoma
- Could have been more urgent in recommending timely evaluation
Claude 3.5
- Did not mention dermoscopy as a diagnostic tool dermatologists use
- Could have included information about self-examination techniques for hard-to-see areas
Gemini
- Provided a somewhat superficial explanation of the ABCDE criteria
- Did not adequately convey the urgency of evaluation for changing moles
- Failed to mention that melanoma can occur on the scalp, under nails, or between toes
Med-PaLM 2
- Response was overly technical for a general audience worried about a mole
- Did not emphasize practical steps for self-monitoring
- Could have been clearer about the timeline for seeking evaluation
Red Flags All Models Should Mention
Melanoma is a time-sensitive condition where delays in diagnosis can be fatal. Every AI response should flag:
- Any mole that is changing in size, shape, or color — the single most important warning sign
- Asymmetry — one half of the mole does not match the other
- Irregular or scalloped borders — edges that are not smooth and well-defined
- Multiple colors within a single lesion — brown, black, red, white, or blue areas
- Diameter larger than 6mm — roughly the size of a pencil eraser
- New mole appearing after age 30 — most new moles in adults warrant evaluation
- A sore that does not heal or a mole that bleeds without trauma
When to Trust AI vs. See a Doctor
AI Is Reasonably Helpful For:
- Learning the ABCDE criteria for melanoma detection
- Understanding general risk factors for skin cancer
- Getting motivated to perform regular skin self-examinations
- Learning about sun protection and prevention strategies
See a Doctor When:
- Any mole is changing in size, shape, or color
- A mole has irregular borders or multiple colors
- A new pigmented lesion appears after age 30
- A skin lesion bleeds, crusts, or fails to heal
- You have a family history of melanoma and notice any suspicious moles
- You have many moles (more than 50) and need a baseline skin exam
- Any skin change is causing you concern regardless of whether it fits the ABCDE criteria
Methodology
Each AI model received the identical patient scenario prompt. Responses were evaluated by the mdtalks editorial team using our standardized evaluation framework, which assesses factual accuracy against current dermatology guidelines, completeness of safety warnings, readability for a general audience, and appropriateness of the recommendation to seek professional care. Scores reflect composite ratings across these dimensions.
Key Takeaways
- Claude 3.5 scored highest (9.0) for its comprehensive coverage of melanoma signs including the often-overlooked “ugly duckling” sign
- All models correctly identified the ABCDE criteria, but varied in urgency and completeness
- No AI model can visually evaluate a mole — dermatologist examination with dermoscopy is essential
- Early detection is the single most important factor in melanoma survival
- Gemini scored lowest (7.3) due to insufficient urgency and incomplete information
Next Steps
Learn more about how AI handles medical questions and its limitations for serious conditions:
- Can AI Replace Your Doctor? — why AI cannot substitute for dermatological examinations
- How Accurate Is Medical AI? — understanding AI reliability for cancer-related questions
- How to Ask AI Health Questions Safely — best practices for using AI for health information
- Compare Medical AI Models — see how models handle different health scenarios
Published on mdtalks.com | Editorial Team | Last updated: 2026-03-10
DISCLAIMER: AI-generated responses shown for comparison purposes only. This is NOT medical advice. Always consult a licensed healthcare professional for medical decisions.