Editorial illustration for Explainable ML Classifies Alzheimer's Early in 1,641 ADNI Subjects
Explainable ML Classifies Alzheimer's Early in 1,641...
Explainable ML Classifies Alzheimer's Early in 1,641 ADNI Subjects
Alzheimer’s disease touches more than 55 million people worldwide, yet clinicians still lack a reliable, interpretable way to separate normal cognition, mild cognitive impairment and full‑blown dementia using everyday assessments. Why does this matter? Early, accurate labeling could steer treatment and research before irreversible damage sets in.
Here’s the thing: a team built an XGBoost model that ingests just eight routine clinical variables—MMSE, CDR Global, CDR‑SB, MoCA, FAQ, age, sex and education—from the Alzheimer’s Disease Neuroimaging Initiative (ADNI). While the algorithm’s hyperparameters were tuned over 50 Optuna trials and class imbalance handled with SMOTE, its performance was measured with a suite of metrics: macro AUC‑ROC, macro F1, balanced accuracy and Cohen’s κ, each backed by 1,000‑iteration bootstrap confidence intervals.
On a baseline cohort of 1,641 subjects (608 NC, 767 MCI, 266 AD), five‑fold cross‑validation yielded a mean macro AUC of 0.983 and accuracy of 0.944. A held‑out test of 247 participants delivered a macro AUC of 0.982 (95 % CI 0.965–0.995) and κ of 0.909. SHAP values flagged CDR Global as the top driver for NC and MCI, while CDR‑SB and MMSE together steered AD classification. The result is an explainable model that approaches near‑perfect three‑class detection using only standard clinical data.
Performance was evaluated by macro AUC-ROC with 1,000-iteration bootstrap 95% confidence intervals, macro F1, balanced accuracy, and Cohen's kappa. Results: The dataset comprised 1,641 baseline subjects (608 NC, 767 MCI, 266 AD). On five-fold cross-validation, mean macro AUC was 0.983 (SD 0.007), accuracy 0.944 (SD 0.006), and macro F1 0.929 (SD 0.008).
On the held-out test set (n = 247), macro AUC was 0.982 (95% CI: 0.965--0.995), accuracy 0.943, balanced accuracy 0.932, macro F1 0.927, and Cohen's kappa 0.909. SHAP analysis identified CDR Global as the dominant predictor for NC and MCI, while CDR-SB and MMSE together drove AD classification. Conclusion: An explainable machine learning model trained on routine clinical assessments achieves near-perfect three-class Alzheimer's detection.
Why this matters
We see an XGBoost model that classifies normal cognition, mild cognitive impairment, and Alzheimer’s disease with a reported macro AUC of 0.983 on five‑fold cross‑validation. Eight routine clinical measures—MMSE, CDR Global, CDR‑SB among them—drive the predictions, and the authors stress that the approach is explainable. The dataset, 1,641 ADNI baseline subjects, offers a balanced mix of 608 NC, 767 MCI and 266 AD cases, giving the results statistical weight.
Yet the study stops short of testing the model on external cohorts, so its performance outside the ADNI framework is unclear. Likewise, while macro AUC is impressive, the article does not disclose macro F1, balanced accuracy or Cohen’s kappa values, leaving a gap in our understanding of practical utility. For developers, the work showcases that high‑performing, interpretable classifiers can be built from modest clinical inputs, but for founders and researchers the path to deployment still demands validation in real‑world settings and transparent reporting of all metrics.
Further Reading
- Deep Learning Approaches with Explainable AI for Differentiating Alzheimer Disease - arXiv
- An explainable machine learning approach for Alzheimer's disease classification and prediction - PMC
- Early diagnosis of Alzheimer's disease using machine learning - PMC
- Alzheimer's Disease Neuroimaging Initiative (ADNI) - ADNI / USC
- Alzheimer's Disease Neuroimaging Initiative (ADNI) - ClinicalTrials.gov record - ClinicalTrials.gov