Skip to main content

Research Repository

Advanced Search

Predicting probable Alzheimer's disease using linguistic deficits and biomarkers

Orimaye, Sylvester O.; Wong, Jojo S-M.; Golden, Karen J.; Wong, Chee P.; Soyiri, Ireneous N.

Authors

Sylvester O. Orimaye

Jojo S-M. Wong

Karen J. Golden

Chee P. Wong



Abstract

Background
The manual diagnosis of neurodegenerative disorders such as Alzheimer’s disease (AD) and related Dementias has been a challenge. Currently, these disorders are diagnosed using specific clinical diagnostic criteria and neuropsychological examinations. The use of several Machine Learning algorithms to build automated diagnostic models using low-level linguistic features resulting from verbal utterances could aid diagnosis of patients with probable AD from a large population. For this purpose, we developed different Machine Learning models on the DementiaBank language transcript clinical dataset, consisting of 99 patients with probable AD and 99 healthy controls.

Results
Our models learned several syntactic, lexical, and n-gram linguistic biomarkers to distinguish the probable AD group from the healthy group. In contrast to the healthy group, we found that the probable AD patients had significantly less usage of syntactic components and significantly higher usage of lexical components in their language. Also, we observed a significant difference in the use of n-grams as the healthy group were able to identify and make sense of more objects in their n-grams than the probable AD group. As such, our best diagnostic model significantly distinguished the probable AD group from the healthy elderly group with a better Area Under the Receiving Operating Characteristics Curve (AUC) using the Support Vector Machines (SVM).

Conclusions
Experimental and statistical evaluations suggest that using ML algorithms for learning linguistic biomarkers from the verbal utterances of elderly individuals could help the clinical diagnosis of probable AD. We emphasise that the best ML model for predicting the disease group combines significant syntactic, lexical and top n-gram features. However, there is a need to train the diagnostic models on larger datasets, which could lead to a better AUC and clinical diagnosis of probable AD.

Citation

Orimaye, S. O., Wong, J. S.-M., Golden, K. J., Wong, C. P., & Soyiri, I. N. (2017). Predicting probable Alzheimer's disease using linguistic deficits and biomarkers. BMC Bioinformatics, 18(1), Article 34. https://doi.org/10.1186/s12859-016-1456-0

Journal Article Type Article
Acceptance Date Dec 31, 2016
Online Publication Date Jan 14, 2017
Publication Date 2017-12
Deposit Date May 15, 2019
Publicly Available Date May 16, 2019
Journal BMC Bioinformatics
Print ISSN 1471-2105
Publisher Springer Verlag
Peer Reviewed Peer Reviewed
Volume 18
Issue 1
Article Number 34
DOI https://doi.org/10.1186/s12859-016-1456-0
Keywords Alzheimer's disease; Neurolinguistics; Clinical diagnostics; Prediction; Machine learning
Public URL https://hull-repository.worktribe.com/output/1740687
Publisher URL https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1456-0
Contract Date May 16, 2019

Files






You might also like



Downloadable Citations