Dr Neil Gordon N.A.Gordon@hull.ac.uk
Reader
Dr Neil Gordon N.A.Gordon@hull.ac.uk
Reader
Dr Chandrasekhar Kambhampati C.Kambhampati@hull.ac.uk
Reader in Computer Science
Asmaa Alabad
This article provides an optimisation method using a Genetic Algorithm approach to apply feature selection techniques for large data sets to improve accuracy. This is achieved through improved classification, a reduced number of features, and furthermore it aids in interpreting the model. A clinical dataset, based on heart failure, is used to illustrate the nature of the problem and to show the effectiveness of the techniques developed. Clinical datasets are sometimes characterised as having many variables. For instance, blood biochemistry data has more than 60 variables that have led to complexities in developing predictions of outcomes using machine-learning and other algorithms. Hence, techniques to make them more
tractable are required. Genetic Algorithms can provide an efficient and low numerically complex method for effectively selecting features. In this paper, a way to estimate the number of required variables is presented, and a genetic algorithm is used in a “wrapper” form to select features for a case study of heart failure data.
Additionally, different initial populations and termination conditions are used to arrive at a set of optimal features, and these are then compared with the features obtained using traditional methodologies. The paper provides a framework for estimating the number of variables and generations required for a suitable solution.
Gordon, N., Kambhampati, C., & Alabad, A. (2022). Addressing Optimisation Challenges for Datasets with Many Variables, Using Genetic Algorithms to Implement Feature Selection. AI, Computer Science and Robotics Technology, 1, 1-21. https://doi.org/10.5772/acrt.01
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 1, 2022 |
Online Publication Date | Mar 28, 2022 |
Publication Date | Mar 1, 2022 |
Deposit Date | Apr 26, 2022 |
Publicly Available Date | Oct 27, 2022 |
Journal | AI, Computer Science and Robotics Technology |
Peer Reviewed | Peer Reviewed |
Volume | 1 |
Pages | 1-21 |
DOI | https://doi.org/10.5772/acrt.01 |
Keywords | Feature selection; Feature optimisation; Genetic algorithms; Human reasoning; Wrapper selection |
Public URL | https://hull-repository.worktribe.com/output/3984471 |
Published article
(2.8 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0
Copyright Statement
© The Author(s) 2022. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons. org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
A safety analysis approach to clinical workflows : application and evaluation
(2014)
Journal Article
Tetrads of lines spanning PG(7,2)
(2013)
Journal Article
Maximising gain for minimal pain: Utilising natural game mechanics
(2013)
Journal Article
Partial spreads in PG(4,2) and flats in PG(9,2) external to the Grassmannian G1,4,2
(2005)
Journal Article
The quintic Grassmannian g(1,4,2) in PG(9,2)
(2004)
Journal Article
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Advanced Search