Skip to main content

Research Repository

Advanced Search

Addressing Optimisation Challenges for Datasets with Many Variables, Using Genetic Algorithms to Implement Feature Selection

Gordon, Neil; Kambhampati, Chandrasekhar; Alabad, Asmaa

Authors

Asmaa Alabad



Abstract

This article provides an optimisation method using a Genetic Algorithm approach to apply feature selection techniques for large data sets to improve accuracy. This is achieved through improved classification, a reduced number of features, and furthermore it aids in interpreting the model. A clinical dataset, based on heart failure, is used to illustrate the nature of the problem and to show the effectiveness of the techniques developed. Clinical datasets are sometimes characterised as having many variables. For instance, blood biochemistry data has more than 60 variables that have led to complexities in developing predictions of outcomes using machine-learning and other algorithms. Hence, techniques to make them more
tractable are required. Genetic Algorithms can provide an efficient and low numerically complex method for effectively selecting features. In this paper, a way to estimate the number of required variables is presented, and a genetic algorithm is used in a “wrapper” form to select features for a case study of heart failure data.
Additionally, different initial populations and termination conditions are used to arrive at a set of optimal features, and these are then compared with the features obtained using traditional methodologies. The paper provides a framework for estimating the number of variables and generations required for a suitable solution.

Citation

Gordon, N., Kambhampati, C., & Alabad, A. (2022). Addressing Optimisation Challenges for Datasets with Many Variables, Using Genetic Algorithms to Implement Feature Selection. AI, Computer Science and Robotics Technology, 1, 1-21. https://doi.org/10.5772/acrt.01

Journal Article Type Article
Acceptance Date Feb 1, 2022
Online Publication Date Mar 28, 2022
Publication Date Mar 1, 2022
Deposit Date Apr 26, 2022
Publicly Available Date May 4, 2022
Journal AI, Computer Science and Robotics Technology
Peer Reviewed Peer Reviewed
Volume 1
Pages 1-21
DOI https://doi.org/10.5772/acrt.01
Keywords Feature selection; Feature optimisation; Genetic algorithms; Human reasoning; Wrapper selection
Public URL https://hull-repository.worktribe.com/output/3984471

Files


Published article (2.8 Mb)
PDF

Publisher Licence URL
http://creativecommons.org/licenses/by/4.0

Copyright Statement
© The Author(s) 2022. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons. org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.





You might also like



Downloadable Citations