Professor Neil Gordon N.A.Gordon@hull.ac.uk
Professor in Computer Science
Addressing Optimisation Challenges for Datasets with Many Variables, Using Genetic Algorithms to Implement Feature Selection
Gordon, Neil; Kambhampati, Chandrasekhar; Alabad, Asmaa
Authors
Dr Chandrasekhar Kambhampati C.Kambhampati@hull.ac.uk
Reader in Computer Science
Asmaa Alabad
Abstract
This article provides an optimisation method using a Genetic Algorithm approach to apply feature selection techniques for large data sets to improve accuracy. This is achieved through improved classification, a reduced number of features, and furthermore it aids in interpreting the model. A clinical dataset, based on heart failure, is used to illustrate the nature of the problem and to show the effectiveness of the techniques developed. Clinical datasets are sometimes characterised as having many variables. For instance, blood biochemistry data has more than 60 variables that have led to complexities in developing predictions of outcomes using machine-learning and other algorithms. Hence, techniques to make them more
tractable are required. Genetic Algorithms can provide an efficient and low numerically complex method for effectively selecting features. In this paper, a way to estimate the number of required variables is presented, and a genetic algorithm is used in a “wrapper” form to select features for a case study of heart failure data.
Additionally, different initial populations and termination conditions are used to arrive at a set of optimal features, and these are then compared with the features obtained using traditional methodologies. The paper provides a framework for estimating the number of variables and generations required for a suitable solution.
Citation
Gordon, N., Kambhampati, C., & Alabad, A. (2022). Addressing Optimisation Challenges for Datasets with Many Variables, Using Genetic Algorithms to Implement Feature Selection. AI, Computer Science and Robotics Technology, 1, 1-21. https://doi.org/10.5772/acrt.01
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 1, 2022 |
Online Publication Date | Mar 28, 2022 |
Publication Date | Mar 1, 2022 |
Deposit Date | Apr 26, 2022 |
Publicly Available Date | May 4, 2022 |
Journal | AI, Computer Science and Robotics Technology |
Peer Reviewed | Peer Reviewed |
Volume | 1 |
Pages | 1-21 |
DOI | https://doi.org/10.5772/acrt.01 |
Keywords | Feature selection; Feature optimisation; Genetic algorithms; Human reasoning; Wrapper selection |
Public URL | https://hull-repository.worktribe.com/output/3984471 |
Files
Published article
(2.8 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0
Copyright Statement
© The Author(s) 2022. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons. org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
You might also like
Disease progression in chronic heart failure is linear: Insights from multistate modelling
(2024)
Journal Article
A LDA-Based Social Media Data Mining Framework for Plastic Circular Economy
(2024)
Journal Article
Locally fitting hyperplanes to high-dimensional data
(2022)
Journal Article
Ionic Imbalances and Coupling in Synchronization of Responses in Neurons
(2019)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search