Haitao He
Ensemble multiboost based on ripper classifier for prediction of imbalanced software defect data
He, Haitao; Zhang, Xu; Wang, Qian; Ren, Jiadong; Liu, Jiaxin; Zhao, Xiaolin; Cheng, Yongqiang
Authors
Xu Zhang
Qian Wang
Jiadong Ren
Jiaxin Liu
Xiaolin Zhao
Yongqiang Cheng
Abstract
Identifying defective software entities is essential to ensure software quality during software development. However, the high dimensionality and class distribution imbalance of software defect data seriously affect software defect prediction performance. In order to solve this problem, this paper proposes an Ensemble MultiBoost based on RIPPER classifier for prediction of imbalanced Software Defect data, called EMR_SD. Firstly, the algorithm uses principal component analysis (PCA) method to find out the most effective features from the original features of the data set, so as to achieve the purpose of dimensionality reduction and redundancy removal. Furthermore, the combined sampling method of adaptive synthetic sampling (ADASYN) and random sampling without replacement is performed to solve the problem of data class imbalance. This classifier establishes association rules based on attributes and classes, using MultiBoost to reduce deviation and variance, so as to achieve the purpose of reducing classification error. The proposed prediction model is evaluated experimentally on the NASA MDP public datasets and compared with existing similar algorithms. The results show that EMR-SD algorithm is superior to DNC, CEL and other defect prediction techniques in most evaluation indicators, which proves the effectiveness of the algorithm.
Citation
He, H., Zhang, X., Wang, Q., Ren, J., Liu, J., Zhao, X., & Cheng, Y. (2019). Ensemble multiboost based on ripper classifier for prediction of imbalanced software defect data. IEEE Access, 7, 110333-110343. https://doi.org/10.1109/ACCESS.2019.2934128
Journal Article Type | Article |
---|---|
Acceptance Date | Jul 30, 2019 |
Online Publication Date | Aug 9, 2019 |
Publication Date | Aug 9, 2019 |
Deposit Date | Aug 30, 2019 |
Publicly Available Date | Aug 30, 2019 |
Journal | IEEE Access |
Electronic ISSN | 2169-3536 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Volume | 7 |
Pages | 110333-110343 |
DOI | https://doi.org/10.1109/ACCESS.2019.2934128 |
Keywords | General Engineering; General Materials Science; General Computer Science |
Public URL | https://hull-repository.worktribe.com/output/2505243 |
Publisher URL | https://ieeexplore.ieee.org/document/8793088 |
Contract Date | Aug 30, 2019 |
Files
Published article
(6.7 Mb)
PDF
Copyright Statement
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
You might also like
A LDA-Based Social Media Data Mining Framework for Plastic Circular Economy
(2024)
Journal Article
Using outlier elimination to assess learning-based correspondence matching methods
(2024)
Journal Article
Information Rich Voxel Grid for Use in Heterogeneous Multi-Agent Robotics
(2023)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search