Dr Temitayo Matthew Fagbola Temitayo-Matthew.Fagbola@hull.ac.uk
Teaching Fellow
Dr Temitayo Matthew Fagbola Temitayo-Matthew.Fagbola@hull.ac.uk
Teaching Fellow
Colin Surendra Thakur
Oludayo Olugbara
News article classification is a recently growing area of interest in text classification because of its associated multiple matching categories. However, the weak reliability indices and ambiguities associated with state-of-the-art classifiers often employed make success in this domain very limited. Also, the high sensitivity and large disparity in performance results of classifiers to the varying nature of real-world datasets make the need for comparative evaluation inevitable. In this paper, the accuracy and computational time efficiency of the Kolmogorov Complexity Distance Measure (KCDM) and Artificial Neural Network (ANN) were experimentally evaluated for a prototype large dimensional news article classification problem. 2000 News articles from a dataset of 2225 British Broadcasting Corporation (BBC) news documents (including examples from sport, politics, entertainment, education and technology, and business) were used for categorical testing purposes. Porter's algorithm was used for word stemming after tokenization and stop-words removal, and a Normalized Term Frequency-Inverse Document Frequency (NTF-IDF) technique was adopted for feature extraction. Experimental results revealed that ANN performs better in terms of accuracy while the KCDM produced better results than ANN in terms of computational time efficiency.
Fagbola, T. M., Thakur, C. S., & Olugbara, O. (2019). News article classification using Kolmogorov complexity distance measure and artificial neural network. International Journal of Technology, 10(4), 710-720. https://doi.org/10.14716/ijtech.v10i4.2339
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 1, 2019 |
Online Publication Date | Jul 29, 2019 |
Publication Date | Jul 29, 2019 |
Deposit Date | Jan 28, 2024 |
Publicly Available Date | Feb 5, 2024 |
Journal | International Journal of Technology |
Print ISSN | 2086-9614 |
Electronic ISSN | 2087-2100 |
Publisher | Universitas Indonesia |
Peer Reviewed | Peer Reviewed |
Volume | 10 |
Issue | 4 |
Pages | 710-720 |
DOI | https://doi.org/10.14716/ijtech.v10i4.2339 |
Public URL | https://hull-repository.worktribe.com/output/4161535 |
Published paper
(741 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
© The Authors
UCTH Breast Cancer Dataset
(2023)
Data
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search