Qian Wang
Mining frequent biological sequences based on bitmap without candidate sequence generation
Wang, Qian; Davis, Darryl N.; Ren, Jiadong
Authors
Darryl N. Davis
Jiadong Ren
Abstract
Biological sequences carry a lot of important genetic information of organisms. Furthermore, there is an inheritance law related to protein function and structure which is useful for applications such as disease prediction. Frequent sequence mining is a core technique for association rule discovery, but existing algorithms suffer from low efficiency or poor error rate because biological sequences differ from general sequences with more characteristics. In this paper, an algorithm for mining Frequent Biological Sequence based on Bitmap, FBSB, is proposed. FBSB uses bitmaps as the simple data structure and transforms each row into a quicksort list QS-list for sequence growth. For the continuity and accuracy requirement of biological sequence mining, tested sequences used during the mining process of FBSB are real ones instead of generated candidates, and all the frequent sequences can be mined without any errors. Comparing with other algorithms, the experimental results show that FBSB can achieve a better performance on both run time and scalability.
Citation
Wang, Q., Davis, D. N., & Ren, J. (2016). Mining frequent biological sequences based on bitmap without candidate sequence generation. Computers in biology and medicine, 69, 152-157. https://doi.org/10.1016/j.compbiomed.2015.12.016
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 22, 2015 |
Online Publication Date | Dec 30, 2015 |
Publication Date | Feb 1, 2016 |
Deposit Date | Jan 6, 2016 |
Publicly Available Date | Nov 23, 2017 |
Journal | Computers in biology and medicine |
Print ISSN | 0010-4825 |
Publisher | Elsevier |
Peer Reviewed | Peer Reviewed |
Volume | 69 |
Pages | 152-157 |
DOI | https://doi.org/10.1016/j.compbiomed.2015.12.016 |
Keywords | Biological sequence; Frequent pattern; Bitmap; Quicksort list |
Public URL | https://hull-repository.worktribe.com/output/383829 |
Publisher URL | http://www.sciencedirect.com/science/article/pii/S0010482515004096 |
Additional Information | Authors' accepted manuscript of an article published in: Computers in biology and medicine, 2016, v.69. |
Contract Date | Nov 23, 2017 |
Files
Article.pdf
(419 Kb)
PDF
Copyright Statement
© 2015, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
A multi-agent decision support system for stock trading
(2002)
Journal Article
Control states and complete agent architectures
(2001)
Journal Article
A "Society of Mind" cognitive architecture based on the principles of artificial economics
(2010)
Journal Article
Generating and verifying risk prediction models using data mining
(2009)
Book Chapter
Alert rules for remote monitoring of cardiovascular patients
(2012)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search