Xintao Ding
Local keypoint-based Faster R-CNN
Ding, Xintao; Li, Qingde; Cheng, Yongqiang; Wang, Jinbao; Bian, Weixin; Jie, Biao
Abstract
Region-based Convolutional Neural Network (R-CNN) detectors have achieved state-of-the-art results on various challenging benchmarks. Although R-CNN has achieved high detection performance, the research of local information in producing candidates is insufficient. In this paper, we design a Keypoint-based Faster R-CNN (K-Faster) method for object detection. K-Faster incorporates local keypoints in Faster R-CNN to improve the detection performance. In detail, a sparse descriptor, which first detects the points of interest in a given image and then samples a local patch and describes its invariant features, is first employed to produce keypoints. All 2-combinations of the produced keypoints are second selected to generate keypoint anchors, which are helpful for object detection. The heterogeneously distributed anchors are then encoded in feature maps based on their areas and center coordinates. Finally, the keypoint anchors are coupled with the anchors produced by Faster R-CNN, and the coupled anchors are used for Region Proposal Network (RPN) training. Comparison experiments are implemented on PASCAL VOC 07/12 and MS COCO. The experimental results show that our K-Faster approach not only increases the mean Average Precision (mAP) performance but also improves the positioning precision of the detected boxes.
Citation
Ding, X., Li, Q., Cheng, Y., Wang, J., Bian, W., & Jie, B. (in press). Local keypoint-based Faster R-CNN. Applied Intelligence, https://doi.org/10.1007/s10489-020-01665-9
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 6, 2020 |
Online Publication Date | Apr 28, 2020 |
Deposit Date | Jul 20, 2020 |
Publicly Available Date | Apr 29, 2021 |
Journal | Applied Intelligence |
Print ISSN | 0924-669X |
Publisher | Springer (part of Springer Nature) |
Peer Reviewed | Peer Reviewed |
DOI | https://doi.org/10.1007/s10489-020-01665-9 |
Keywords | Keypoint; SIFT; Convolutional neural network; Faster R-CNN |
Public URL | https://hull-repository.worktribe.com/output/3546331 |
Publisher URL | https://link.springer.com/article/10.1007/s10489-020-01665-9 |
Additional Information | First Online: 28 April 2020 |
Files
Article
(1.8 Mb)
PDF
Copyright Statement
©2020 University of Hull
You might also like
ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation
(2024)
Journal Article
Using outlier elimination to assess learning-based correspondence matching methods
(2024)
Journal Article
LViT: Language meets Vision Transformer in Medical Image Segmentation
(2023)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search