DOI QR코드

DOI QR Code

RLDB: Robust Local Difference Binary Descriptor with Integrated Learning-based Optimization

  • Sun, Huitao (Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology) ;
  • Li, Muguo (State Key Laboratory of Coastal and Offshore Engineering, Dalian University of Technology)
  • Received : 2016.10.24
  • Accepted : 2018.04.24
  • Published : 2018.09.30

Abstract

Local binary descriptors are well-suited for many real-time and/or large-scale computer vision applications, while their low computational complexity is usually accompanied by the limitation of performance. In this paper, we propose a new optimization framework, RLDB (Robust-LDB), to improve a typical region-based binary descriptor LDB (local difference binary) and maintain its computational simplicity. RLDB extends the multi-feature strategy of LDB and applies a more complete region-comparing configuration. A cascade bit selection method is utilized to select the more representative patterns from massive comparison pairs and an online learning strategy further optimizes descriptor for each specific patch separately. They both incorporate LDP (linear discriminant projections) principle to jointly guarantee the robustness and distinctiveness of the features from various scales. Experimental results demonstrate that this integrated learning framework significantly enhances LDB. The improved descriptor achieves a performance comparable to floating-point descriptors on many benchmarks and retains a high computing speed similar to most binary descriptors, which better satisfies the demands of applications.

Keywords

References

  1. M. U. Kim and K. Yoon, "Performance evaluation of large-scale object recognition system using bag-of-visual words model," Multimedia Tools & Applications, vol. 74, no. 7, pp. 2499-2517, April, 2015. https://doi.org/10.1007/s11042-014-2152-6
  2. L. Mansourian, M. T. Abdullah, L. N. Abdullah, A. Azman and M. R. Mustaffa, "A Salient Based Bag of Visual Word model (SBBoVW): improvements toward difficult object recognition and object location in image retrieval," KSII Transactions on Internet and Information Systems, vol. 10, no. 2, pp. 769-786, February, 2016. https://doi.org/10.3837/tiis.2016.02.018
  3. Y. Furukawa and J. Ponce, "Accurate, dense, and robust multiview stereopsis," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 8, pp. 1362-1376, August, 2010. https://doi.org/10.1109/TPAMI.2009.161
  4. H. Li, Y. Guan, L. Liu, F. Wang and L. Wang, "Re-ranking for microblog retrieval via multiple graph model," Multimedia Tools & Applications, vol. 75, no. 15, pp. 8939-8954, August, 2016. https://doi.org/10.1007/s11042-014-2336-0
  5. X. Zhang, B. Guo and Y. Yan, "Image retrieval method based on IPDSH and SRIP," KSII Transactions on Internet and Information Systems, vol. 8, no. 5, pp. 1676-1689, May, 2014. https://doi.org/10.3837/tiis.2014.05.010
  6. Y. Guo, G. Zhao, Z. Zhou and M. Pietikainen, "Video texture synthesis with multi-frame LBP-TOP and diffeomorphic growth model," IEEE Transactions on Image Processing, vol. 22, no. 10, pp. 3879-3891, October, 2013. https://doi.org/10.1109/TIP.2013.2263148
  7. D. G. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, November, 2004. https://doi.org/10.1023/B:VISI.0000029664.99615.94
  8. H. Bay, A. Ess, T. Tuytelaars and L. V. Gool, "Speeded-Up Robust Features (SURF)," Computer Vision & Image Understanding, vol. 110, no. 3, pp. 346-359, June, 2008. https://doi.org/10.1016/j.cviu.2007.09.014
  9. E. Tola, V. Lepetit and P. Fua, "DAISY: an efficient dense descriptor applied to wide-baseline stereo," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 5, pp. 815-830, May, 2010. https://doi.org/10.1109/TPAMI.2009.77
  10. M. Calonder, V. Lepetit, C. Strecha and P. Fua, "BRIEF: Binary Robust Independent Elementary Features," in Proc. of European Conference on Computer Vision, pp. 778-792, September 5-11, 2010.
  11. E. Rublee, V. Rabaud, K. Konolige and G. Bradski, "ORB: an efficient alternative to SIFT or SURF," in Proc. of IEEE International Conference on Computer Vision, pp. 2564-2571, November 6-13, 2011.
  12. Y. Ke and R. Sukthankar, "PCA-SIFT: a more distinctive representation for local image descriptors," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 506-513, June 27-July 2, 2004.
  13. C. Strecha, A. Bronstein, M. Bronstein and P. Fua, "LDAHash: improved matching with smaller descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 34, no. 1, pp. 66-78, January 2012. https://doi.org/10.1109/TPAMI.2011.103
  14. H. Cai, K. Mikolajczyk and J. Matas, "Learning linear discriminant projections for dimensionality reduction of image descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 2, pp. 338-352, February, 2011. https://doi.org/10.1109/TPAMI.2010.89
  15. R. Ortiz, "FREAK: fast retina keypoint," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 510-517, June 16-21, 2012.
  16. S. Leutenegger, M. Chli and R. Y. Siegwart, "BRISK: binary robust invariant scalable keypoints," in Proc. of IEEE International Conference on Computer Vision, pp. 2548-2555, November 6-13, 2011.
  17. X. Yang and K. T. T. Cheng, "Local difference binary for ultrafast and distinctive feature description," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 36, no. 1, pp. 188-194, January, 2014. https://doi.org/10.1109/TPAMI.2013.150
  18. B. Fan, Q. Kong, T. Trzcinski, Z. Wang and C. Pan, "Receptive fields selection for binary feature description," IEEE Transactions on Image Processing, vol. 23, no. 6, pp. 2583-2595, June, 2014. https://doi.org/10.1109/TIP.2014.2317981
  19. Y. Gao, W. Huang and Y. Qiao, "Local multi-grouped binary descriptor with ring-based pooling configuration and optimization," IEEE Transactions on Image Processing, vol. 24, no. 12, pp. 4820-4833, December, 2015. https://doi.org/10.1109/TIP.2015.2469093
  20. V. Balntas, L. Tang and K. Mikolajczyk, "BOLD-binary online learned descriptor for efficient image matching," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2367-2375, June 7-12, 2015.
  21. T. Trzcinski and V. Lepetit, "Efficient discriminative projections for compact binary descriptors," in Proc. of European Conference on Computer Vision, pp. 228-242, October 7-13, 2012.
  22. T. Trzcinski, M. Christoudias and V. Lepetit, "Learning image descriptors with boosting," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 37, no. 3, pp. 597-610, March, 2015. https://doi.org/10.1109/TPAMI.2014.2343961
  23. S. Winder, G. Hua and M. Brown, "Picking the best daisy," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 178-185, June 20-26, 2009.
  24. V. Balntas, E. Johns, L. Tang and K. Mikolajczyk, "PN-Net: conjoined triple deep network for learning local image descriptors," arXiv preprint, January, 2016.
  25. X. Han, T. Leung, Y. Jia, R. Sukthankar and A. C. Berg, "MatchNet: unifying feature and metric learning for patch-based matching," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3279-3286, June 7-12, 2015.
  26. K. Simonyan, A. Vedaldi and A. Zisserman, "Learning local feature descriptors using convex optimisation," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 36, no. 8, pp. 1573-1585, August, 2014. https://doi.org/10.1109/TPAMI.2014.2301163
  27. M. Brown, G. Hua and S. Winder, "Discriminative learning of local image descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 33, no. 1, pp. 43-57, January, 2011. https://doi.org/10.1109/TPAMI.2010.54
  28. A. Richardson and E. Olson, "TailoredBRIEF: online per-feature descriptor customization," in Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems, September 28-October 2, 2015.
  29. G. Hua, M. Brown and S. Winder, "Discriminant embedding for local image descriptors," in Proc. of IEEE International Conference on Computer Vision, pp. 1-8, October 14-20, 2007.
  30. P. Viola and M. J. Jones, "Robust real-time face detection," International Journal of Computer Vision, vol. 57, no. 2, pp. 137-154, May, 2004. https://doi.org/10.1023/B:VISI.0000013087.49260.fb
  31. B. Guo and J. Liu, "Real-time keypoint-based object tracking via online learning," in Proc. of IEEE International Conference on Information Science and Technology, pp. 907-911, March 23-25, 2013.
  32. J. M. Morel and G. Yu, "ASIFT: a new framework for fully affine invariant image comparison," Siam Journal on Imaging Sciences, vol. 2, no. 2, pp. 438-469, April, 2009. https://doi.org/10.1137/080732730
  33. M. Ozuysal, M. Calonder, V. Lepetit and P. Fua, "Fast keypoint recognition using random ferns," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 32, no. 3, pp. 448-61, March, 2010. https://doi.org/10.1109/TPAMI.2009.23
  34. S. A. J. Winder and M. Brown, "Learning local image descriptors," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1-8, June 17-22, 2007.
  35. M. Brown, "Multi-view stereo correspondence dataset,"
  36. K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, October 2005. https://doi.org/10.1109/TPAMI.2005.188
  37. Visual Geometry Group, Department of Engineering Science, University of Oxford, "Affine covariant regions datasets,"
  38. H. Shao, T. Svoboda and L. V. Gool, "Zubud-zurich buildings database for image based recognition, "
  39. D. Nister and H. Stewenius, "Scalable recognition with a vocabulary tree," in Proc. of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2161-2168, June 17-22, 2006.
  40. D. Nister and H. Stewenius, "Recognition benchmark images,"
  41. M. Oszust, "An optimisation approach to the design of a fast, compact and distinctive binary descriptor," Signal Image & Video Processing, vol. 10, no. 8, pp. 1-8, November 2016. https://doi.org/10.1007/s11760-014-0693-9
  42. S. Liao, X. Zhu, Z. Lei, L. Zhang and S. Z. Li, "Learning multi-scale block local binary patterns for face recognition," in Proc. of International Conference on Biometrics, pp. 828-837, August 27-29, 2007.