DISCRIMINATIVE DICTIONARY PAIR LEARNING FOR IMAGE CLASSIFICATION

Nguyen Hoang Vu, Tran Quoc Cuong, Tran Thanh Phong
Author affiliations

Authors

  • Nguyen Hoang Vu Tien Giang University
  • Tran Quoc Cuong Tien Giang University
  • Tran Thanh Phong Tien Giang University

DOI:

https://doi.org/10.15625/1813-9663/36/4/15105

Keywords:

Dictionary learning, synthesis and analysis dictionary, incoherent dictionary, classification, face recognition

Abstract

Dictionary learning (DL) for sparse coding has been widely applied in the field of computer vision. Many DL approaches have been developed recently to solve pattern classification problems and have achieved promising performance. In this paper, to improve the discriminability of the popular dictionary pair learning (DPL) algorithm, we propose a new method called discriminative dictionary pair learning (DDPL) for image classification. To achieve the goal of signal representation and discrimination, we impose the incoherence constraints on the synthesis dictionary and the low-rank regularization on the analysis dictionary. The DDPL method ensures that the learned dictionary has the powerful discriminative ability and the signals are more separable after coding. We evaluate the proposed method on benchmark image databases in comparison with existing DL methods. The experimental results demonstrate that our method outperforms many recently proposed dictionary learning approaches.

Metrics

Metrics Loading ...

References

J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, “Robust face recognition via sparse representation,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 2, pp. 210–227, 2009, doi: 10.1109/TPAMI.2008.79.

Y. Xu, D. Zhang, J. Yang, and J. Y. Yang, “A two-phase test sample sparse representation method for use with face recognition,” IEEE Trans. Circuits Syst. Video Technol., vol. 21, no. 9, pp. 1255–1262, Sep. 2011, doi: 10.1109/TCSVT.2011.2138790.

R. He, W. S. Zheng, B. G. Hu, and X. W. Kong, “Two-stage nonnegative sparse representation for large-scale face recognition,” IEEE Trans. Neural Networks Learn. Syst., vol. 24, no. 1, pp. 35–46, 2013, doi: 10.1109/TNNLS.2012.2226471.

Z. Li, Z. Lai, Y. Xu, J. Yang, and D. Zhang, “A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification,” IEEE Trans. Neural Networks Learn. Syst., vol. 28, no. 2, pp. 278–293, Feb. 2017, doi: 10.1109/TNNLS.2015.2508025.

M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Trans. Image Process., vol. 15, no. 12, pp. 3736–3745, Dec. 2006, doi: 10.1109/TIP.2006.881969.

X. Y. Jing, F. Wu, X. Zhu, X. Dong, F. Ma, and Z. Li, “Multi-spectral low-rank structured dictionary learning for face recognition,” Pattern Recognit., vol. 59, pp. 14–25, Nov. 2016, doi: 10.1016/j.patcog.2016.01.023.

M. Aharon, M. Elad, and A. Bruckstein, “K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Trans. Signal Process., vol. 54, no. 11, pp. 4311–4322, 2006, doi: 10.1109/TSP.2006.881199.

R. Rubinstein, T. Peleg, M. E.-I. T. on Signal, and undefined 2012, “Analysis K-SVD: A dictionary-learning algorithm for the analysis sparse model,” ieeexplore.ieee.org.

S. Gu, L. Zhang, W. Zuo, and X. Feng, “Projective dictionary pair learning for pattern classification,” Adv. Neural Inf. Process. Syst., vol. 1, no. January, pp. 793–801, 2014.

Q. Zhang and B. Li, “Discriminative K-SVD for dictionary learning in face recognition,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 2691–2698, 2010, doi: 10.1109/CVPR.2010.5539989.

Z. Jiang, Z. Lin, and L. S. Davis, “Label consistent K-SVD: Learning a discriminative dictionary for recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 11, pp. 2651–2664, 2013, doi: 10.1109/TPAMI.2013.88.

I. Ramirez, P. Sprechmann, and G. Sapiro, “Classification and clustering via dictionary learning with structured incoherence and shared features,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010, pp. 3501–3508, doi: 10.1109/CVPR.2010.5539964.

M. Yang, L. Zhang, X. Feng, and D. Zhang, “Fisher Discrimination Dictionary Learning for sparse representation,” in Proceedings of the IEEE International Conference on Computer Vision, 2011, pp. 543–550, doi: 10.1109/ICCV.2011.6126286.

H. Nguyen, W. Yang, B. Sheng, and C. Sun, “Discriminative low-rank dictionary learning for face recognition,” Neurocomputing, vol. 173, pp. 541–551, Jan. 2016, doi: 10.1016/j.neucom.2015.07.031.

T. H. Vu and V. Monga, “Fast low-rank shared dictionary learning for image classification,” IEEE Trans. Image Process., vol. 26, no. 11, pp. 5160–5175, Nov. 2017, doi: 10.1109/TIP.2017.2729885.

B. Mailhé, D. Barchiesi, and M. D. Plumbley, “INK-SVD: Learning incoherent dictionaries for sparse representations,” in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2012, pp. 3573–3576, doi: 10.1109/ICASSP.2012.6288688.

T. Lin, S. Liu, and H. Zha, “Incoherent dictionary learning for sparse representation,” in Proceedings - International Conference on Pattern Recognition, 2012, pp. 1237–1240.

C.-F. Chen, C.-P. Wei, and Y.-C. F. Wang, “Low-rank matrix recovery with structural incoherence for robust face recognition,” ieeexplore.ieee.org, 2012, doi: 10.1109/CVPR.2012.6247981.

H. Yin and X. Wu, “Face recognition based on structural incoherence and low rank projection,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, vol. 9937 LNCS, pp. 68–78, doi: 10.1007/978-3-319-46257-8_8.

X. Chen and J. Gao, “Discrimination projective dictionary pair methods in dictionary learning1,” Proc. - 2015 8th Int. Congr. Image Signal Process. CISP 2015, no. October 2015, pp. 204–208, 2016, doi: 10.1109/CISP.2015.7407876.

M. Yang, W. Liu, W. Luo, and L. Shen, “Analysis-Synthesis dictionary learning for universality-particularity representation based classification,” 30th AAAI Conf. Artif. Intell. AAAI 2016, pp. 2251–2257, 2016.

M. Yang, H. Chang, W. Luo, and J. Yang, “Fisher discrimination dictionary pair learning for image classification,” Neurocomputing, vol. 269, pp. 13–20, 2017, doi: 10.1016/j.neucom.2016.08.146.

M. Yang, H. Chang, and W. Luo, “Discriminative analysis-synthesis dictionary learning for image classification,” Neurocomputing, vol. 219, no. September 2016, pp. 404–411, 2017, doi: 10.1016/j.neucom.2016.09.037.

B. Chen, J. Li, B. Ma, and G. Wei, “Discriminative dictionary pair learning based on differentiable support vector function for visual recognition,” Neurocomputing, vol. 272, pp. 306–313, 2018, doi: 10.1016/j.neucom.2017.07.003.

Z. Li, Z. Zhang, J. Qin, S. Li, and H. Cai, “Low-rank analysis–synthesis dictionary learning with adaptively ordinal locality,” Neural Networks, vol. 119, pp. 93–112, 2019, doi: 10.1016/j.neunet.2019.07.013.

Z. Zhang, W. Jiang, Z. Zhang, S. Li, G. Liu, and J. Qin, “Scalable Block-Diagonal Locality-Constrained Projective Dictionary Learning,” IJCAI Int. Jt. Conf. Artif. Intell., vol. 2019-August, pp. 4376–4382, May 2019.

Y. Sun et al., “Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning,” IEEE Trans. Neural Networks Learn. Syst., pp. 1–15, Jan. 2020, doi: 10.1109/tnnls.2019.2954545.

J. A. Tropp, “Greed is Good: Algorithmic Results for Sparse Approximation,” IEEE Trans. Inf. THEORY, vol. 50, no. 10, 2004, doi: 10.1109/TIT.2004.834793.

D. Gabay and B. Mercier, “A dual algorithm for the solution of nonlinear variational problems via finite element approximation,” Comput. Math. with Appl., vol. 2, no. 1, pp. 17–40, Jan. 1976, doi: 10.1016/0898-1221(76)90003-1.

J. F. Cai, E. J. Candès, and Z. Shen, “A singular value thresholding algorithm for matrix completion,” SIAM J. Optim., vol. 20, no. 4, pp. 1956–1982, 2010, doi: 10.1137/080738970.

A. S. Georghiades, P. N. Belhumeur, and D. J. Kriegman, “From few to many: Illumination cone models for face recognition under variable lighting and pose,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 6, pp. 643–660, Jun. 2001, doi: 10.1109/34.927464.

A. Mart Nez and R. Benavente, “The AR Face Database,” CVC Tech. Rep., 1998.

F. S. Samaria and A. C. Harter, “Parameterisation of a stochastic model for human face identification,” in IEEE Workshop on Applications of Computer Vision - Proceedings, 1994, pp. 138–142, doi: 10.1109/acv.1994.341300.

D. B. Graham and N. M. Allinson, “Characterising Virtual Eigensignatures for General Purpose Face Recognition,” in Face Recognition, Springer Berlin Heidelberg, 1998, pp. 446–456.

L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories,” Comput. Vis. Image Underst., vol. 106, no. 1, pp. 59–70, Apr. 2007, doi: 10.1016/j.cviu.2005.09.012.

Downloads

Published

14-12-2020

How to Cite

[1]
N. Hoang Vu, T. Quoc Cuong, and T. Thanh Phong, “DISCRIMINATIVE DICTIONARY PAIR LEARNING FOR IMAGE CLASSIFICATION”, JCC, vol. 36, no. 4, p. 347–363, Dec. 2020.

Issue

Section

Articles