DISCRIMINATIVE DICTIONARY PAIR LEARNING FOR IMAGE CLASSIFICATION
Keywords:Dictionary learning, synthesis and analysis dictionary, incoherent dictionary, classification, face recognition
Dictionary learning (DL) for sparse coding has been widely applied in the field of computer vision. Many DL approaches have been developed recently to solve pattern classification problems and have achieved promising performance. In this paper, to improve the discriminability of the popular dictionary pair learning (DPL) algorithm, we propose a new method called discriminative dictionary pair learning (DDPL) for image classification. To achieve the goal of signal representation and discrimination, we impose the incoherence constraints on the synthesis dictionary and the low-rank regularization on the analysis dictionary. The DDPL method ensures that the learned dictionary has the powerful discriminative ability and the signals are more separable after coding. We evaluate the proposed method on benchmark image databases in comparison with existing DL methods. The experimental results demonstrate that our method outperforms many recently proposed dictionary learning approaches.
J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, “Robust face recognition via sparse representation,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 2, pp. 210–227, 2009, doi: 10.1109/TPAMI.2008.79.
Y. Xu, D. Zhang, J. Yang, and J. Y. Yang, “A two-phase test sample sparse representation method for use with face recognition,” IEEE Trans. Circuits Syst. Video Technol., vol. 21, no. 9, pp. 1255–1262, Sep. 2011, doi: 10.1109/TCSVT.2011.2138790.
R. He, W. S. Zheng, B. G. Hu, and X. W. Kong, “Two-stage nonnegative sparse representation for large-scale face recognition,” IEEE Trans. Neural Networks Learn. Syst., vol. 24, no. 1, pp. 35–46, 2013, doi: 10.1109/TNNLS.2012.2226471.
Z. Li, Z. Lai, Y. Xu, J. Yang, and D. Zhang, “A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification,” IEEE Trans. Neural Networks Learn. Syst., vol. 28, no. 2, pp. 278–293, Feb. 2017, doi: 10.1109/TNNLS.2015.2508025.
M. Elad and M. Aharon, “Image denoising via sparse and redundant representations over learned dictionaries,” IEEE Trans. Image Process., vol. 15, no. 12, pp. 3736–3745, Dec. 2006, doi: 10.1109/TIP.2006.881969.
X. Y. Jing, F. Wu, X. Zhu, X. Dong, F. Ma, and Z. Li, “Multi-spectral low-rank structured dictionary learning for face recognition,” Pattern Recognit., vol. 59, pp. 14–25, Nov. 2016, doi: 10.1016/j.patcog.2016.01.023.
M. Aharon, M. Elad, and A. Bruckstein, “K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation,” IEEE Trans. Signal Process., vol. 54, no. 11, pp. 4311–4322, 2006, doi: 10.1109/TSP.2006.881199.
R. Rubinstein, T. Peleg, M. E.-I. T. on Signal, and undefined 2012, “Analysis K-SVD: A dictionary-learning algorithm for the analysis sparse model,” ieeexplore.ieee.org.
S. Gu, L. Zhang, W. Zuo, and X. Feng, “Projective dictionary pair learning for pattern classification,” Adv. Neural Inf. Process. Syst., vol. 1, no. January, pp. 793–801, 2014.
Q. Zhang and B. Li, “Discriminative K-SVD for dictionary learning in face recognition,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 2691–2698, 2010, doi: 10.1109/CVPR.2010.5539989.
Z. Jiang, Z. Lin, and L. S. Davis, “Label consistent K-SVD: Learning a discriminative dictionary for recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 11, pp. 2651–2664, 2013, doi: 10.1109/TPAMI.2013.88.
I. Ramirez, P. Sprechmann, and G. Sapiro, “Classification and clustering via dictionary learning with structured incoherence and shared features,” in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010, pp. 3501–3508, doi: 10.1109/CVPR.2010.5539964.
M. Yang, L. Zhang, X. Feng, and D. Zhang, “Fisher Discrimination Dictionary Learning for sparse representation,” in Proceedings of the IEEE International Conference on Computer Vision, 2011, pp. 543–550, doi: 10.1109/ICCV.2011.6126286.
H. Nguyen, W. Yang, B. Sheng, and C. Sun, “Discriminative low-rank dictionary learning for face recognition,” Neurocomputing, vol. 173, pp. 541–551, Jan. 2016, doi: 10.1016/j.neucom.2015.07.031.
T. H. Vu and V. Monga, “Fast low-rank shared dictionary learning for image classification,” IEEE Trans. Image Process., vol. 26, no. 11, pp. 5160–5175, Nov. 2017, doi: 10.1109/TIP.2017.2729885.
B. Mailhé, D. Barchiesi, and M. D. Plumbley, “INK-SVD: Learning incoherent dictionaries for sparse representations,” in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2012, pp. 3573–3576, doi: 10.1109/ICASSP.2012.6288688.
T. Lin, S. Liu, and H. Zha, “Incoherent dictionary learning for sparse representation,” in Proceedings - International Conference on Pattern Recognition, 2012, pp. 1237–1240.
C.-F. Chen, C.-P. Wei, and Y.-C. F. Wang, “Low-rank matrix recovery with structural incoherence for robust face recognition,” ieeexplore.ieee.org, 2012, doi: 10.1109/CVPR.2012.6247981.
H. Yin and X. Wu, “Face recognition based on structural incoherence and low rank projection,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, vol. 9937 LNCS, pp. 68–78, doi: 10.1007/978-3-319-46257-8_8.
X. Chen and J. Gao, “Discrimination projective dictionary pair methods in dictionary learning1,” Proc. - 2015 8th Int. Congr. Image Signal Process. CISP 2015, no. October 2015, pp. 204–208, 2016, doi: 10.1109/CISP.2015.7407876.
M. Yang, W. Liu, W. Luo, and L. Shen, “Analysis-Synthesis dictionary learning for universality-particularity representation based classification,” 30th AAAI Conf. Artif. Intell. AAAI 2016, pp. 2251–2257, 2016.
M. Yang, H. Chang, W. Luo, and J. Yang, “Fisher discrimination dictionary pair learning for image classification,” Neurocomputing, vol. 269, pp. 13–20, 2017, doi: 10.1016/j.neucom.2016.08.146.
M. Yang, H. Chang, and W. Luo, “Discriminative analysis-synthesis dictionary learning for image classification,” Neurocomputing, vol. 219, no. September 2016, pp. 404–411, 2017, doi: 10.1016/j.neucom.2016.09.037.
B. Chen, J. Li, B. Ma, and G. Wei, “Discriminative dictionary pair learning based on differentiable support vector function for visual recognition,” Neurocomputing, vol. 272, pp. 306–313, 2018, doi: 10.1016/j.neucom.2017.07.003.
Z. Li, Z. Zhang, J. Qin, S. Li, and H. Cai, “Low-rank analysis–synthesis dictionary learning with adaptively ordinal locality,” Neural Networks, vol. 119, pp. 93–112, 2019, doi: 10.1016/j.neunet.2019.07.013.
Z. Zhang, W. Jiang, Z. Zhang, S. Li, G. Liu, and J. Qin, “Scalable Block-Diagonal Locality-Constrained Projective Dictionary Learning,” IJCAI Int. Jt. Conf. Artif. Intell., vol. 2019-August, pp. 4376–4382, May 2019.
Y. Sun et al., “Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning,” IEEE Trans. Neural Networks Learn. Syst., pp. 1–15, Jan. 2020, doi: 10.1109/tnnls.2019.2954545.
J. A. Tropp, “Greed is Good: Algorithmic Results for Sparse Approximation,” IEEE Trans. Inf. THEORY, vol. 50, no. 10, 2004, doi: 10.1109/TIT.2004.834793.
D. Gabay and B. Mercier, “A dual algorithm for the solution of nonlinear variational problems via finite element approximation,” Comput. Math. with Appl., vol. 2, no. 1, pp. 17–40, Jan. 1976, doi: 10.1016/0898-1221(76)90003-1.
J. F. Cai, E. J. Candès, and Z. Shen, “A singular value thresholding algorithm for matrix completion,” SIAM J. Optim., vol. 20, no. 4, pp. 1956–1982, 2010, doi: 10.1137/080738970.
A. S. Georghiades, P. N. Belhumeur, and D. J. Kriegman, “From few to many: Illumination cone models for face recognition under variable lighting and pose,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 23, no. 6, pp. 643–660, Jun. 2001, doi: 10.1109/34.927464.
A. Mart Nez and R. Benavente, “The AR Face Database,” CVC Tech. Rep., 1998.
F. S. Samaria and A. C. Harter, “Parameterisation of a stochastic model for human face identification,” in IEEE Workshop on Applications of Computer Vision - Proceedings, 1994, pp. 138–142, doi: 10.1109/acv.1994.341300.
D. B. Graham and N. M. Allinson, “Characterising Virtual Eigensignatures for General Purpose Face Recognition,” in Face Recognition, Springer Berlin Heidelberg, 1998, pp. 446–456.
L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories,” Comput. Vis. Image Underst., vol. 106, no. 1, pp. 59–70, Apr. 2007, doi: 10.1016/j.cviu.2005.09.012.
How to Cite
License1. We hereby assign copyright of our article (the Work) in all forms of media, whether now known or hereafter developed, to the Journal of Computer Science and Cybernetics. We understand that the Journal of Computer Science and Cybernetics will act on my/our behalf to publish, reproduce, distribute and transmit the Work.
2. This assignment of copyright to the Journal of Computer Science and Cybernetics is done so on the understanding that permission from the Journal of Computer Science and Cybernetics is not required for me/us to reproduce, republish or distribute copies of the Work in whole or in part. We will ensure that all such copies carry a notice of copyright ownership and reference to the original journal publication.
3. We warrant that the Work is our results and has not been published before in its current or a substantially similar form and is not under consideration for another publication, does not contain any unlawful statements and does not infringe any existing copyright.
4. We also warrant that We have obtained the necessary permission from the copyright holder/s to reproduce in the article any materials including tables, diagrams or photographs not owned by me/us.