HU Lin-miao,ZHANG Yong,LOU Chen-feng.Shortwave infrared visible-light face recognition based on content feature extraction[J].Optics and Precision Engineering,2021,29(01):160-171.
HU Lin-miao,ZHANG Yong,LOU Chen-feng.Shortwave infrared visible-light face recognition based on content feature extraction[J].Optics and Precision Engineering,2021,29(01):160-171. DOI: 10.37188/OPE.20212901.0160.
Shortwave infrared visible-light face recognition based on content feature extraction
To recognize shortwave-infrared(SWIR) face images according to enrolled visible-light(VIS) face images, a SWIR-VIS face recognition framework based on content feature extraction is proposed. Initially, a SWIR-VIS face image dataset was established. DRIT–an image translation frame–is modified to extract content features more accurately, and consequently obtains better translation results. Then, the content feature extractors in the improved DRIT framework overcome the interference of the modal difference on the recognition. The network used to recognize SWIR faces based on content features was adopted to complete the cross-modal SWIR-VIS face recognition task. The proposed network is evaluated on a self-built SWIR-VIS face image dataset, and compared with the existing widely used methods. Experimental results indicate that the improved DRIT could extract content features more accurately, and consequently the recognition accuracy with content extractors from the improved DRIT model is 12.89% higher than that with the original DRIT content extractors. The recognition accuracy of the proposed framework in the task of SWIR-VIS recognition was 88.86%. The proposed framework can effectively overcome the modality gap and improves the recognition accuracy.
关键词
Keywords
references
DUTTA A K . Imaging beyond human vision [C]. International Conference on Electrical and Control Engineering , 2014 : 224 - 229 .
CAO Z , SCHMID N A , BOURLAI T . Composite multilobe descriptors for cross-spectral recognition of full and partial face [J]. Optical Engineering , 2016 , 55 ( 8 ): 083107 .
BIHN M , GUNTHER M , LEMMOND D , et al . Evaluating a convolutional neural network on short-wave infra-red images [C]. 2018 IEEE Winter Applications of Computer Vision Workshops (WACVW). IEEE , 2018 : 18 - 27
HAREL M , MOSHE Y . Tone mapping for shortwave infrared face images [C]. 2014 IEEE 28th Convention of Electrical & Electronics Engineers in Israel (IEEEI). IEEE , 2014 : 1 - 5 .
DENG J , GUO J , XUE N , et al . Arcface: Additive angular margin loss for deep face recognition [C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2019 : 4690 - 4699 .
LEZAMA J , QIU Q , SAPIRO G . Not afraid of the dark: Nir-vis face recognition via cross-spectral hallucination and low-rank embedding [C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2017 : 6628 - 6637 .
FU C , WU X , HU Y , et al . Dual variational generation for low shot heterogeneous face recognition [C]. Advances in Neural Information Processing Systems , 2019 : 2670 - 2679 .
PARKHI O M , VEDALDI A , ZISSERMAN A . Deep face recognition [J]. Proceedings of the British Machine Vision Conference (BMVC) , 2015 : 41 . 1-41 . 12 .
DU ZH L , SHEN H Y , SONG G M , et al . Image style transfer based on improved CycleGAN [J]. Opt. Precision Eng. , 2019 , 27 ( 8 ): 1836 - 1844 . (in Chinese)
SHAHAM T R , DEKEL T , MICHAELI T , et al . SinGAN: Learning a generative model from a single natural image [C]. Proceedings of the IEEE International Conference on Computer Vision , 2019 : 4570 - 4580 .
XIANG X , TIAN Y , ZHANG Y , et al . Zooming Slow-Mo: Fast and accurate one-stage space-time video super-resolution [J]. arXiv preprint arXiv: 2002 . 11616 , 2020 .
ZHU F ZH , LIU Y , HUANG X , et al . Remote sensing image super-resolution based on improved sparse representation [J]. Opt. Precision Eng. , 2019 , 27 ( 3 ): 718 - 725 . (in Chinese)
YANG ZH K , BU L P , WANG T , et al . Scenemigration of indoor flame image based on Cycle-Consistent adversarial networks [J]. Opt. Precision Eng. , 2020 , 28 ( 3 ): 745 - 758 . (in Chinese)
ISOLA P , ZHU J Y , ZHOU T , et al . Image-to-image translation with conditional adversarial networks [C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2017 : 5967 - 5976 .
ZHU J Y , PARK T , ISOLA P , et al . Unpaired image-to-image translation using cycle-consistent adversarial networks [C]. Proceedings of the IEEE International Conference on Computer Vision , 2017 : 2223 - 2232 .
HUANG X , LIU M Y , BELONGIE S , et al . Multimodal unsupervised image-to-image translation [C]. Proceedings of the European Conference on Computer Vision (ECCV) , 2018 : 172 - 189 .
LEE H , TSENG H , HUANG J , et al . Diverse Image-to-Image Translation via Disentangled Representations [C]. Proceedings of the European Conference on Computer Vision (ECCV) , 2018 : 36 - 52 .
EMAMI H , ALIABADI M M , DONG M , et al . SPA-GAN: Spatial Attention GAN for Image-to-Image Translation [J]. IEEE Transactions on Multimedia , 2020 : 1 - 1 .
KIM J , KIM M , KANG H , et al . U-GAT-IT: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation [C]. International Conference on Learning Representations , 2020 .
WU X , HE R , SUN Z . A lightened cnn for deep face representation [J]. arXiv preprint arXiv:1511.02683 , 2015 , 4 ( 8 ).
SCHROFF F , KALENICHENKO D , PHILBIN J . Facenet: A unified embedding for face recognition and clustering [C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , 2015 : 815 - 823 .