Abstract
The convolution neural network based infrared small target detection suffers from the problems of limited receptive field of convolution kernel, information loss caused by down sampling operation, and limited power of the convolution neural network in relative information extraction. To solve these problems, a multi-layer multi-direction Transformer based neural network is proposed. Firstly, the Transformer block is adopted as the basic operator since it has a larger receptive field and more powerful in extracting relative information. The proposed network is a U-shaped network, and fuses local and global information with multi-layers structure. Meanwhile, to enhance the network’s ability to detect the infrared small target, a dual-direction attention operator which calculates the attention information along spatial and channel directions is designed for the decoder network. Finally, an additional network is added to the backbone network to calculate the number of the detected infrared small targets. This additional network reduces the number of falsely detected targets by comparing the calculated number with ground truth. The proposed method is tested on several datasets and the evaluation metrics in comparison with state-of-the-art methods. The proposed method achieves an improvement by 35% at most, which proves the effectiveness of the proposed method.
References
1
DENG H, SUN X P, LIU M L, et al. Small infrared target detection based on weighted local difference measure [J]. IEEE Transactions on Geoscience and Remote Sensing, 2016, 54 (7): 4204-4214.
2
LIANG J, LI L, REN J, et al. Infrared image occlusion interference detection method based on deep learning [J]. Acta Armamentarii, 2019, 40 (7): 1401-1410 (in Chinese).
3
WANG W T, QIN H L, CHENG W X, et al. Small target detection in infrared image using convolutional neural networks [C] // AOPC 2017: Optical Sensing and Imaging Technology and Applications. Bellingham: SPIE, 2017: 1335-1340.
4
YU C, LIU Y P, WU S H, et al. Infrared small target detection based on multiscale local contrast learning networks [J]. Infrared Physics and Technology, 2022, 123: 104107.
5
SHI Q, ZHANG C X, CHEN Z, et al. An infrared small target detection method using coordinate attention and feature fusion [J]. Infrared Physics and Technology, 2023, 131: 104614.
6
WU X, HONG D F, CHANUSSOT J. UIU-Net: U-Net in U-Net for infrared small object detection [J]. IEEE Transactions on Image Processing, 2022, 32: 364-376.
7
LI C Q, HUANG Z C, XIE X M, et al. IST-TransNet: Infrared small target detection based on transformer network [J]. Infrared Physics & Technology, 2023, 132: 104723.
8
LONG Y L, XU H, AN W, et al. Spatial-temporal fused filtering for infrared clutter suppression based on restricted sequential M-estimation [J]. Acta Aeronautica et Astronautica Sinica, 2011, 32 (8): 1531-1541 (in Chinese).
9
REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (6): 1137-1149.
10
REDMON J, DIVVALA S, GIRSHICK R, et al. You Only Look Once: Unified, real-time object detection [C] // 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2016: 779-788.
11
RABBI J, RAY N, SCHUBERT M, et al. Small-object detection in remote sensing images with end-to-end edge-enhanced GAN and object detector network [J]. Remote Sensing, 2020, 12 (9): 1432.
12
CHENG G, YUAN X, YAO X W, et al. Towards large-scale small object detection: survey and benchmarks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45 (11): 13467-13488.
13
ZHANG K, WANG K D, YANG X, et al. Anti-interference recognition algorithm based on DNET for infrared aerial target [J]. Acta Aeronautica et Astronautica Sinica, 2021, 42 (2): 324223 (in Chinese).
14
PHILIP CHEN C L, LI H, WEI Y T, et al. A local contrast method for small infrared target detection [J]. IEEE Transactions on Geoscience and Remote Sensing, 2014, 52 (1): 574-581.
15
GAO C Q, MENG D Y, YANG Y, et al. Infrared patch-image model for small target detection in a single image [J]. IEEE Transactions on Image Processing, 2013, 22 (12): 4996-5009.
16
BAI X Z, ZHOU F G. Analysis of new top-hat transformation and the application for infrared dim small target detection [J]. Pattern Recognition, 2010, 43 (6): 2145-2156.
17
DENG H, SUN X P, LIU M L, et al. Small infrared target detection based on weighted local difference measure [J]. IEEE Transactions on Geoscience and Remote Sensing, 2016, 54 (7): 4204-4214.
18
WANG H, ZHOU L P, WANG L. Miss detection vs false alarm: Adversarial learning for small object segmentation in infrared images [C] // 2019 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2019: 8508-8517.
19
DAI Y M, WU Y Q, ZHOU F, et al. Attentional local contrast networks for infrared small target detection [J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59 (11): 9813-9824.
20
LI B Y, XIAO C, WANG L G, et al. Dense nested attention network for infrared small target detection [J]. IEEE Transactions on Image Processing, 2022, 32: 1745-1758.
21
ZHANG M J, BAI H C, ZHANG J, et al. RKformer: Runge-Kutta transformer with random-connection attention for infrared small target detection [C] // Proceedings of the 30th ACM International Conference on Multimedia. New York: ACM, 2022: 1730-1738.
22
YING X Y, LIU L, WANG Y Q, et al. Mapping degeneration meets label evolution: Learning infrared small target detection with single point supervision [C] // 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2023: 15528-15538.
23
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need [C] // Proceedings of the 31st International Conference on Neural Information Processing Systems. New York: ACM, 2017: 6000-6010.
24
CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers [C] // 16th European Conference on Computer Vision (ECCV). Cham: Springer, 2020: 213-229.
25
LIU Z, LIN Y T, CAO Y, et al. Swin Transformer: Hierarchical vision transformer using shifted windows [C] // 2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE Press, 2021: 9992-10002.
26
WANG Z D, CUN X D, BAO J M, et al. Uformer: A general U-shaped transformer for image restoration [C] // 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 17662-17672.
27
ZHANG M J, ZHANG R, YANG Y X, et al. ISNet: Shape matters for infrared small target detection [C] // 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE Press, 2022: 867-876.
28
RIVEST J F, FORTIN R. Detection of dim targets in digital infrared imagery by morphological image processing [J]. Optical Engineering, 1996, 35 (7): 1886-1893.
29
HAN J H, LIANG K, ZHOU B, et al. Infrared small target detection utilizing the multiscale relative local contrast measure [J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15 (4): 612-616.
30
MORADI S, MOALLEM P, SABAHI M F. A false-alarm aware methodology to develop robust and efficient multi-scale infrared small target detection algorithm [J]. Infrared Physics & Technology, 2018, 89: 387-397.
31
ZHANG L D, PENG Z M. Infrared small target detection based on partial sum of the tensor nuclear norm [J]. Remote Sensing, 2019, 11 (4): 382-390.
32
WANG Q, WU L T, WANG Y, et al. An infrared small target detection method based on key point [J]. Acta Aeronautica et Astronautica Sinica, 2023, 44 (10): 328173 (in Chinese).