最新刊期

    33 18 2025

      Modern Applied Optics

    • 在燃烧化学反应机理领域,专家提出了一种修正的中红外激光吸收光谱测温方法,大幅提高火焰温度测量的可靠性,为高效清洁燃烧技术提供有效解决方案。
      WAN Tao, LU Shengyao, MA Liuhao, SUN Bowen, HUANG Yitao, WANG Yu
      Vol. 33, Issue 18, Pages: 2833-2843(2025) DOI: 10.37188/OPE.20253318.2833
      摘要:Accurate measurement of the temperature in counterflow diffusion flame is crucial for constructing high-fidelity combustion chemical reaction mechanisms, revealing pollutant formation processes, and developing efficient and clean combustion technologies. This holds significant theoretical and engineering application value. Laser Absorption Spectroscopy (LAS), a widely used non-contact temperature measurement technique, has garnered considerable attention in recent years. However, due to the non-uniform distribution of thermodynamic and chemical parameters along the absorption path, traditional Line-of-Sight (LOS) integration-based measurement methods exhibit significant uncertainties when acquiring flame temperature information, affecting their accuracy and reliability. While traditional tomography reconstruction methods and multi-line profile fitting techniques can, to some extent, recover the temperature information of the flame center, these approaches are often complex and computationally intensive, making them difficult to meet the demands of high-fidelity diagnostic testing. This paper proposed a line-of-sight (LOS) temperature measurement method using mid-infrared laser absorption spectroscopy, incorporating a correction strategy for LOS effects. The strategy quantified the deviation in the LOS temperature measurements caused by the non-uniformities at different flame positions, based on computational fluid dynamics (CFD) and spectral numerical simulations. The correction was applied to the LOS temperature measurements. To validate this strategy, systematic LOS temperature measurements were conducted using the strong absorption band of H2O molecules around 2.5 μm, with comparisons made against tomography reconstruction measurements and CFD simulation results, which incorporate detailed chemical reaction mechanisms. Experimental results indicate that the proposed temperature correction strategy significantly enhances the reliability of LOS temperature measurements for counterflow diffusion flames with various fuels. Compared to tomography reconstruction experiments and CFD results, the relative differences in the measurements were reduced from 10%~20% before correction to within 5%, with most measurement locations showing discrepancies of less than 2%. The temperature correction strategy presented in this study substantially improves the reliability of the LOS measurement method, while eliminating the time-consuming mechanical tomography scanning steps. This provides an effective solution for the rapid and accurate measurement of counterflow diffusion flames temperatures.  
      关键词:line of sight;mid-infrared tomographic absorption spectroscopy;temperature distributions;temperature correction   
      68
      |
      20
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155803 false
      更新时间:2025-11-04
    • Monte Carlo based smartphone zoom lenses design AI导读

      在消费电子产品领域,专家利用蒙特卡洛搜索程序,探索了变焦镜头轻量化设计方法,为高性能小型化变焦光学成像系统设计提供了新思路。
      GUO Jiajian, XIE Yuanhao, MU Shuaiwei, ZHANG Xin, LIU Mingxin, SHI Guangwei, ZHAO Shangnan
      Vol. 33, Issue 18, Pages: 2844-2856(2025) DOI: 10.37188/OPE.20253318.2844
      摘要:With the advancement of consumer electronics, there is an increasing demand for high-performance, miniaturized zoom lenses. In zoom lenses, the distribution of optical power among components and their zoom cam curve-collectively known as first-order configurations-significantly influence imaging performance. However, selecting optimal first-order configurations is challenging through simple analytical methods, thereby complicating the design of lightweight zoom lenses. To globally assess the potential performance of first-order configurations in zoom lenses and explore design methodologies for lightweight zoom lenses, this study employed a Monte Carlo search algorithm to identify optimal first-order configurations. Initially, focal lengths for each component were generated based on the Monte Carlo method, and configurations satisfying the zoom constraint equations were filtered. Subsequently, lens design files were created for the valid configurations, followed by ray tracing and rapid optimization to evaluate performance, ultimately selecting configurations with superior optical power distributions. Finally, taking a smartphone zoom lens as an example, a compact zoom lens was designed using the high-performance configuration as the initial structure. This lens maintained a constant F-number of 2.9, achieved an equivalent focal length range of 61-183 mm, and had a total track length of 90 mm. It featured smooth zoom cam curve, modulation transfer function (MTF) values exceeding 0.45 at 80 lp/mm and 0.15 at 160 lp/mm across all fields of view and focal lengths, distortion below 4%, and relative illumination above 20%. Compared to conventional zoom lenses, it offered significant advantages in terms of weight reduction. The design results demonstrate the effectiveness of the proposed method, which combines Monte Carlo-generated first-order configurations with ray tracing evaluations, providing a novel approach for designing compact zoom lenses.  
      关键词:optical design;zoom lenses;Monte Carlo;automatic design   
      47
      |
      24
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155682 false
      更新时间:2025-11-04
    • 在二维数字图像相关技术领域,专家提出了一种融合维纳滤波和自适应局部平滑的离焦散斑去模糊方法,有效拓展了深度量程,提高了测量精度,为复杂动态场景下的全场振动监测提供技术支持。
      HUANG Haisheng, NI Zihao, BAI Yulei, DONG Bo
      Vol. 33, Issue 18, Pages: 2857-2867(2025) DOI: 10.37188/OPE.20253318.2857
      摘要:In the vibration measurement of two-dimensional digital image correlation technology(2D-DIC), the limited depth range will cause speckle image blurring when the out-of-plane displacement of the speckle is too large. In order to realize the expansion of the depth range, this paper proposed a deblurring method for defocusing speckles that combined Wiener filtering and adaptive local smoothing. The method first analyzed the mechanism of defocus blur in speckle images and used Wiener filtering to restore the original blurred images. Subsequently, an adaptive local smoothing strategy was introduced to suppress the ringing effect caused by Wiener filter deconvolution. Finally, to verify the effectiveness of the proposed method, experiments were conducted on defocused speckle deblurring, out-of-plane displacement measurement, and vibration measurement. In the experiment of defocused deblurring effect, the speckle image obtained a higher average intensity gradients and a smaller optimal subset sizes, indicating that the new method could effectively improve the speckle contrast and produced high-quality restored images. In the out-of-plane displacement reconstruction experiment, the maximum dynamic measurement range was increased by 9%, and the dispersion degree of the measured value of the out-of-plane displacement was reduced by 8.8% compared with the real value, which effectively extended the depth range of 2D-DIC. In the actual vibration measurement, the method improved the signal-to-noise ratio of periodic vibration by 5.5%, and reduced the mean error by 2.4%, which effectively increased the measurement accuracy of 2D-DIC.The proposed method in this paper significantly expands the measurement capability of traditional 2D-DIC in the depth direction, and provides reliable technical support for full-field vibration monitoring in complex dynamic scenarios.  
      关键词:speckle deblurring;depth range extension;digital image correlation;out-of-plane displacement;Wiener filtering   
      60
      |
      13
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155685 false
      更新时间:2025-11-04
    • 在多光谱硫化锌表面处理领域,专家通过优化化学机械抛光工艺参数,实现了表面高精度与高质量的协同控制,为高性能光学器件应用提供技术支持。
      REN Jiatong, QIN Lin, ZHU Beibei, CAI Gen, CHU Jianning, ZHANG Chupeng, CHEN Xiao
      Vol. 33, Issue 18, Pages: 2868-2881(2025) DOI: 10.37188/OPE.20253318.2868
      摘要:In order to solve the problem that it was difficult to obtain high surface quality of multi-spectral ZnS in chemical-mechanical polishing (CMP), this paper focused on the synergistic optimization of flatness and surface roughness to carry out experimental research. Through single-factor experiments, the optimal values and ranges for polishing time, abrasive type, rotational speed of the polishing disc, and polishing pressure were determined. Subsequently, response surface methodology was employed to design the experimental framework, and grey relational analysis was introduced to investigate the influence of these parameters and their interactions on surface morphology. The results show that the speed of polishing disk is the dominant factor affecting the surface quality, followed by polishing pressure and grain size. The optimum process parameters were determined by model optimization: abrasive grain size of 0.5 μm, polishing disc speed of 55 r/min, and polishing pressure of 16 N. Under these conditions, the process was validated, and the surface flatness of the workpiece reached 112.49 nm, the roughness was reduced to 1.11 nm, and the surface morphology was uniform with fewer defects. The synergistic control of high precision and high quality of multi-spectral ZnS surfaces can be achieved by optimizing the CMP process parameters, providing technical support for its application in high-performance optical devices.  
      关键词:multi-spectral ZnS;Chemical Mechanical Polishing (CMP);response surface method;high surface quality;grey relational analysis   
      46
      |
      18
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155683 false
      更新时间:2025-11-04

      Micro/Nano Technology and Fine Mechanics

    • 在工业测量领域,专家提出了多频磁场耦合的绝对式时栅角位移传感器设计方案,有效解决了磁场式传感器体积大和码道串扰问题,提升了信噪比与测量精度,满足了空间约束测量需求。
      YANG Jisen, LIU Bangquan, YI Jingsong, CAO Junjie, LUO Yunpei, ZHANG Jing
      Vol. 33, Issue 18, Pages: 2882-2898(2025) DOI: 10.37188/OPE.20253318.2882
      摘要:To address the issues of large volume in the dual-channel structure of magnetic field sensors and code channel crosstalk caused by same-frequency excitation, while improving signal-to-noise ratio and measurement accuracy, a design scheme for an absolute time-grid angular displacement sensor with multi-frequency magnetic field coupling was proposed. This enabled real-time absolute angular displacement measurement to meet the spatial constraint measurement requirements in industrial applications. First, a theoretical model for the transient magnetic field coupling of planar coils was established, and the magnetic field distribution characteristics of excitation coils with different shapes were analyzed. Based on the principle of incremental measurement, an absolute single-channel angular displacement sensor model was constructed; An absolute time-resolved displacement calculation scheme with mutually prime pole numbers was established. Under a dual-frequency synchronous excitation scheme of 500 kHz (coarse machine) and 1 MHz (fine machine), the high-frequency-to-low-frequency principle was utilized to resolve the contradiction between high signal-to-noise ratio and high resolution under high-frequency excitation signals. Through electromagnetic simulation and error analysis, the optimal coupling gap for the sensor was determined to be 1.0 mm; Finally, a sensor prototype with an outer diameter of 84 mm was developed and performance testing was completed. Experimental results show that the sensor successfully suppresses magnetic field crosstalk between coarse and fine code tracks while achieving miniaturization, with a maximum measurement error of ±78 across the entire measurement range.  
      关键词:absolute displacement measurement;multi-frequency magnetic field coupling;signal-to-noise ratio;displacement sensor   
      26
      |
      10
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155686 false
      更新时间:2025-11-04
    • 在能量转化领域,研究人员设计了新型三自由度压电俘能器,有效提升能量转化效率,为低功耗无线传感器提供能源支持。
      XU Dongmei, YANG Yang, YU Simiao, ZHANG Chi, PAN Jianan, ZHANG Xuhui
      Vol. 33, Issue 18, Pages: 2899-2913(2025) DOI: 10.37188/OPE.20253318.2899
      摘要:To improve the energy conversion efficiency of a single-degree-of-freedom (SDOF) piezoelectric energy harvester and broaden its operational direction, a novel three-degree-of-freedom (3-DOF) piezoelectric energy harvester had been designed by integrating the piezoelectric energy conversion mechanism, multi-degree-of-freedom piezoelectric harvesting principles, and force amplification structures. The proposed harvester had consisted of orthogonally arranged cantilever-beam harvesters and a force-amplified compression-tension mode harvester. A simulation model had been established using MATLAB/Simulink to analyze the effects of excitation and structural parameters on the system’s dynamic response. Based on the simulation, a prototype had been fabricated and experimentally validated. The results had demonstrated that the harvester could effectively generate electrical energy along the X, Y, and Z directions, with structural parameters (cantilever spacing, proof mass, and spring stiffness) significantly influencing its performance. Experiments showed that the X and Y directions achieved an output power of 2.924 mW at a load resistance of 15 kΩ, capable of illuminating 22 LEDs, while the Z direction delivered 4.871 mW at 40 kΩ, powering 33 LEDs. The findings confirm the harvester’s stable energy output across different vibration directions, demonstrating its feasibility for powering low-power wireless sensors. This study provides theoretical guidance for the design of novel multi-degree-of-freedom piezoelectric energy harvesters.  
      关键词:three degrees of freedom;vibration energy collection;force amplification structure;positive piezoelectric effect;tension-compression mode   
      21
      |
      7
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155684 false
      更新时间:2025-11-04
    • 在大视场测量领域,提出了一种新型十字形标定靶标及其识别方法,有效解决了传统靶标制造困难、携带不便等问题,为工业现场大视场相机快速标定提供了解决方案。
      SHI Baoquan, HOU Xuanzhe, HE Qian, CHEN Xianming, ZHANG Wendong, YANG Ling
      Vol. 33, Issue 18, Pages: 2914-2928(2025) DOI: 10.37188/OPE.20253318.2914
      摘要:To address the challenges of manufacturing, portability, and deformation associated with traditional large-scale calibration targets in large field-of-view (FOV) measurements, a cross-shaped calibration target based on cross-ratio invariance and its recognition method was proposed. First, a cross-structured target was designed by vertically connecting two equal-length rulers. The target featured four arms of equal length, with circular uncoded markers arranged along each arm such that the cross-ratio among any four adjacent points remained constant at 4/3. The main structure was made of aluminum alloy or carbon fiber composite material, ensuring both lightweight and high strength. Second, a marker encoding rule was developed based on the principle of cross-ratio invariance, and a recognition algorithm was designed to enable automatic detection and decoding of target points in calibration images. Finally, a physical prototype of the target was fabricated and tested. Experimental results demonstrate that the proposed target can be accurately recognized even at a large tilt angle of up to 80°, and its simple structure makes it easy to manufacture, assemble, disassemble, and carry, making it well-suited for rapid calibration of wide FOV cameras in complex industrial environments.  
      关键词:camera calibration;cross-ratio invariance;cross-shaped calibration target;large fields of view   
      20
      |
      13
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155833 false
      更新时间:2025-11-04

      Information Sciences

    • 在开放词汇目标检测领域,研究者提出了KI-DBFOVD方法,有效提升了新类检测精度,为开放场景下的目标检测提供了新方案。
      JIN You, DENG Zhen, LIU Libo
      Vol. 33, Issue 18, Pages: 2929-2943(2025) DOI: 10.37188/OPE.20253318.2929
      摘要:To address the issues of weak understanding of new class concepts, label confusion, and insufficient detection performance of new classes in open-set scenarios, a Knowledge Integration-guided Dual-branch Fusion Open-Vocabulary Object Detection (KI-DBFOVD) method was proposed in this paper. Firstly, a Knowledge Integration (KI) module was designed, where pseudo-labels generated by a Vision-Language Model were embedded into the detector to learn about new class concepts. Subsequently, a Label Match (LM) module was introduced to refine the label matching process through multi-level threshold adjustment and independent matching between base and new classes, thereby alleviating the label confusion between base and new classes during detection. Finally, a novel Dual-branch Fusion module (DBF) was constructed by fusing the traditional visual branch and the vision-language branch via geometric averaging. This fusion maintained the detection accuracy of base classes and more effectively detected and localized new class objects, then enhanced the overall detection performance of the KI-DBFOVD method. Experimental results demonstrate that this method achieves a detection accuracy of 38.6% for new classes on the COCO dataset and 25.4% on the more challenging LVIS dataset, which contains a larger number of categories. These results outperform several mainstream methods and indicate that this approach is more suitable for different open-set scenarios..  
      关键词:open-vocabulary object detection;knowledge integration;label match;dual-branch fusion   
      25
      |
      15
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155846 false
      更新时间:2025-11-04
    • 在图像质量评价领域,研究者提出了一种基于视觉感知的无参考IQA方法BCTCSP,通过综合亮度、色度、纹理、清晰度和局部对比度等特征,构建了一种有效的IQA模型,实验结果表明其性能优于现有28个模型。
      YAO Juncai, SHEN Jing
      Vol. 33, Issue 18, Pages: 2944-2961(2025) DOI: 10.37188/OPE.20253318.2944
      摘要:It aimed to propose an image quality assessment (IQA) method that conformed to visual perception and had high comprehensive benefits of accuracy, generalization, and complexity, which would better control image processing and meet its practical application needs. Based on the image tableau information and their features such as brightness, chromaticity, texture, clarity, and local contrast, and considering human perception effects such as contrast sensitivity, non-linear perception of brightness, visual comfort, and texture complexity perception, a no-reference IQA method, namely BCTCSP, was proposed. In BCTCSP, firstly, by analyzing the relationship between image quality and image brightness, grayscale distribution, color depth and saturation, non-linear perception of brightness, and visual perception comfort, a quantitative and computational method was proposed to obtain the contribution and impact of image brightness, chromaticity, and visual perception on IQA. Then, combining the gray-gradient co-occurrence matrix to calculate and statistically analyze image texture features, and using texture weighted averaging and the HVS complex object perception model, a method was proposed to quantify and calculate the contribution and impact of image texture information entropy and its visual perception on IQA. Next, the contrast value and detection threshold of each point in the image were calculated, subsequently combining the contrast sensitivity characteristics and their models, and masking properties of HVS, the contribution and impact of local contrast and visual perception of the image on IQA were quantified and calculated. Afterwards, four factors including sharpness, signal-to-noise ratio, proportion of high-frequency components, and resolution, were used to describe the clarity of the image, and their quantification and calculation methods were proposed to obtain the clarity index of the image. Finally, synthesizing four factors, an IQA model was constructed, and its measurement standards were quantified. Meanwhile, 6 430 distorted images from 6 open databases (TID2013, CSIQ, LIVE, IVC, SPAQ, and Koniq-10k) were tested and verified, and in terms of accuracy, complexity, generalization, and their comprehensive benefits, BCTCSP was compared with 28 existing and typical IQA models. The experimental results show that the accuracy PLCC of the proposed model reaches a minimum of 0.892 1, a maximum of 0.966 4 among the 6 databases, and the weighted PLCC of 6 databases reaches 0.917 4. Its comprehensive benefits are higher than those of the 28 existing IQA models. The comprehensive results indicate that the proposed model is an effective and high-performance NR-IQA model.  
      关键词:image quality assessment;image tableau;visual perception;texture information entropy   
      37
      |
      13
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132156021 false
      更新时间:2025-11-04
    • Underwater image restoration driven by the mean intensity difference prior AI导读

      在水下图像复原领域,研究者提出了一种均值强度差先验驱动的复原模型,有效提升了色彩还原度、对比度和细节锐度,为水下视觉感知任务提供可靠数据支撑。
      SHANG Jiashuo, LI Ying, YUAN Jingyi, GUO Shen, XING Hu
      Vol. 33, Issue 18, Pages: 2962-2979(2025) DOI: 10.37188/OPE.20253318.2962
      摘要:Underwater images are degraded by wavelength-dependent absorption and multiple scattering. As a result, they exhibit low contrast, strong color cast, and blurred detail. To address these issues, an underwater image restoration model driven by the mean intensity difference prior (MIDP) was proposed. First, the mean intensity difference between the ‘blue-green, red’ and ‘green-blue, red’ branches was calculated in parallel within each local window and averaged to construct a noise-robust three-channel coupled attenuation prior. Subsequently, based on the physical relationship of the ‘intensity difference–depth map-transmission map’ chain, exponential mapping with an adaptive slope was introduced. The three-channel transmission maps were accurately estimated through a first-order Taylor expansion combined with a finite-difference approximation. Meanwhile, the mean of the set of pixels with the largest intensity difference was calculated to estimate the background light. Finally, the transmission maps and the background light were substituted into the inverse underwater imaging model to obtain restored images with natural colors and sharp details.First, image channels were split into two spectral branches: blue-green versus red, and green-blue versus red. Mean intensity differences were computed in parallel for both branches inside every sliding window. The two branch differences were averaged, producing a coupled three-channel attenuation prior for estimating intensity difference. The intensity differences estimated by MIDP were highly robust against image noise and local outliers. Next, a chained physical relation linked intensity difference, scene depth, and transmission map. An exponential mapping with adaptive slope transformed differences into initial transmission map estimation. A first-order Taylor expansion and finite-difference approximation then refined per-channel transmission maps. Pixels with the largest intensity differences were selected, and their mean was used to estimate background light. The estimated transmission map and background light were inserted into the inverse underwater imaging model to restore clear images. The resulting image exhibited natural colors, stronger contrast, and sharper detail.Extensive experiments are conducted on real datasets UIEB and UCCS. Quantitative evaluation shows MIDP obtains the highest combined scores on UIQM, UCIQE, CCF, and MWF. Quantitative evaluation shows that MIDP obtains color fidelity, local contrast, and gradient sharpness rise significantly. Additional application experiments were carried out on three vision tasks. In key-point detection, the restored images yielded many more stable keypoints . In edge detection, the restored images produced clearer and more coherent results. In underwater segmentation mask, the restored images produce tighter mask boundaries and lower false positive rates. These results demonstrate that MIDP adapts to varied water conditions and restores images with high quality. These results also supply high-quality input data for downstream underwater vision tasks.  
      关键词:underwater image restoration;low-quality image enhancement;intensity difference prior;transmission map estimation;background light estimation   
      31
      |
      11
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155986 false
      更新时间:2025-11-04
    • Efficient mamba-driven end-to-end light field image compression AI导读

      在光场图像压缩领域,研究者提出了一种高效Mamba驱动的端到端压缩方法,实现了码率降低和PSNR提升,同时显著提升了编解码效率。
      FENG Zheyu, JIANG Zhidi, WAN Lifei, XU Haiyong, JIANG Gangyi
      Vol. 33, Issue 18, Pages: 2980-2995(2025) DOI: 10.37188/OPE.20253318.2980
      摘要:Light field images capture both spatial and angular information of light rays, providing richer visual information than traditional 2D images. However, their high-dimensional nature poses challenges for existing compression methods in terms of global feature utilization, long-range correlation exploration, and computational complexity, limiting the improvements of compression performance and efficiency. To address these issues, this paper proposed an efficient Mamba-driven end-to-end light field image compression method. Firstly, 2D slices containing spatial and epipolar plane information were extracted from the 4D light field image, and Mamba was employed to fully capture their global contextual information. Secondly, to scan the light field image in multiple directions while avoiding a significant increase in computational complexity, a channel-efficient 2D selective scanning strategy was introduced to extract light field features accurately and efficiently. Finally, on the decoding end, a residual reconstruction module was designed to enhance the reconstructed image quality while reducing the number of parameters and decreasing the encoding and decoding time. The experimental results show that compared with the existing representative method SADN, the proposed method achieves an average bitrate reduction of 7.4% and a PSNR improvement of 0.37 dB on light field images with a 7×7 angular resolution, while also demonstrating superior subjective visual quality. In terms of encoding and decoding time, the proposed method has achieved a significant improvement of 10 to 20 times. Furthermore, compared to the state-of-the-art method LFIC-DRASC, the proposed method achieves an average bitrate reduction of 19.5% and a PSNR improvement of 0.58 dB on light field images with a 13×13 angular resolution.  
      关键词:light field;image compression;end-to-end;Mamba   
      40
      |
      13
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132155877 false
      更新时间:2025-11-04
    • A bimodal fusion method for remote sensing images to cloud removal AI导读

      在合成孔径雷达图像去云领域,研究者提出了一种结合光学图像的去云网络,有效提升了图像去云性能。
      LIU Changyuan, CAO Qing, LIU Jinfeng
      Vol. 33, Issue 18, Pages: 2996-3007(2025) DOI: 10.37188/OPE.20253318.2996
      摘要:In order to solve the problem that the difference between cloud and background could not be accurately handled in the process of cloud removal from Synthetic Aperture Radar (SAR) images combined with optical image data, this paper proposed a cloud removal network based on the combination of SAR images and optical remote sensing images with a double-branched structure. In the feature extraction stage, a multi-scale attention mechanism was introduced to effectively capture the global and local information in the image, and provide a more representative representation of feature information for subsequent fusion and cloud removal operations. The parallel structure of the local fusion branch and the differential branch was redesigned, and the contribution of the two was dynamically balanced through the gating mechanism, so as to fully explore the complementarity of the two, and refine the cloud edge in the optical cloudy image, so as to restore a more fine cloudless optical image. The location-aware enhanced Swin Transformer was used to densely connect local features, so that the network model had better robustness in the face of complex environments. The Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) of the proposed model are 0.833 1 dB and 0.024 6 higher than those of the current optimal algorithm, respectively. It is proved that the proposed algorithm has better performance than other methods in the image removal task.  
      关键词:remotely sensed imagery;synthetic aperture radar image;optical images;multimodal;data fusion;cloud removal   
      41
      |
      17
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 132156063 false
      更新时间:2025-11-04
    0