最新刊期

    34 1 2026

      Modern Applied Optics

    • Computational spectral imaging systems and reconstruction algorithms AI导读

      计算光谱成像技术在智能手机、无人机等领域应用广泛,专家系统阐述了其理论框架与方法体系,为多领域发展提供重要支撑。
      LIU Xinyu, CHEN Yating, WU Jiachen, MA Yuchen, LI Yumei, ZHANG Shuhe, ZHENG Zhenrong, CAO Liangcai
      Vol. 34, Issue 1, Pages: 1-25(2026) DOI: 10.37188/OPE.20263401.0001
      摘要:Computational spectral imaging, grounded in compressed sensing theory, incorporates optical encoding elements to project high-dimensional spectral image data into low-dimensional measurements, which are subsequently decoded into spectral images using advanced reconstruction algorithms. This paradigm offers notable advantages in system compactness, acquisition speed, and manufacturing cost. In recent years, rapid progress has been achieved in both theoretical development and system implementation, resulting in a growing body of high-quality research. Concurrently, consumer-oriented deployments have expanded to platforms such as smartphones, unmanned aerial vehicles, and remote-sensing satellites, enabling diverse applications in color imaging, environmental monitoring, and medical diagnostics. In this paper, the theoretical foundations and methodological advances of computational spectral imaging are systematically reviewed. Representative optical encoding strategies-including amplitude encoding, wavelength encoding, wavefront encoding, and multi-aperture encoding-are examined, along with mainstream reconstruction approaches ranging from iterative algorithms with prior constraints to end-to-end deep learning models. Finally, emerging trends and key challenges are discussed. Given its strong relevance to strategic emerging industries, including intelligent manufacturing, artificial intelligence, the low-altitude economy, and smart agriculture, computational spectral imaging is expected to play an increasingly important role across a broad range of applications.  
      关键词:computational imaging;spectral imaging;Compressed sensing;deep learning   
      57
      |
      8
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561963 false
      更新时间:2026-01-21
    • 在光学系统领域,专家提出一种轻量化设计方法,融合多参数优化与碳化硅增材制造技术,实现双面共体反射镜轻量化、高刚度目标。
      WANG Chengbin, SUN Shengli, SUN Xiaojin, MA Xiaohao, HU Kai
      Vol. 34, Issue 1, Pages: 26-34(2026) DOI: 10.37188/OPE.20263401.0026
      摘要:To meet the requirements of high lightweighting and high stiffness for a Φ516 mm double-sided integral reflector-comprising the primary and quaternary mirrors in a coaxial four-mirror optical system-a method integrating multi-parameter optimization with silicon carbide (SiC) additive manufacturing is proposed based on conventional lightweight design principles. First, an initial reflector model was established from engineering experience, and key structural dimensions were selected to formulate a multi-parameter optimization framework. The parametric model was then optimized using a multi-objective genetic algorithm. The optimized reflector weighs 6.8 kg, achieving a 32.6% reduction relative to the initial design. After completion of 3D printing, a structural modal test was conducted, yielding a first-order natural frequency of 1 964 Hz. Following optical processing of both mirror surfaces, gravity impact, vacuum high–low temperature, and vibration tests were performed. The maximum change in surface figure accuracy before and after these tests satisfied the specified requirements. These results confirm the reliability and stability of the SiC double-sided reflector design and demonstrate the robustness of the SiC additive manufacturing process for high-performance integral reflectors.  
      关键词:Additive manufacturing;SiC;double-sided reflector;light weighting;reliability verification   
      29
      |
      10
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561778 false
      更新时间:2026-01-21
    • Design of high-performance geometric waveguide head-mounted displays AI导读

      在增强现实领域,研究人员设计了一种轻薄型透射式头戴显示器光学系统,实现了大视场角、轻薄化及高沉浸感,突破了量产化关键技术难点。
      XU Hengshen, JIANG Yuting, HU Yueqiang
      Vol. 34, Issue 1, Pages: 35-43(2026) DOI: 10.37188/OPE.20263401.0035
      摘要:To satisfy augmented reality (AR) optical waveguide requirements for a large field of view (FOV), thin and lightweight form factor, and high immersion, a mass-producible thin-lightweight transmissive head-mounted display (HMD) optical system based on an arrayed geometric thin-film waveguide architecture was designed and implemented. The system comprises an eyepiece, an arrayed waveguide module, and an illumination module. The arrayed waveguide employs an array of semi-transparent, semi-reflective films with customized angle-selective coatings as the optical combiner, enabling pupil expansion. A dual-waveplate polarization-mixing scheme was introduced, improving display uniformity from 59.66% to 61.12% under single-polarized illumination and to 76.61% in practical measurements. Through comprehensive optical simulation and system-level optimization, and supported by a mature domestic optical manufacturing chain-including precision cutting, grinding and polishing, vacuum deposition of complex multilayer films, and high-precision bonding-together with an added waveplate lamination process, key technical barriers limiting the mass production of arrayed waveguides were overcome. A prototype was fabricated, and mass production was achieved. The proposed approach resolves the longstanding trade-off between optical performance and manufacturability in AR HMDs, supporting both consumer and industrial applications.  
      关键词:waveguide;near-eye display technology;optical design;augmented reality;head-mounted display   
      19
      |
      9
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146562009 false
      更新时间:2026-01-21
    • 在空间天气监测等领域,我国研制出高灵敏度单光子计数紫外成像探测系统,为紫外波段高灵敏度成像探测提供关键技术支撑。
      GUO Naifei, SUN Liang, HE Lingping, MA Tianqi, TAO Chen, WANG Wei, YAN Wanhong
      Vol. 34, Issue 1, Pages: 44-52(2026) DOI: 10.37188/OPE.20263401.0044
      摘要:To meet the increasing demand for high-sensitivity, large-dynamic-range detection in space weather monitoring, ultraviolet (UV) early-warning reconnaissance, and advanced analytical instrumentation, a single-photon-counting UV imaging detection system is presented. High sensitivity and low noise in the UV band are achieved by integrating an array single-photon counter with a UV-enhanced digital micromirror device (DMD) featuring small pixel size and a high spatial light-modulation rate. Dynamic adjustment of spectral sequences on a per-pixel basis is enabled, and pixel-level modulation of incident photons is implemented at the array single-photon counter. When weak target spectral signals are identified, an encoding modulation strategy is applied to improve the signal-to-noise ratio (SNR) within the target imaging region. Based on region identification and spatial-domain correlation, multi-level irradiance control is imposed on corresponding sensor pixels, allowing saturated regions to be rapidly driven into the optimal response range. The coupled internal optical architecture is described, and key performance metrics including wavelength range, dark count rate, and dynamic range are evaluated. Experiments demonstrate a response wavelength range of 180-320 nm, a dynamic range of 163.9 dB, a dark count rate of 0.481 cps/cm², and a system transfer function exceeding 0.6, consistent with the design targets. The developed system supports high-sensitivity UV spectral imaging with low dark counts, fine spatial resolution, and high-speed spatial modulation, providing enabling technology for high-sensitivity UV imaging applications in China.  
      关键词:shortwave optics;UV imaging detection;single-photon-counting imaging;spatial light modulation;coupled imaging;high-sensitivity detection   
      94
      |
      109
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 140445669 false
      更新时间:2026-01-21
    • 在激光显示领域,研究者提出了一种基于聚苯乙烯微球与聚二甲基硅氧烷混合物的体散射片,有效抑制了激光散斑问题,为激光显示系统提供了一种无需外部驱动、无噪音且高效的静态激光散斑抑制方案,具有重要的工程应用价值。
      WU Qizhou, JIA Bei, CHEN Yalin, YAN Yuxiang, MA Weiguang, TONG Zhaomin
      Vol. 34, Issue 1, Pages: 53-60(2026) DOI: 10.37188/OPE.20263401.0053
      摘要:To mitigate speckle arising from the high coherence of lasers in display applications, a static speckle-suppression approach is proposed based on a volume scattering diffuser (VSD) formed from a mixture of polystyrene (PS) microspheres and polydimethylsiloxane. PS microspheres with an average diameter of 600-700 nm were synthesized via soap-free emulsion polymerization, and VSDs with varying thicknesses were fabricated by spin coating. The scattering capability of the VSD was enhanced with increasing thickness as the spin-coating speed decreased from 1 300 r/min to 600 r/min, and the scattering angle increased from 44° to 100°. In an optical test system, thicker VSDs more effectively reduced the temporal and spatial coherence of the laser, lowering the speckle contrast to 0.35. This work demonstrates an external-driver-free, noiseless, and high-efficiency static technique for laser speckle suppression in laser displays, with significant application potential.  
      关键词:laser display;laser speckle;volume scattering diffuser;static speckle suppression   
      12
      |
      4
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146562106 false
      更新时间:2026-01-21

      Micro/Nano Technology and Fine Mechanics

    • 铝基碳化硅反射镜表面改性技术取得新进展,优化制备镍磷非晶膜工艺,提升镀层界面结合强度与材料致密度,满足空间光学反射镜高稳定性需求。
      ZHANG Zhaoqian, ZHOU Tianfeng, WU Xunwei, YU Qian, ZHAO Bin
      Vol. 34, Issue 1, Pages: 61-71(2026) DOI: 10.37188/OPE.20263401.0061
      摘要:Aluminum-based silicon carbide is an attractive material for next-generation space optical mirrors because of its high specific stiffness and low density; however, surface alumina and multiphase interfacial defects hinder the attainment of optical-grade surfaces. To meet the optical finishing requirements of aluminum-based silicon carbide mirrors, an electroless nickel-phosphorus amorphous film was deposited on aluminum-based silicon carbide substrates for surface modification. The deposition technology for an amorphous Ni-P film on aluminum-based silicon carbide (SiCp volume fraction: 20%) was investigated using the process route of cleaning-primary zinc immersion-dezincification-zinc removal-secondary zinc immersion-electroless nickel plating, with emphasis on oxide-film elimination, interface activation, enhancement of interfacial bonding strength, and coating-thickness control. During primary zinc immersion, the surface oxide layer was partially removed via the formation of a friable reaction layer; however, the resulting activation layer was shallow and prone to peeling. Zinc nitrate dezincification effectively removed the reaction layer and exposed the substrate. Secondary zinc immersion enabled large-area, continuous activation, and an activation layer was also formed on the surfaces of silicon carbide particles. Consequently, the interfacial bonding strength was markedly improved relative to the untreated substrate. An L9 orthogonal design indicated that electroless plating time exerted the greatest influence on coating thickness, followed by secondary and primary zinc-immersion times. The optimal parameter set comprised 8 h electroless plating, 10 s primary zinc immersion, and 2 min secondary zinc immersion, yielding a coating thickness of approximately 50-60 μm. The proposed process enhances coating density and interfacial bonding through stepwise activation and a crystal-cell stacking growth mechanism, providing an effective surface-modification strategy for the high-stability requirements of space optical mirrors.  
      关键词:Aluminum-based silicon carbide;composite materials;Surface activation;Electroless nickel plating;Plating thickness   
      109
      |
      61
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 136748753 false
      更新时间:2026-01-21
    • 在柔性光电探测器领域,研究人员利用压电光电子效应,通过电纺直写技术构建了ZnO@(Cu(NH3))(CN)多层纳米纤维堆叠结构,显著提升了器件性能,为低功耗、高响应的柔性光电探测系统提供了新思路。
      DU Xianruo, WANG Junxiang, CHEN Ruixin, CHEN Huatan, ZHENG Gaofeng
      Vol. 34, Issue 1, Pages: 72-85(2026) DOI: 10.37188/OPE.20263401.0072
      摘要:Flexible photodetectors show considerable potential for wearable electronics, curved imaging, and intelligent sensing; however, their performance is frequently constrained by weak interfacial coupling between micro/nanoscale photoactive structures and flexible substrates, leading to high dark current and elevated power consumption. Here, a performance-enhancement strategy exploiting the piezophototronic effect is proposed. A dual-component self-assembled ZnO@(Cu(NH3))(CN) multilayer nanofiber architecture was fabricated via electrohydrodynamic direct writing, in which ZnO functions as the primary photoactive fiber and (Cu(NH3))(CN) serves as an auxiliary modulation layer. This stacked configuration markedly strengthens interfacial coupling stability and, through piezophototronic modulation at the stacking interfaces, introduces asymmetric barriers and built-in electric fields that effectively suppress thermally excited carrier transport. As a result, the dark current is reduced to 1.12×10-7 A, substantially decreasing static power consumption. By adjusting the number of stacked layers (5-25), the threshold voltage can be tuned from 6 to 20 V, enabling programmable logic control. Under 254 nm ultraviolet illumination, a responsivity of 13.3 A/W is achieved, with response and recovery times of 11 ms and 9 ms, respectively, demonstrating excellent photoelectric detection performance. These results indicate that the orthogonally stacked ZnO and (Cu(NH3))(CN) nanofiber architecture enables superior local electric-field regulation and enhanced photoelectric conversion efficiency, offering strong promise for low-power, high-response flexible photodetectors.  
      关键词:Electrohydrodynamic direct-writing;flexible photodetector;piezophototronic enhancement effect;Light controlled circuit   
      57
      |
      51
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 136750261 false
      更新时间:2026-01-21
    • 在手机保护壳领域,研究人员采用LIBT技术制备出高效抗菌手机保护壳,突破了传统技术局限,为抗菌表面设计与制造提供新参考。
      WENG Zhankun, LUO Yinghui, HU Junting, XIA Liqun, ZHAO Zhen, LI Tao, WANG Guanqun, LIU Tong, QIAO Jian
      Vol. 34, Issue 1, Pages: 86-97(2026) DOI: 10.37188/OPE.20263401.0086
      摘要:A commercially available transparent phone case was used as the substrate, and an Ag@Cu micro-/nano-composite antibacterial coating was fabricated on its surface via laser-induced backward transfer (LIBT). The results indicated a positive correlation between laser power and the elemental content of the microstructured coating, with the mass ratios of Ag and Cu increasing as laser power increased. This approach enables efficient fabrication under ambient atmospheric conditions with a simple and cost-effective process. The resulting microstructured coating exhibited pronounced antibacterial activity against both Gram-positive Staphylococcus aureus and Gram-negative Escherichia coli. At laser powers of ≥16 W, the antibacterial rate exceeded 99.99%, and the increased copper mass ratio further enhanced antibacterial efficacy. These findings address key limitations of conventional technologies by enabling high-precision construction of micro-/nano-structures on transparent substrates. Moreover, the processing parameters are precisely controllable, supporting scalable manufacturing and offering a general strategy for functionalizing the surfaces of other transparent organic materials. Overall, a practical route for producing antibacterial phone cases is established, providing methodological guidance for the design and fabrication of advanced antibacterial surfaces and addressing bacterial contamination and insufficient antibacterial performance in mobile phone casings.  
      关键词:Laser-induced backward transfer;Ag@Cu micro-nano structure;antibacterial performance;mobile protective case   
      11
      |
      4
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561816 false
      更新时间:2026-01-21
    • 在飞行汽车领域,专家提出了一种涵道螺旋桨设计方法,通过叶素动量理论结合CFD修正,提升了悬停状态下的拉力和效率,为飞行汽车推进系统设计提供技术支撑。
      ZHANG Zengkui, LI Jingyang, YIN Mingwei, HAO Wenqian, XIE Jiamiao, WANG Zhe, GAO Shuyu
      Vol. 34, Issue 1, Pages: 98-111(2026) DOI: 10.37188/OPE.20263401.0098
      摘要:To design a ducted propeller that satisfies the requirement for high thrust efficiency during hover in a ducted flying car, a dedicated design methodology is proposed. Blade geometry is developed using blade element momentum theory in conjunction with ducted propeller vortex theory, and the resulting designs are refined through CFD-based corrections to meet target performance. Under hover conditions, the aerodynamic performance of the ducted propeller is evaluated and compared with that of an isolated propeller. The results indicate that, due to suppression of blade-tip vortices and the additional lift generated by the duct, total thrust is increased by 11.1% while power consumption is reduced by 10.5% relative to the isolated propeller. To further enhance hover performance, an innovative variable-duct propeller configuration is proposed, in which the flare expansion angle is dynamically adjusted to maximize inlet airflow during hover, yielding a 15.53% increase in total thrust. The study validates a high-accuracy design framework that integrates blade element momentum theory with CFD correction, providing theoretical and technical support for efficient aerodynamic design of propulsion systems for flying vehicles.  
      关键词:flying car;ducted propeller;blade element momentum theory;numerical simulation;aerodynamic characteristic;variable duct design   
      11
      |
      3
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561814 false
      更新时间:2026-01-21
    • 在精密驱动领域,跨尺度纵切模态压电电机通过双模式工作,实现了高速与高精度的兼容,为跨尺度精密定位驱动提供了有效技术方案。
      ZENG Sizhe, FENG Yaohui, WANG Haoran, WANG Ya, LI Ruijun, PAN Qiaosheng
      Vol. 34, Issue 1, Pages: 112-123(2026) DOI: 10.37188/OPE.20263401.0112
      摘要:To overcome the limitation of existing piezoelectric motors in simultaneously achieving high-speed motion and high-resolution positioning, a cross-scale longitudinal-shearing hybrid vibration piezoelectric motor was designed. Dual-mode operation in quasi-static and resonant states was implemented to satisfy the combined requirements of speed and precision in precision-drive applications. The stator integrates a longitudinal vibration transducer driven by the piezoelectric d33 effect and a sandwiched shear transducer driven by the d15 effect. Elliptical motion at the driving foot is generated by orthogonal vibrations at identical frequencies. Key parameters were optimized via finite-element simulation (COMSOL). For quasi-static operation, the primary hinge stiffness was set to 16.66 N/μm to improve displacement output efficiency; for resonant operation, structural parameters were tuned to achieve modal degeneration of the longitudinal (7 829.6 Hz) and bending (7 860.7 Hz) vibrations. The experiments showed that under quasi-static excitation (800 Hz, preload 0.38 N), a minimum displacement resolution of 39 nm and a maximum speed of 0.95 mm/s were achieved. Under resonant excitation (6 780 Hz, 30° phase difference), a maximum no-load speed of 125.33 mm/s and a peak load capacity of 0.45 N were obtained. The proposed motor outperforms conventional single-mode piezoelectric motors in both speed (125.33 mm/s) and resolution (39 nm), effectively mitigating the intrinsic trade-off between high speed and high precision and providing a practical solution for cross-scale precision positioning drives.  
      关键词:piezoelectric motor;longitudinal-shearing mode;cross-scale driving;piezoelectric ceramics;quasi-static   
      13
      |
      3
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561608 false
      更新时间:2026-01-21

      Information Sciences

    • 在无源定位领域,研究者提出了一种基于图像匹配的新方法,通过特征增强和聚类筛选策略,显著提升了定位精度,为该技术的发展提供了新方向。
      JIA Ping, LI Changhao, SUN Hui, SONG Yueming, MA Zhuoluo, XU Fang
      Vol. 34, Issue 1, Pages: 124-138(2026) DOI: 10.37188/OPE.20263401.0124
      摘要:The workflow begins with approximate orthorectification of aerial images using coarse positioning results and flight parameters, followed by extraction of the corresponding satellite image regions. RepVGG is employed to extract coarse image features, and initial correspondences are established via nearest-neighbor matching. Candidate pairs are then filtered using MiHo clustering and normalized cross-correlation (NCC). The refined correspondences are further enhanced by a Transformer module to achieve precise alignment. Based on the resulting high-precision matches, an angular error correction matrix is constructed, and iterative refinement is applied to effectively compensate for systematic errors. Experimental results indicate that the proposed method substantially improves positioning accuracy relative to conventional approaches, achieving an approximately 70% enhancement in representative scenarios; even at a slant range of 90 km, the positioning error remains approximately 120 m. By leveraging image matching under challenging viewing geometries, the proposed passive localization method provides a novel solution to limitations of conventional algorithms. A two-step matching strategy integrating Transformer-based feature enhancement with MiHo clustering-based filtering is introduced, effectively mitigating the accuracy degradation of traditional passive localization under high-altitude, large-oblique viewing conditions in the presence of small angular errors.  
      关键词:image matching;target location;airborne photoelectric system;passive localization;large strabismus;Transformer.   
      10
      |
      3
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561891 false
      更新时间:2026-01-21
    • Multi-core compression imaging system based on JPEG-LS AI导读

      在航天CMOS相机领域,提出了基于FPGA的JPEG-LS图像压缩方案,实现了高分辨率成像系统的实时压缩,满足了遥感图像压缩速率和质量需求。
      LIU He, ZHANG Liu, LI Guiyang, SHEN Sangyuan, ZHANG Jiabao
      Vol. 34, Issue 1, Pages: 139-149(2026) DOI: 10.37188/OPE.20263401.0139
      摘要:To address the constraints of hardware cost and onboard data storage in high-resolution aerospace CMOS cameras for small satellites, a full-pipeline FPGA-based implementation of the JPEG-LS image compression algorithm is proposed. A multicore real-time compression imaging system is integrated on a single FPGA using a parallel grouping architecture. First, multichannel high-speed image streams from CMOS detectors are received by the FPGA. Second, the JPEG-LS algorithm is realized using an eleven-stage pipeline, and the encoding-parameter calculation and context-update modules are structurally optimized to reduce the critical path. Finally, multiple JPEG-LS compression kernels are employed to group and compress the multichannel data in parallel. Experimental results indicate that, in the implemented system, the optimized JPEG-LS core achieves a maximum operating frequency of 46 MHz. With the compression parameter near set to 1, a near-lossless compression ratio exceeding 4 is obtained, and the peak signal-to-noise ratio (PSNR) of the decompressed images is approximately 50 dB. These results satisfy the compression-rate and image-quality requirements for remote-sensing imagery, providing a practical reference for designing high-resolution spaceborne CMOS cameras with integrated image compression.  
      关键词:image compression;CMOS detector;Field Programmable Gate Array(FPGA);Pipeline;multi-core compression imaging   
      51
      |
      57
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 136748713 false
      更新时间:2026-01-21
    • 科技媒体新闻记者报道,最新研究提出了一种时空对比学习驱动的弱监督图像语义分割网络,通过挖掘时间、空间监督信息,提高分割精度。实验结果表明,在PASCAL VOC和MS COCO数据集上的平均交并比分别达到72.7%以及43.6%,证明了所提出方法的优越性。
      LIANG Zhen, HU Yanzhu, YANG Yang
      Vol. 34, Issue 1, Pages: 150-166(2026) DOI: 10.37188/OPE.20263401.0150
      摘要:Existing image-level weakly supervised semantic segmentation methods based on Vision Transformer (ViT) primarily rely on self-attention to extract limited semantic information and often fail to fully exploit multi-dimensional feature relationships, resulting in coarse target region identification. To address this limitation, a Spatio-temporal Contrastive Learning network (STCL) is proposed to improve segmentation accuracy by mining supervisory signals from both spatial and temporal perspectives. Specifically, a spatial feature contrastive learning module is introduced based on ViT token representations, integrating patch-level and class-level token contrastive strategies to capture implicit semantic relationships in image space. In addition, a temporal context contrastive learning module is developed, in which a memory bank is leveraged to incorporate prior knowledge from historical images to guide current segmentation, together with a memory bank update strategy and an adaptive memory contrastive loss to enhance discrimination of fine-grained regions. STCL achieves a mean Intersection over Union (mIoU) of 72.7% on PASCAL VOC and 43.6% on MS COCO, demonstrating superior performance.  
      关键词:computer vision;semantic segmentation;weakly supervised learning;class activation map;vision transformer;contrastive Learning   
      13
      |
      4
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561812 false
      更新时间:2026-01-21
    • Real-time image semantic segmentation based on three-branch network AI导读

      在自动驾驶环境感知领域,研究人员提出了一种基于三分支网络的实时图像语义分割算法,有效平衡了模型精度与推理速度,显著提升了语义分割性能。
      REN Fenglei, GAO Ziyang, ZHANG Yan, ZHOU Haibo, YANG Lu, QIN Zhichang
      Vol. 34, Issue 1, Pages: 167-177(2026) DOI: 10.37188/OPE.20263401.0167
      摘要:To meet the stringent requirements for both accuracy and real-time performance in applications such as autonomous driving, a real-time image semantic segmentation algorithm based on a triple-branch network is proposed to achieve a favorable balance between segmentation accuracy and inference speed. Inspired by PIDNet, a triple-branch architecture is designed to extract fine-grained detail information, semantic contextual information, and edge cues from the input image, respectively. An efficient pyramid pooling module is integrated into the semantic context branch to capture multi-scale contextual information and enlarge the network receptive field. In addition, a lightweight multi-scale channel interaction attention mechanism is introduced into both the detail and edge branches to enhance feature representations. Features from the three branches are subsequently fused, and a semantic segmentation head is employed to produce the final result. The proposed network achieves 79.2% mIoU at 88.5 frame/s on the Cityscapes dataset and 80.5% mIoU at 140.1 frame/s on the CamVid dataset. Experimental results demonstrate that the proposed method performs semantic segmentation efficiently, providing an effective trade-off between real-time performance and accuracy while significantly outperforming existing baseline methods.  
      关键词:semantic segmentation;deep learning;real time;attention mechanism;multi-scale features   
      17
      |
      4
      |
      0
      <HTML>
      <L-PDF><WORD><Meta-XML>
      <引用本文> <批量引用> 146561864 false
      更新时间:2026-01-21
    0