摘要:To address the challenges of low robustness and accuracy in structured light 3D measurement systems caused by environmental interference and the geometric characteristics of measured objects, this study focuses on improving the system's projection encoding method, projection efficiency, and related algorithms. First, to enhance system robustness, a novel approach combining complementary Gray code and phase-shifting is proposed. Second, to reduce the extended projection time resulting from multiple patterns, a synthetic pattern parallel loading method is introduced, significantly accelerating projection speed. Third, to mitigate issues such as light occlusion and shadows caused by object geometry, a dual-projector single-camera system is designed, expanding the measurement field of view and effectively eliminating shadows. Finally, a comprehensive performance comparison with existing methods, including complementary Gray code with phase-shifting and the multi-frequency heterodyne method, is conducted to validate the proposed approach. Experimental results demonstrate that the proposed method substantially improves system robustness and overall performance. Specifically, the synthetic pattern parallel loading method enhances projection efficiency by 55.51% compared to the baseline approach, achieving a measurement standard deviation of 0.014 mm. The proposed method significantly enhances system robustness and accuracy, surpassing traditional encoding techniques.
摘要:The six-degree-of-freedom measurement system based on laser tracking plays a critical role in the production, manufacturing, and assembly of large-scale equipment, where accurate pose measurement is essential for ensuring precise determination of position and orientation. To address the challenges of calibration in laser tracking pose measurement systems operating in long-distance and wide-field scenarios, this study presents a novel calibration method leveraging multi-position targets. The methodology focuses on laser tracking systems utilizing cameras, starting with a detailed introduction to the system's architecture and pose measurement principles. Subsequently, the pose-solving algorithm is analyzed to identify key factors influencing calibration accuracy. Based on this analysis, a parameter calibration method is developed using spatial multi-pose targets, employing spatial geometric constraints and the least squares principle for optimal parameter estimation. A laser tracking experimental platform, constructed with a total station and a movable calibration board designed to create a stereotactic target, is used to evaluate the proposed method. Experimental results demonstrate that, within a measurement range of 3-15 m, the proposed calibration method enhances pose measurement accuracy by over 25.4% compared to conventional techniques. This method offers significant potential for applications requiring high-precision pose measurements in long-distance and wide-field environments.
关键词:Laser tracking;attitude measurement;long-range;large field of view;calibration
摘要:To address the limitations of current direct optical film thickness monitoring systems, including the dispersion of halogen lamp light sources, weak detector signal amplitudes, and poor monitoring accuracy in the short-wave band, as well as the spectral non-rectangularity-induced errors in the fabrication of narrow-band filter films (bandwidth <5 nm), this study proposes an innovative optical system design. By analyzing the intensity distribution of the HLX64623 halogen lamp and the principles of optical fiber coupling for signal reception, an initial structure for a collimated focusing coupled optical system was developed based on the structural parameters of the coating machine. The design incorporates an aspheric U-type reflector and free-form surface lenses to achieve collimation of the dispersive light source, focusing of optical signals, and efficient optical fiber coupling. Using Zemax software, the system was iteratively optimized with the irradiance and spot size on the substrate and the optical fiber receiving end as key objectives. After optimization, the irradiance intensity on the substrate increased by 147%, while the irradiance intensity at the optical fiber receiving end improved from 0 to 1.53%, compared to the independent light source configuration. When installed on a coating machine, the collimated focusing coupled optical system demonstrated a 362.3% improvement in signal intensity and a 91.9% enhancement in the signal-to-noise ratio compared to the independent light source. Experimental validation involved the fabrication of narrow-band filter films with a central wavelength of 365 nm and a bandwidth of 10 nm. Continuous deposition across three coating cycles achieved a central wavelength shift of less than 1 nm and a full width at half maximum of 10 nm. These results confirm that the proposed system enables high-precision monitoring of narrow-band filter films across wavelengths ranging from the near-ultraviolet to the visible and near-infrared regions.
关键词:optical film;film thickness monitoring;optical path design;fiber optic coupling;optical film thickness
摘要:Addressing the challenge of scale discrepancies between point clouds in dual-projected structured light systems, which hinder their direct fusion, this study presents a novel multi-scale point cloud fusion method. First, the dual-projected structured light system is calibrated to determine the intrinsic and extrinsic parameters of the cameras and projectors, along with the relative external parameters of the two systems. The weighted least squares phase expansion method is applied to extract the absolute phase, and point cloud data are computed using the calibrated parameters. Next, the spatial distances between any two points within the overlapping regions of the two systems are calculated to determine the relative scale factors, achieving scale unification between the point clouds. Coarse alignment is then performed using principal component analysis, followed by the derivation of the global optimal transformation matrix using an iterative nearest neighbor algorithm. The source point cloud is subsequently transformed to align with the target point cloud based on the calculated transformation matrix. Finally, the Euclidean distance is employed to evaluate the fused neighboring points, and outlier points are removed using a predefined threshold, ensuring accurate multi-scale point cloud fusion. Experimental results demonstrate that the standard deviation of the plane fitting error decreases by approximately 19.56% after fusion, validating the method’s effectiveness in fusing point clouds and reconstructing object surfaces at varying scales. In conclusion, this research provides a robust solution for addressing scale-induced errors in dual-projected structured light systems and successfully mitigates missing data caused by occlusion.
摘要:To generate a broader spectrum, higher power, and a more streamlined white light supercontinuum output, this research investigates the application of self Q-switching technology in producing high-power white light supercontinuum through stimulated Brillouin scattering (SBS) and Rayleigh scattering (RS) mechanisms in optical fibers. The experimental setup employed a semi-open cavity structure comprising ytterbium-doped fiber, photonic crystal fiber, and high-reflectivity grating. Without external pulse modulation, the study successfully achieved a high-power white light supercontinuum spectrum spanning 410-1 700 nm, with an output power of 24 W and a repetition frequency of 188.7 kHz at a pump power of 97.4 W. The light-to-light conversion efficiency reached 24.6%. These findings demonstrate that the white light supercontinuum fiber laser based on the SBS self Q-switching mechanism represents a novel light source with significant advantages, including a simplified structure, extensive spectral range, and superior spectral uniformity. The research provides an innovative design approach for developing high-power white light supercontinuum spectral light sources.
摘要:When a line laser scans metal workpieces, the high reflectivity of certain metal surfaces often leads to poor scanning results. While grid laser scanning is faster than single-line and multi-line scanning, challenges such as center extraction and optical plane positioning hinder its ability to meet the demands of efficient real-time detection of metal workpieces. This study proposes a novel method to address these issues by calibrating the rotation angle of a polarizer to optimize the suppression of high surface reflectivity through the polarization state of the grid laser. In addition, a laser center mapping technique based on normal vectors is developed, where the laser center is mapped to the optical plane using the eigenvector corresponding to the maximum eigenvalue of the pixel Hessian matrix. A center extraction method based on affine transformation is also introduced, enabling the transformation of non-orthogonal grids into orthogonal ones to improve the accuracy of 3D reconstruction. Furthermore, an error correction model is established to reduce measurement system errors by accounting for variations in height. Experimental results demonstrate that the traditional grid laser algorithm struggles with missing edges when scanning highly reflective metal surfaces, resulting in a measurement error of 0.8 mm. In contrast, the proposed method effectively suppresses the impact of high reflectivity, achieves accurate mapping between the laser center and the optical plane, and reduces the measurement error to 0.39 mm. After error correction, the accuracy is further improved, achieving a measurement error of just 0.1 mm, representing an 87.5% reduction compared to the traditional grid laser algorithm.In conclusion, this study investigates 3D reconstruction using grid laser scanning under high-reflection suppression for metal surfaces, significantly improving both scanning efficiency and measurement accuracy. The proposed method provides a valuable approach and reference for future industrial measurement applications.
摘要:This study explores the application of precision glass molding (PGM) technology to enhance the machining accuracy and efficiency of aspherical microlens arrays (AMLA) for fiber array collimation. The research focuses on the control of mold machining accuracy and the implementation of a compensation approach to address form errors in AMLA fabrication. A localized spiral diamond milling method is introduced for machining nickel-phosphorus mold materials, tailored to the structural characteristics and stringent precision requirements of AMLA. Furthermore, a form error compensation method is developed, considering the high-temperature properties of mold and glass materials as well as process parameters affecting accuracy. By optimizing the mold profile through compensation design, the precision of PGM is significantly improved, achieving the high-accuracy fabrication of a 10×10 AMLA. The proposed process controls the form error peak-to-valley (PV) within 220-380 nm and achieves a surface roughness (Ra) of 7-10 nm, thereby enhancing optical performance. This work provides a viable and efficient technical framework for the high-precision, large-scale production of AMLA.
摘要:To enhance the reusability of nickel slag, a novel approach is proposed to modify nickel slag and utilize it in the precision polishing of magnetic compound fluids (MCF), leveraging the magnetic elements inherent in nickel slag. This study investigates the relationship between the modified slag content and the polishing performance of MCF. The methodology involves several steps: first, nickel slag is processed through melting, oxidation, crushing, and magnetic separation to produce modified slag, whose composition and magnetic properties are subsequently analyzed. Next, the influence of MCF containing varying mass fractions of modified slag on surface roughness (Ra) and material removal rate is evaluated by examining the surface morphology of polymethyl methacrylate after polishing. The effect of modified slag on MCF formation is further explored by observing the morphology of MCF magnetic clusters before and after polishing under an external magnetic field. The intrinsic relationship between MCF morphology and polishing forces is analyzed by measuring the polishing forces of different slurries. Finally, the polishing mechanism of MCF containing modified slag is constructed by integrating observations of MCF polishing cluster microstructures, morphology, and polishing force characteristics. The experimental results indicate that the saturation magnetization of the modified slag is 5.64 times higher than that of unmodified nickel slag. As the mass fraction of modified slag increases, the polishing performance of MCF decreases. When the modified slag content reaches 10%, the surface roughness of the workpiece is significantly reduced from 0.502 μm to 0.010 μm within 10 minutes, corresponding to a surface roughness reduction rate of 97.966%. This reduction rate is only 0.482% lower than that achieved with MCF without modified slag but is 3.603% higher than the rate obtained with 15% modified slag content. In addition, the material removal rate reaches 1.237×10⁸ μm³/min. As the modified slag content increases, the curvature of the chain-like structures formed by magnetic particles during polishing increases, leading to a decrease in polishing force and, consequently, polishing performance. The proposed polishing mechanism reveals that modified slag occupies the middle and rear segments of the magnetic cluster and exhibits strong shear resistance. These findings demonstrate that modified nickel slag is suitable for MCF polishing applications, with optimal polishing performance achieved when the modified slag content in the MCF remains below 10%.
摘要:This study explores the clamping-driving coordination mechanism and control strategies for a flexible clamp-type inchworm actuator, introducing an innovative high-speed clamping switching method based on real-time clamping state information. By utilizing the parasitic motion characteristics of the flexible clamping mechanism, the proposed strategy minimizes clamping alternation time, thereby achieving higher driving speeds under identical design parameters. A static model of the parasitic motion of the clamping mechanism was first established based on the actuator's configuration and verified through simulations. The real-time clamping switching mechanism was subsequently analyzed, leading to the development of a driving strategy incorporating real-time clamping state feedback. An experimental setup was constructed to validate the theoretical model and simulation results. Experimental findings demonstrate that the proposed strategy reduces single-step switching time by 28% and increases continuous multi-step driving speed by 25% under the same design and driving voltage conditions. These results confirm the accuracy of the theoretical and simulation analyses within an acceptable margin of error. The proposed strategy significantly reduces the single-cycle operating time of the inchworm actuator, enabling continuous and stable actuation while substantially enhancing driving performance.
关键词:Flexible mechanism;Inchworm actuator;Parasitic movement;Collaborative switching control
摘要:In industrial vision systems, subjective assessment is costly, pre-training for no-reference quality evaluation is time-intensive, and there is a critical need for a highly accurate full-reference image quality assessment model. To address these challenges, this study proposes a novel full-reference image quality assessment model based on singular value decomposition (SVD) with weighted texture information. First, SVD is applied to the reference image blocks, and the singular values of the distorted blocks are estimated using the singular vectors of both the reference and distorted image blocks, yielding the brightness similarity component. Next, the estimated singular values of the distorted image blocks are used to quantify average offset distortion and contrast change distortion, resulting in the contrast similarity component. The structural similarity of the images is then determined by analyzing the deviation of the singular vectors of the distorted image blocks from the unit matrix of the reference image blocks. Finally, the brightness, contrast, and structural similarity components are weighted using texture information to construct the full-reference image quality assessment model. The proposed method was evaluated on six widely used image quality assessment databases across four performance criteria. Experimental results demonstrate that the model achieves a weighted Spearman rank correlation coefficient of 0.896 3 across the datasets. For contrast change distortion, the model attains a Spearman rank correlation coefficient of 0.859 5, outperforming the second-best method by 85%. Compared to existing full-reference image quality assessment models, the proposed approach offers significant advantages in prediction accuracy, generalization capability, and computational efficiency.
关键词:image quality assessment;full reference;singular value decomposition;texture information;image contrast
摘要:To address the challenges of high computational complexity, limited detail extraction, and fuzzy boundaries in the current DeepLabv3+ semantic segmentation network, this study proposes an enhanced DeepLabv3+ model incorporating attention mechanisms. Specifically, the lightweight MobileNetV2 is employed as the backbone to balance high representational capacity with a significant reduction in model parameters. A parameter-free lightweight attention mechanism (SimAM) is integrated into the low-level features of the backbone network to prioritize key features and enhance feature extraction capabilities. Furthermore, the global average pooling in the ASPP module is replaced with Haar Wavelet Transform Downsampling (HWD) to preserve spatial information. An External Attention Mechanism (EANet) is also introduced after the ASPP module to leverage contextual information and achieve multi-scale feature fusion, thereby improving semantic understanding and segmentation accuracy. Experimental results demonstrate that the proposed model achieves a 2.82% improvement in mean Intersection over Union (mIoU) on the VOC2012 dataset compared to the original DeepLabv3+ model. This research enhances the precision of semantic segmentation and offers novel insights for advancing applications in computer vision.
摘要:The accurate extraction of text content from images is hindered by the absence of scale transformation in feature representation and insufficient resolution, which misguides the reconstruction network. To address this challenge, this paper proposes a novel multi-modal semantic interactive text image super-resolution reconstruction method. By incorporating an attention mask within the semantic inference module, the method corrects text content information and employs semantic prior knowledge to constrain and guide the reconstruction of semantically accurate super-resolution text images. To enhance the network's representational capacity and accommodate text images of varying shapes and lengths, a multimodal semantic interaction block is introduced. This block consists of three key components: a visual dual-flow integration module, a cross-modal adaptive fusion module, and an orthogonal bidirectional gated recurrent unit. First, the visual dual-flow integration module captures multi-granularity visual information, including contextual understanding, by leveraging the complementary strengths of global statistical features and robust local approximations. Next, the cross-modal adaptive fusion module dynamically facilitates interaction and alignment between semantic information and multi-granularity visual features, effectively reducing cross-modal feature discrepancies. Finally, the orthogonal bidirectional gated recurrent unit establishes multimodal feature dependencies in both vertical and horizontal orientations. Experimental results on the TextZoom test set demonstrate that the proposed method outperforms state-of-the-art approaches in terms of quantitative metrics, achieving significant improvements in PSNR and SSIM. Compared to the TPGSR model, the proposed method increases the average recognition accuracy of ASTER, MORAN, and CRNN by 2.9%, 3.6%, and 3.7%, respectively. These findings highlight the effectiveness of multimodal semantic interaction in enhancing text image super-resolution and improving text recognition accuracy.
摘要:To address the challenges of the absence of baseline ground truth and the underutilization of visible light information in infrared and visible light image fusion using denoising diffusion models, this study introduces a novel conditional diffusion and multi-channel high-low frequency parallel infrared and visible light image fusion model. First, a conditional diffusion model is developed, employing a splicing technique to generate spliced source images as ground truth during training, thereby facilitating an optimal prior distribution for feature extraction in infrared and visible images. During the reverse denoising process, a multi-channel likelihood correction module is incorporated to effectively model the intricate multi-channel distribution of these images. Subsequently, a detail-adaptive denoising network is proposed to perform multi-channel high- and low-frequency feature extraction for infrared and visible light images. The model also integrates a multi-channel high- and low-frequency parallel fusion module within the fusion network, which utilizes a regional consistency fusion network and a multi-channel low-frequency feature fusion network to merge high- and low-frequency features, respectively. This approach introduces a trainable diffusion-based paradigm for feature extraction in infrared and visible light image fusion tasks, leveraging specialized convolutional neural networks for feature integration. Comparative experiments on the MSRS and RoadScene datasets, against nine state-of-the-art methods, reveal that the proposed model improves the average performance across eight objective evaluation metrics by 4.52% to 59.62%. The method demonstrates superior performance in maintaining color fidelity and preserving texture details, aligning well with human visual perception, and proves robust in handling diverse lighting and environmental conditions for infrared and visible light image fusion tasks.
关键词:image fusion;infrared and visible;Conditional Diffusion Model;Detail Adaptive Denoising Block;Multi-Channel High-Low Frequency Parallel Fusion Block