Article

A Novel Robotic-Vision-Based Defect Inspection System for Bracket Weldments in a Cloud–Edge Coordination Environment

1 Henan Provincial Key Laboratory of Intelligent Manufacturing of Mechanical Equipment, Zhengzhou University of Light Industry, Zhengzhou 450002, China
2 School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
3 Tianjin Miracle Intelligent Equipment Co., Ltd., Tianjin 300131, China
* Author to whom correspondence should be addressed.
Sustainability 2023, 15(14), 10783; https://doi.org/10.3390/su151410783
Submission received: 19 May 2023 / Revised: 1 July 2023 / Accepted: 5 July 2023 / Published: 10 July 2023
(This article belongs to the Special Issue Digital Technology in Sustainable Manufacturing Systems)

Abstract

Arc-welding robots are widely used in the production of automotive bracket parts. The large amounts of fumes and toxic gases generated during arc welding can affect the inspection results, as well as cause health problems, so the product needs to be sent to an additional checkpoint for manual inspection. In this work, the framework of a robotic-vision-based defect inspection system was proposed and developed in a cloud–edge computing environment, which can drastically reduce the manual labor required for visual inspection, minimizing the risks associated with human error and accidents. Firstly, a passive vision sensor was installed on the end joint of the arc-welding robot, the imaging module was designed to capture images of bracket weldments after the arc-welding process, and datasets with qualified images were created in the production line for deep-learning-based research on steel surface defects. To enhance the detection precision, a redesigned lightweight inspection network was then employed, while a fast computation speed was ensured through the utilization of a cloud–edge computational framework. Finally, virtual simulation and Internet of Things technologies were adopted to develop the inspection and control software in order to monitor the whole process remotely. The experimental results demonstrate that the proposed approach can realize the faster identification of quality issues, achieving higher steel production efficiency and economic profits.

1. Introduction

Arc welding is an extremely important technology primarily embraced by the automobile manufacturing industry [1,2,3]. The adoption of arc-welding robots in the production process has yielded many benefits, including enhanced productivity, cost savings, and improved efficiency [4,5]. The arc-welding robot usually needs to be supplied with a stable voltage and current in the process of joining parts, but this is not easy to achieve in practice. Consequently, steel surface defects cannot be avoided, and their quality inspection is particularly important. The post-manufacturing inspection of parts is usually performed at checkpoints due to the open-loop production process. Often, experienced workers examine steel surface defects by eye, which makes the judgment subjective.
With the increasing demand for automatic inspection, vision-based inspection systems have opened a window of opportunity for automation-assisted detection [6,7,8,9]. As the computing power of hardware devices develops, vision-based inspection systems using high-accuracy detection algorithms have become a popular topic. Early machine vision algorithms focused on morphological characteristics for identifying key issues. Because the geometrical features of various defects are selected manually, some important feature information may be ignored in the process of feature extraction. Currently, convolutional neural networks (CNNs) [10], a type of promising deep learning model that relies on the network itself for automatic feature extraction, have been successfully applied to challenging visual tasks such as image classification [11,12,13,14], object detection [15,16,17,18] and image segmentation [19,20,21]. In pursuit of a better detection performance, such networks have been improved for various real-world scenarios and have become increasingly complex, meaning that higher device computing power is required. If this cannot be provided in actual manufacturing industries, the speed of defect detection will be affected, reducing production efficiency.
Edge computing, with the advantage of being closer to the data source, is proposed and usually used on-site to provide a faster response speed. Researchers have demonstrated that combining deep-learning-based approaches with edge computing can help to develop diverse application systems [22,23]. Edge computing can also be used in combination with cloud computing, which provides on-demand services for intensive calculation [24,25,26,27,28]. In order to complete the inspection task in a harsh industrial production environment, a more sophisticated system can be created by integrating advanced information technologies such as the Internet of Things (IoT), virtual simulation and cloud–edge computing. In this work, the framework of a robotic-vision-based defect inspection system is proposed, making use of advanced information technologies in a cloud–edge computing environment to attain fast computing performance. In the stage of steel surface defect inspection, the improved YOLOv5 object detection algorithm is applied to inspect multi-scale defects, in which a YOLO-SPD-Conv module, consisting of a space-to-depth layer and a non-strided convolution layer, is added to further retain discriminant information on small defects.
The following structure is applied in this paper: Section 2 presents the related work. The framework of the robotic-vision-based defect inspection system for bracket weldments is described in Section 3. The details of the surface defect detection algorithm for bracket weldments are explained in Section 4. The detection performance verified on the edge through experiments, the cloud–edge coordination inspection and the control software developed are described in Section 5. Finally, Section 6 discusses the conclusion and future work.

2. Related Works

As the problems associated with manual intervention in the manufacturing process have become apparent, the introduction of automation systems has considerably assisted workers in production lines. Lee et al. [29] developed a laser-vision-based inspection system for welding defects on automobiles’ shock absorbers. Because of noise in industrial environments, however, it is difficult to apply the laser vision sensor in production lines; therefore, an open-loop solution is applied to the system, in which the laser sensor is fixed and the shock absorber is moved. To solve these problems, Nguyen et al. [30] developed an improved laser-vision-based inspection system for small beads, in which a complementary metal oxide semiconductor (CMOS) camera is also applied to capture laser stripes. With this system, it has been proven that a higher detection accuracy can be ensured through the laser–camera mechanism. Chu et al. [31] developed an automated-vision-based system for non-destructive post-welding quality inspection. A new image-processing technique was proposed that can extract weld joint profiles and feature points, measure bead size and detect defects. The 3D surface of the weld can also be reconstructed to monitor weld quality online. Nevertheless, the detection algorithms involved in these systems cannot meet changing production demands that require a variety of defect features to be considered during algorithm design.
Senthil Kumar et al. [32] proposed a system to classify various defects on butt joints. The specific algorithm process is divided into the following two steps: Firstly, the regions of interest in defect images are localized and segmented. Then, a back-propagation neural network is utilized as the classifier. However, this method heavily relies on the manual definitions of defect features; thus, it can be subjective and time-consuming. Weimer et al. [33] explored an alternative method that uses deep convolutional neural networks (CNNs). Unlike manual approaches, deep CNNs autonomously generate features by learning from large training datasets. The results of that study are promising, as the use of deep CNNs has been shown to significantly improve defect detection accuracy while keeping false alarm rates low. Sassi et al. [34] proposed a smart system for quality control assessment in industry using deep learning techniques. It successfully detects welding defects on fuel injector assemblies with an accuracy of 97.22% using approximately 7 million trained network parameters and a limited number of images. It demonstrates that deep neural networks can replace conventional human quality inspection efforts. Shin et al. [35] proposed a non-destructive testing method for detecting and predicting porosity defects in real time during the welding of zinc-coated steel. A deep neural network is utilized that can analyze the welding voltage signal without requiring any additional device in the proposed system. A high-strength steel sheet used in the automotive industry was employed to measure the process signals. Feature variables were extracted through preprocessing, and their correlation with weld porosity was analyzed. As detection algorithms become increasingly complex, with new data being continuously accumulated during operations, the computing power requirements of the system are becoming stricter. There is an urgent need for powerful computational frameworks.
To support inspection systems, computing environments are widely developed for customers’ different requirements [36,37,38]. Li et al. [39] developed an automatic pipeline inspection system that can overcome data accessibility and security limitations by deploying a cloud-based framework. The pipeline defect detection cloud system was proposed with role encryption for video collection and access security and a hybrid information method for defect detection. Huang et al. [40] introduced a unified framework for deep neural networks used for image defect detection and classification in edge computing. Transfer learning and data augmentation are included in the framework, resulting in a better accuracy with small samples. When implemented in a rolling manufacturing system, this approach reduced computational time and satisfied real-time requirements. Chu et al. [41] also developed an inspection application in an edge-computing environment. A novel method that combines supervised and unsupervised learning was proposed to reduce the frequency of retraining and optimize the CNN-based detection model. On this basis, few annotated data points and iterations are needed at the edge, which greatly improves the system’s efficiency. Vater et al. [26] combined edge and cloud computing to optimize the manufacturing of electric motors. A cloud–edge architecture was proposed for the efficient processing of vast amounts of data, which highlights its advantage over pure-cloud solutions. Wang et al. [42] also developed an inspection system in a cloud–edge computing environment in which the final model was applied to detect defects at the edge. The Faster Region-based CNN [38] model was selected and trained in the cloud, considering the fact that two-stage methods can yield excellent results there.
CNN-based algorithms are not only suitable for image classification but can also be improved for object detection. In particular, desirable detection results can be gained using the two-stage method, in which a large number of candidate boxes are first obtained and a CNN is then used to evaluate whether each box contains an object. However, most industrial sectors are production-oriented, with the pursuit of higher production efficiency. A faster detection speed may be considered in real-time applications. Additionally, open-loop production, where the image sensors are fixed and the objects to be detected are moved at checkpoints, may be less efficient. Therefore, more effort needs to be devoted to deploying a more efficient automation-assisted inspection system in the production line.

3. Framework of a Robotic-Vision-Based Defect Inspection System for Bracket Weldments

The proposed robotic-vision-based system framework comprises four primary elements: smart terminals, an edge-computing center, a cloud-computing center and a control center. The proposed system framework is presented in Figure 1.
  • Smart terminals: The smart terminals mainly include the hardware for image acquisition (the industrial camera and the light source) and the arc-welding robot. To achieve instant messaging, functional interfaces and multi-protocol technologies are applied for data interaction and transmission, such as MODBUS, WiFi and the 4G edge gateway. In this work, the imaging module is designed as follows: the CMOS industrial camera is chosen, characterized by its image independence from the connecting requirements, and the light-emitting diode (LED) is selected for illumination. To simplify the inspection process, the image acquisition hardware is installed on the end arm of the arc-welding robot. The data source transferred to the edge-computing center usually includes the images of the bracket weldments that need to be inspected, and the bracket weldments come from the arc-welding-robot-based production line.
  • Edge-computing center: The edge-computing center provides on-site services, such as queries of the historical data that are uploaded to the cloud-computing center, ensuring a fast response to operations. The edge devices are mainly composed of the Raspberry-Pi-powered board and the edge gateway. To realize a fast response in defect inspection, the Raspberry-Pi-powered board is equipped with the computational resources of the edge server and connected to the CMOS industrial camera. Firstly, image data are transmitted from the edge gateway to the edge-computing center through the transfer interface and protocol. Then, the images collected through the imaging module are pre-processed in the edge-computing center. Finally, the processed images are uploaded to the cloud-computing center for historical data storage. In this way, the edge-computing center effectively saves time and relieves stress on the cloud, and a fast response can be efficiently achieved.
  • Cloud-computing center: Considering that images are captured from different shooting distances, different detection models may be needed. Therefore, the cloud server is responsible for storing the detection models in addition to the processed images. Furthermore, sufficient computing resources can be provided by the cloud-computing center, which ensures that the detection models can be trained and updated efficiently. In this way, the detection models are trained and updated in the cloud and then distributed to the edge for defect inspection. As a result, defect inspection results are obtained locally, in close proximity to the data source, within a short time. At the same time, the inspection results are transmitted to the control center for further parameter adjustments. Additionally, information from the training process can be saved as images or files and sent to the control center.
  • Control center: The control center mainly provides the management service for the entire inspection process, including the control devices, control strategy and control software. After the devolved model has been deployed in the edge-computing center, the inspection results are shown on the control software’s interface. The control strategy is formed based on the state data collected from the control devices (e.g., the robot controller), which examine welding parameters such as the voltage, current and wire feed speed and determine whether they are within the recommended range; a minimal sketch of such a rule-based check is given after this list. The above control process can be visualized by means of Unity 3D and programmatically adjusted according to the actual industrial situation using C#. In this way, exposure to the large amounts of fumes and toxic gases generated during arc welding can be effectively avoided.
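Although the control software itself is implemented with Unity 3D and C#, the rule-based parameter check described in the control-center element can be sketched in Python as follows. The parameter names, recommended ranges and prompt texts are hypothetical placeholders rather than the values used on the actual production line.

```python
# Illustrative rule-based check of welding parameters against recommended ranges.
# All ranges below are hypothetical placeholders, not production values.
RECOMMENDED_RANGES = {
    "voltage": (18.0, 24.0),          # V
    "current": (150.0, 220.0),        # A
    "wire_feed_speed": (4.0, 8.0),    # m/min
}

def check_welding_parameters(state):
    """Return panel/voice prompts for any parameter outside its recommended range."""
    prompts = []
    for name, (low, high) in RECOMMENDED_RANGES.items():
        value = state.get(name)
        if value is None:
            continue
        if value > high:
            prompts.append(f"{name} too high ({value}); lower it toward {low}-{high}.")
        elif value < low:
            prompts.append(f"{name} too low ({value}); raise it toward {low}-{high}.")
    return prompts

# Example: state data collected from the robot controller.
print(check_welding_parameters({"voltage": 26.5, "wire_feed_speed": 3.2}))
```

In the deployed system, such prompts would correspond to the panel or voice messages described in Section 5.4 (e.g., lowering an excessive voltage to reduce the risk of penetration).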

4. Surface Defect Detection Approach Based on YOLOv5

In this part of the study, the one-stage method that involves evaluating a small number of predefined boxes in cross-scale feature maps was applied due to its fast detection speed. Meanwhile, the edge server could further improve the detection speed. In the proposed system, accuracy is another important evaluation criterion of system performance. In the actual industrial situation, some challenges may increase the difficulty of accurate identification and location. Evolved from you-only-look-once (YOLO)-based object detection algorithms [43,44], the YOLOv5 object detection algorithm [45], one of the most accurate object detection algorithms, was applied and trained on the created datasets. Moreover, the improved module is proposed to further improve the detection precision.

4.1. Data Pre-Processing

Considering that the welding images are collected vertically, a rotation operation can make the final model more robust with respect to direction. Therefore, the rotation operation is introduced, and the function is defined as:
$$\begin{bmatrix} x_i' \\ y_i' \end{bmatrix} = \begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix} \begin{bmatrix} x_i \\ y_i \end{bmatrix} \tag{1}$$
where $(x_i, y_i)$ represents the original input image pixel, $(x_i', y_i')$ represents the output image pixel after rotation, and $\theta$ represents the rotation angle.
In addition, rotating the image causes the defect area to deviate from its original bounding box; one way to eliminate this effect is to rotate the bounding box as well. Therefore, the bounding box parameters in the image annotation files need to be adjusted according to the origin of the coordinates. The rotation function is defined as:
$$x_{\min} = \min(x_1', x_2', x_3', x_4'), \quad y_{\min} = \min(y_1', y_2', y_3', y_4'), \quad x_{\max} = \max(x_1', x_2', x_3', x_4'), \quad y_{\max} = \max(y_1', y_2', y_3', y_4') \tag{2}$$
where $(x_k', y_k')$, $k = 1, \dots, 4$, are the transformed coordinates of the four corner points of the bounding box.
The rotation verification of one image is shown in Figure 2. After the rotation operation, the image is rotated, and the coordinates of the bounding box where the label is stored in XML files are adjusted so that the bounding box always falls on the target.
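As a concrete illustration of Equations (1) and (2), the following is a minimal sketch of the rotation augmentation using OpenCV and NumPy. It rotates about the image centre (a common practical variant of the origin-based form above) and recomputes the axis-aligned bounding box from the four transformed corners; the function name and interface are illustrative rather than the authors’ implementation.

```python
import cv2
import numpy as np

def rotate_image_and_box(image, box, angle_deg):
    """Rotate an image about its centre and recompute the axis-aligned
    bounding box from the four transformed corners (cf. Equations (1)-(2))."""
    h, w = image.shape[:2]
    M = cv2.getRotationMatrix2D((w / 2.0, h / 2.0), angle_deg, 1.0)  # 2x3 affine matrix
    rotated = cv2.warpAffine(image, M, (w, h))

    xmin, ymin, xmax, ymax = box
    corners = np.array([[xmin, ymin, 1.0], [xmax, ymin, 1.0],
                        [xmax, ymax, 1.0], [xmin, ymax, 1.0]])
    transformed = corners @ M.T                                      # four rotated corners
    new_box = (transformed[:, 0].min(), transformed[:, 1].min(),
               transformed[:, 0].max(), transformed[:, 1].max())
    return rotated, new_box
```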

4.2. Surface Defect Detection

Evolved from YOLO-based deep learning methods, the version called YOLOv5 can offer highly accurate detection for objects. In this work, the lightweight YOLOv5s model was used to realize the detection of multi-scale and multi-type steel surface defects (Figure 3). Three primary parts are involved in the structure of the YOLOv5s model: the backbone, the neck and the head. Firstly, the backbone network is a core part of the model used to extract features of the input image based on the Convolutional Block Sequences (CBS) and the Cross-Stage Partial (CSP) structures. The CBS component is composed of Conv (convolution with stride two), BatchNorm (batch normalization) and SiLU (a special Swish activation). The Resx represents multiple bottlenecks of the ResNet neural networks [46]. The CSP structure is similar to the bottleneck structure, but their specific operations on tensors are different. In particular, the shortcut pattern is concatenation in the former case but addition in the latter. Then, multiple-scale features are mainly fused from different feature pyramid levels in the neck network. Finally, the head is responsible for defect detection through the loss function (see Section 4.3 for details).
In practice, one type of defect, the end hole, may be quite small. Image scaling, a machine-vision-based method, is usually adopted to eliminate the effect of multi-scale defects, but size differences are inevitable when images are captured from fixed shooting distances. Another issue is that a defect can be very similar to its background, since class separation is weak in steel bracket weldments. If these issues are not well addressed, it is more difficult to accurately locate and identify defects.
Inspired by space-to-depth convolution (SPD-Conv) [47], the YOLO-SPD-Conv module is proposed to improve the detection precision, wherein a space-to-depth layer and a non-strided convolution layer are added to the CBS and CSP structures. The feature map is divided into smaller partitions, and each partition is processed separately. In this way, more detailed features can be extracted from the input data, leading to improved accuracy in the task of defect recognition. Therefore, seven YOLO-SPD-Conv modules are added to further retain as much discriminant information as possible (Figure 4).
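The following is a minimal PyTorch sketch of a space-to-depth layer followed by a non-strided convolution, in the spirit of SPD-Conv [47]. The class name, channel sizes and kernel settings are illustrative assumptions and do not reproduce the exact YOLO-SPD-Conv configuration used in the network.

```python
import torch
import torch.nn as nn

class SPDConv(nn.Module):
    """Space-to-depth rearrangement followed by a non-strided (stride-1) convolution.

    A 2x2 space-to-depth step halves the spatial resolution by stacking pixels
    into the channel dimension instead of discarding them, and the stride-1
    convolution then mixes the stacked channels.
    """
    def __init__(self, in_channels, out_channels, scale=2):
        super().__init__()
        self.scale = scale
        self.conv = nn.Conv2d(in_channels * scale * scale, out_channels,
                              kernel_size=3, stride=1, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.SiLU()

    def forward(self, x):
        s = self.scale
        # Stack each s x s spatial neighbourhood into the channel dimension.
        x = torch.cat([x[..., i::s, j::s] for i in range(s) for j in range(s)], dim=1)
        return self.act(self.bn(self.conv(x)))

# Example: a 64-channel feature map is downsampled spatially without losing pixels.
y = SPDConv(64, 128)(torch.randn(1, 64, 80, 80))   # -> torch.Size([1, 128, 40, 40])
```

Because the down-sampling rearranges pixels into channels rather than discarding them, small targets such as end holes retain more discriminant information than they would under strided convolution or pooling.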

4.3. Training Process

In this section, several training tricks are introduced, and the training parameters of our model are presented in Algorithm 1. The Warmup and Cosine learning rate strategy is adopted to accelerate training. The multi-scale training strategy is introduced to reduce the memory occupation and computational burden on the GPU. In this case, the training sizes are random values in the range of 0.5 × 3072 to 1.5 × 2048 pixels; since the network down-samples by a factor of up to 32, all sizes are constrained to integer multiples of 32.
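As an illustration of the Warmup and Cosine learning rate strategy, a minimal sketch is given below; the base learning rate, warmup length and minimum learning rate are illustrative values, not the ones used in training.

```python
import math

def warmup_cosine_lr(step, total_steps, base_lr=0.01, warmup_steps=500, min_lr=1e-4):
    """Linear warmup to base_lr, then cosine decay to min_lr (illustrative values)."""
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

# Example: inspect the schedule at a few points of a 10,000-step run.
print([round(warmup_cosine_lr(s, 10_000), 5) for s in (0, 250, 500, 5_000, 9_999)])
```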
In addition, the mean squared error is used to obtain a high IoU during the process of bounding box regression. The Adam algorithm is applied as an optimizer to achieve gradient normalization as well as loss minimization during backpropagation. The loss function is mainly composed of three parts, as shown in Equation (3). Binary cross entropy is adopted as the loss equation for classification (Lcls) and the object (Lobj) in Equation (4). CIoU loss is defined as shown in Equation (5) and used for the location (Lloc):
$$Loss = \lambda_1 L_{cls} + \lambda_2 L_{obj} + \lambda_3 L_{loc} \tag{3}$$
where $\lambda_1$, $\lambda_2$ and $\lambda_3$ represent equilibrium coefficients.
$$L_{BCE} = -\left( y \log p(x) + (1 - y) \log \left( 1 - p(x) \right) \right) \tag{4}$$
where $p(x)$ represents the output of the model, and $y$ is the real label.
$$L_{CIoU} = 1 - IoU + \frac{\rho^2(b, b^{gt})}{c^2} + \alpha \upsilon \tag{5}$$
where $IoU$ is the intersection-over-union, $\rho^2(b, b^{gt})$ is the squared Euclidean distance between the center points of the prediction box and the real box, $c$ is the diagonal length of the minimum enclosing area that contains both boxes, $\upsilon$ measures the consistency of their aspect ratios, and $\alpha$ is a positive trade-off coefficient.
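A minimal PyTorch sketch of the composite loss in Equations (3)–(5) is given below, assuming boxes in (x1, y1, x2, y2) form. The CIoU term follows its standard definition, and the equilibrium coefficients shown are illustrative placeholders rather than the values used in training.

```python
import math
import torch
import torch.nn.functional as F

def ciou_loss(pred, target, eps=1e-7):
    """CIoU loss for boxes given as (x1, y1, x2, y2) tensors of shape (N, 4)."""
    ix1, iy1 = torch.max(pred[:, 0], target[:, 0]), torch.max(pred[:, 1], target[:, 1])
    ix2, iy2 = torch.min(pred[:, 2], target[:, 2]), torch.min(pred[:, 3], target[:, 3])
    inter = (ix2 - ix1).clamp(0) * (iy2 - iy1).clamp(0)
    area_p = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_t = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    iou = inter / (area_p + area_t - inter + eps)

    # rho^2 / c^2 term in Eq. (5): squared centre distance over the squared
    # diagonal of the smallest box enclosing both prediction and ground truth.
    cpx = (pred[:, 0] + pred[:, 2]) / 2 - (target[:, 0] + target[:, 2]) / 2
    cpy = (pred[:, 1] + pred[:, 3]) / 2 - (target[:, 1] + target[:, 3]) / 2
    cw = torch.max(pred[:, 2], target[:, 2]) - torch.min(pred[:, 0], target[:, 0])
    ch = torch.max(pred[:, 3], target[:, 3]) - torch.min(pred[:, 1], target[:, 1])
    rho2_over_c2 = (cpx ** 2 + cpy ** 2) / (cw ** 2 + ch ** 2 + eps)

    # Aspect-ratio consistency term alpha * v.
    v = (4 / math.pi ** 2) * (
        torch.atan((target[:, 2] - target[:, 0]) / (target[:, 3] - target[:, 1] + eps))
        - torch.atan((pred[:, 2] - pred[:, 0]) / (pred[:, 3] - pred[:, 1] + eps))) ** 2
    alpha = v / (1 - iou + v + eps)
    return (1 - iou + rho2_over_c2 + alpha * v).mean()

def total_loss(cls_logits, cls_targets, obj_logits, obj_targets, boxes, gt_boxes,
               lambdas=(0.5, 1.0, 0.05)):   # illustrative equilibrium coefficients
    l_cls = F.binary_cross_entropy_with_logits(cls_logits, cls_targets)   # Eq. (4)
    l_obj = F.binary_cross_entropy_with_logits(obj_logits, obj_targets)   # Eq. (4)
    l_loc = ciou_loss(boxes, gt_boxes)                                     # Eq. (5)
    return lambdas[0] * l_cls + lambdas[1] * l_obj + lambdas[2] * l_loc    # Eq. (3)
```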
Algorithm 1 Training process of the proposed approach
Step 1: Import the necessary libraries and tools for image processing, object detection and deep learning, such as the Warmup and Cosine learning rate strategy and multi-scale training strategy;
Step 2: Define the object detection parameters, such as the confidence threshold, non-maximum suppression (NMS) threshold, input size, etc.;
Step 3: Load the dataset (Section 5) for training with labeled images and annotations;
Step 4: Split the dataset into training and validation sets at 85% and 15%, respectively;
Step 5: Use the multi-scale training strategy to process the dataset by resizing the images to the input size of the model, scaling and normalizing the pixel values;
Step 6: Train the proposed YOLOv5s object detection model (Section 4) on the dataset, using the Adam optimizer and loss function;
Step 7: Use the Warmup and Cosine learning rate strategy to accelerate training;
Step 8: Monitor the training progress by evaluating the loss and mAP score on the validation set;
Step 9: Fine-tune the model by adjusting the learning rate and other hyperparameters;
Step 10: Once the model is trained and optimized, use it to perform object detection for the new images of the bracket weldments;
Step 11: Apply NMS to remove overlapping object detections and refine the output;
Step 12: Display the final detection results for the original images, including the bounding boxes, class labels and confidence scores.
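To illustrate Step 11, the following is a minimal sketch of greedy non-maximum suppression; the IoU threshold is an illustrative default rather than the value used in the deployed system.

```python
import torch

def nms(boxes, scores, iou_threshold=0.45):
    """Greedy non-maximum suppression; boxes are (N, 4) tensors in (x1, y1, x2, y2) form."""
    order = scores.argsort(descending=True)
    keep = []
    while order.numel() > 0:
        best = order[0].item()
        keep.append(best)
        if order.numel() == 1:
            break
        rest = order[1:]
        # IoU of the highest-scoring box with all remaining boxes.
        x1 = torch.max(boxes[best, 0], boxes[rest, 0])
        y1 = torch.max(boxes[best, 1], boxes[rest, 1])
        x2 = torch.min(boxes[best, 2], boxes[rest, 2])
        y2 = torch.min(boxes[best, 3], boxes[rest, 3])
        inter = (x2 - x1).clamp(0) * (y2 - y1).clamp(0)
        area_best = (boxes[best, 2] - boxes[best, 0]) * (boxes[best, 3] - boxes[best, 1])
        area_rest = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_best + area_rest - inter)
        order = rest[iou <= iou_threshold]   # drop detections that overlap the kept box
    return keep
```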

4.4. Evaluation Metrics

To better evaluate the algorithm, several indexes are analyzed as follows. First, the basic index, the intersection-over-union (IoU), is calculated to assess the quality of the bounding box. Second, the mean average precision (mAP) is used to comprehensively evaluate the detection accuracy. Finally, frames per second (FPS) is calculated to evaluate the model’s inference speed:
$$IoU = \frac{\left| B_{predict} \cap B_{truth} \right|}{\left| B_{predict} \cup B_{truth} \right|} \tag{6}$$
where $B_{predict}$ denotes the predicted bounding box and $B_{truth}$ represents the ground-truth bounding box. The predicted box is more exact when the IoU value is higher. A predicted object that does not match any box in the ground-truth set is marked as a false positive:
$$mAP = \frac{\sum_{i=1}^{n} AP_i}{n} \tag{7}$$
where $n$ is the number of categories that need to be detected, and $AP_i$ is the average precision for category $i$.
$$FPS = \frac{framNum}{framTime} \tag{8}$$
where $framNum$ is the number of frames captured within a set time period, and $framTime$ is the duration of that time period in seconds.
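The following is a small sketch of how Equations (6)–(8) can be computed; the per-class average precision values in the example are illustrative numbers only, and the full AP computation over precision–recall curves is omitted for brevity.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two boxes in (x1, y1, x2, y2) form (Eq. (6))."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def mean_average_precision(ap_per_class):
    """Eq. (7): mean of the per-class average precisions (assumed precomputed)."""
    return sum(ap_per_class) / len(ap_per_class)

def frames_per_second(frame_count, elapsed_seconds):
    """Eq. (8): frames processed within a measured time window."""
    return frame_count / elapsed_seconds

# Example: seven defect classes (illustrative AP values), 200 frames processed in 9.5 s.
print(mean_average_precision([0.99, 0.98, 0.99, 0.97, 0.99, 0.98, 0.99]),
      frames_per_second(200, 9.5))
```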

5. Experimental Results and Analysis

As described in this section, some raw data on bracket weldments were gathered and tested on the edge through experiments. The Raspberry-Pi-powered board is equipped with an Intel i7-10750 2.60 GHz processor and an NVIDIA GeForce GTX 1660 Ti GPU (NVIDIA, Santa Clara, CA, USA) for parallel computation. The experiments were implemented in the Python 3.8 environment and run on Ubuntu 16.04 LTS. The cloud–edge coordination inspection and control software was developed combining IoT and virtual simulation technologies. When analyzing the industrial scenario, the workstation, with an Intel i7-7700 3.60 GHz processor and an NVIDIA GeForce RTX 2080 Ti GPU, serves as the cloud platform. The 128 GB memory of the workstation satisfies the requirements for the storage of image data and model training.

5.1. Dataset Creation

The bracket plays an important role in supporting the automobile body. If there are surface defects in the weld seams on the robot-joined bracket, the normal driving of the automobile will be influenced. Therefore, bracket weldments were chosen in this work to verify the effectiveness of the proposed approach. The hardware setup for dataset creation is presented in Figure 5. The robot controller was connected to the Eston arc-welding robot, on which a 600-million-pixel CMOS industrial camera was installed for image capture, and the light controller was used to control the on–off functions of the P-RLG-80-90-WS LED light source. Considering that the material of the steel bracket weldments can lead to an exposure problem when capturing images, the camera and light source were equipped with polarizers. Furthermore, a dome was designed to protect the image acquisition hardware from strong welding sparks and large-area spatters.
On this basis, the dataset was created; some high-resolution images, in which the structure of the weld seam can be observed in the fusion zone, are presented in Figure 6. Each image reveals the property of a weld seam obtained at a particular arc energy level. Seven types of weld seams that are commonly encountered in working conditions can be observed: normal, penetration, welding offset, missing solder, not beautiful, end hole and undercut seams.
Additionally, the sample size was further expanded so that each sample was rotated eight times, by 45 degrees each time, generating eight rotated images from the original one. The percentages of training and validation sets were 85% and 15%, respectively. The numbers of samples are shown in Table 1.
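A minimal sketch of this augmentation-and-split procedure is given below, reusing a rotation helper of the form sketched in Section 4.1; the function names and the fixed random seed are illustrative assumptions.

```python
import random

def expand_and_split(samples, rotate_fn, train_ratio=0.85, seed=0):
    """Generate eight 45-degree rotations per (image, box) sample, then split 85%/15%."""
    expanded = []
    for image, box in samples:
        for k in range(1, 9):                      # 45, 90, ..., 360 degrees
            expanded.append(rotate_fn(image, box, 45 * k))
    random.Random(seed).shuffle(expanded)
    cut = int(len(expanded) * train_ratio)
    return expanded[:cut], expanded[cut:]          # training set, validation set
```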

5.2. Edge Experiment and Result Analysis

Two experiments were conducted to illustrate the performance of the proposed method in terms of detection accuracy and computational speed, respectively.

5.2.1. Detection Result Analysis

Previous machine-vision-based methods are not appropriate in this case, so deep-learning-based baselines were chosen to widely verify the applicability of the proposed approach: the typical two-stage method, Faster R-CNN [48], and the one-stage method, Single-Shot Multi-Box Detector (SSD) [49], are adopted in this paper. ResNet-based networks are among the most commonly used convolutional networks and can serve as backbone networks in the Faster R-CNN and SSD object detection algorithms. Accordingly, Faster R-CNN+ResNet-50 and SSD+ResNet-50 are compared with YOLOv5s and the proposed detection model in terms of detection precision. The experimental results for the mean average precision (mAP@0.5) are shown in Table 2. The mAP@0.5, a commonly used measurement, considers the precision and recall when the IoU is more than 50%. The mAP of the proposed approach is improved by at least 5.0%. The training parameters were set as shown in Table 3.
Some of the identified inspection results are presented in Figure 7, Figure 8 and Figure 9. Theoretically, Faster R-CNN+ResNet-50 is better than the one-stage methods in terms of its detection effect. However, its performance may vary depending on the specific dataset and detection task. In this work, the end hole defect is small, while the other defects are similar to the background, which makes it difficult to distinguish them from nuts, dirt or other objects (Figure 7a). As a result, the defects were detected with low accuracy (Figure 7b). Compared with Faster R-CNN+ResNet-50, the detection performance of SSD+ResNet-50 improved slightly, but there are still some problems, such as false detections and missing detections. The oval hole was incorrectly detected as a penetration defect (Figure 8a), the normal weld seam was mistakenly identified as a missing solder defect (Figure 8b), and the detection of the welding offset was missed (Figure 8c). To solve these problems, our approach applies parallel convolutions to extract features on different scales, which effectively avoids the loss of local information and improves the accuracy of defect detection. To avoid the situation in which the candidate boxes are either too large to fit the defect closely or too small, causing the overlapping problem, appropriate aspect ratios and scales are obtained using the Adam algorithm, yielding higher-quality candidate boxes and more accurate defect localization (Figure 9).

5.2.2. Detection Speed

As presented in Table 4 and Figure 10, the proposed model achieves the fastest inference among all the compared models, with an inference time of only 47 ms per image (21 FPS), followed by YOLOv5s and SSD. Combining the detection result analysis with the models’ inference speeds, the one-stage methods clearly have the potential to be applied in this industrial scenario, and the proposed model achieves better detection results on the created dataset. As a result, our proposed approach is a suitable choice for real-time applications.

5.3. Ablation Study

Ablation experiments were conducted on the created dataset. The original YOLOv5s model was trained as the benchmark, and the YOLO-SPD-Conv module was then added to verify the detection performance. Some of the identified results, before and after adding the module, are shown in Figure 11. They demonstrate that the module, by retaining fine-grained spatial information instead of discarding it during down-sampling, enhances the model’s ability to capture more context and information from the image of the bracket weldment.
At the same time, the training loss and mAP@50 were calculated to further prove that the utilization of the YOLO-SPD-Conv module can improve the overall inspection performance. As shown in Figure 12, the training loss after adding the YOLO-SPD-Conv module is, overall, lower than that without the module, indicating better model performance. As the number of epochs increases, our approach’s mAP also increases until approximately epoch 90, after which it remains stable above 95%. In conclusion, it is evident that the YOLO-SPD-Conv module can improve the model’s sensitivity and specificity and enhance the detection accuracy.

5.4. Cloud–Edge Coordination Inspection and Control Software

The following information can be seen in the software: the model training information in the cloud, the detection information at the edge and the virtual and physical robot welding parameter information on-site. The operation process of the cloud–edge collaboration inspection and control software is shown in Figure 13.
The cloud-computing center mainly trains and updates the model using the uploaded images and then devolves it to the edge. The edge-computing center mainly carries out real-time defect inspection using the devolved model. At the same time, the control center provides control strategies (e.g., the panel or voice prompts) so that the operator can make parameter adjustments to the virtual arc-welding robot. The control strategies are set in advance: for example, if the voltage is too high, panel or voice prompts are given to indicate that it should be lowered so as to reduce the risk of “penetration”. If the wire feed speed is too low, it should be increased to prevent “undercut”. Finally, the welding parameters of the physical robot can be adjusted by combining virtual robot operation and expert experience. In this way, the software achieves closed-loop management, which improves production efficiency.
A flow chart for the program is presented in Figure 14. Firstly, the cloud–edge coordination inspection and control software needs to be initialized. Then, data acquisition is implemented, through which real-time data from the sensors (cameras, voltage sensors, wire feed speed sensors) attached to the welding equipment are obtained. To transmit the collected image data for edge computing, a GigE-based transmission standard is adopted to realize image signal transmission. The edge-computing stage is responsible for preprocessing and analyzing the captured data locally on the edge devices. The analyzed welding defect data and control commands are sent to the cloud for further processing through the standard HTTP protocol. In addition, to protect the integrity and privacy of the image data, control commands and model updates, the WiFi module, one of the network connectivity solutions, is employed to establish encrypted connections between the edge devices and the cloud. After that, control strategies and decision making are executed to notify the operator about the detected defects, suggest corrective actions and trigger control commands to optimize the welding process in real time. Data visualization is performed to generate visualizations and prompts for monitoring and optimization. Finally, the program ends or loops back to the data acquisition stage for continuous monitoring and analysis.
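A condensed sketch of the edge-side part of this loop (acquisition, local inference with the devolved model, and reporting to the cloud) is given below. The cloud endpoint URL, the weight file name and the frame-capture callback are hypothetical placeholders, and the transport details (GigE acquisition, encrypted WiFi connection) are abstracted away.

```python
import time
import requests
import torch

CLOUD_ENDPOINT = "https://cloud.example.com/api/inspection"   # hypothetical endpoint

# Load the devolved YOLOv5 model with custom weights (file name is a placeholder).
model = torch.hub.load("ultralytics/yolov5", "custom", path="weldment_best.pt")

def inspection_loop(capture_frame, interval_s=1.0):
    """Acquire a frame, run defect detection at the edge, and report to the cloud."""
    while True:
        frame = capture_frame()                     # image provided by the camera driver
        results = model(frame)                      # edge inference with the devolved model
        detections = results.pandas().xyxy[0].to_dict(orient="records")
        payload = {"timestamp": time.time(), "detections": detections}
        try:
            requests.post(CLOUD_ENDPOINT, json=payload, timeout=2.0)
        except requests.RequestException:
            pass                                    # keep inspecting even if the cloud is unreachable
        time.sleep(interval_s)
```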

6. Conclusions

With the increasing demand for reliable and effective defect inspection methods in various industrial sectors, which increasingly embrace advanced information technologies, a robotic-vision-based defect inspection system for bracket weldments was proposed for a cloud–edge coordination environment. Instead of public datasets, accurate and accessible datasets were built in this work. On this basis, the YOLOv5 object detection algorithm was used to inspect multi-scale defects, with a YOLO-SPD-Conv module added to further retain discriminant information on small defects. Furthermore, control software for defect inspection was developed for the arc-welding-robot-based production line. By enabling feedback and corrections during the manufacturing process, it helps to improve overall manufacturing efficiency. As a result, the proposed system’s cost-effectiveness and reduced inspection time make it an attractive option for manufacturers and industries requiring efficient quality control solutions. In this work, however, we mainly focused on the defect inspection and classification of brackets in automobiles. More types of datasets should be gathered and tested in the future.

Author Contributions

Conceptualization, H.L.; methodology, Y.L. and Y.Z.; software, X.W.; validation, G.L. and X.Y.; resources, H.W.; writing—original draft preparation, X.W.; writing—review and editing, Y.L.; project administration, H.L. and Z.Z.; funding acquisition, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (No. 52175256).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are grateful to the anonymous reviewers for their comments, which helped to improve this paper.

Conflicts of Interest

The authors declare no potential conflict of interest with respect to the research, authorship, and/or publication of this article.

References

  1. Kaleem, M.A.; Khan, M. Significance of Additive Manufacturing for Industry 4.0 with Introduction of Artificial Intelligence in Additive Manufacturing Regimes. In Proceedings of the 2020 17th International Bhurban Conference on Sciences and Technology, Islamabad, Pakistan, 14–18 January 2020; pp. 152–156. [Google Scholar]
  2. Manimuthu, A.; Venkatesh, V.G.; Shi, Y.Y.; Sreedharan, V.R.; Koh, S.C.L. Design and development of automobile assembly model using federated artificial intelligence with smart contract. Int. J. Production Res. 2022, 60, 111–135. [Google Scholar] [CrossRef]
  3. Awasthi, A.; Saxena, K.K.; Arun, V. Sustainable and smart metal forming manufacturing process. Mater. Today-Proc. 2021, 44, 2069–2079. [Google Scholar] [CrossRef]
  4. Tang, H.; Li, D.; Wan, J.; Imran, M.; Shoaib, A. A reconfigurable method for intelligent manufacturing based on industrial cloud and edge intelligence. IEEE Internet Things J. 2020, 7, 4248–4259. [Google Scholar] [CrossRef]
  5. Tripathi, V.; Chattopadhyaya, S.; Mukhopadhyay, A.K.; Sharma, S.; Li, C.H.; Di Bona, G. A Sustainable Methodology Using Lean and Smart Manufacturing for the Cleaner Production of Shop Floor Management in Industry 4.0. Mathematics 2022, 10, 347. [Google Scholar] [CrossRef]
  6. Cho, C.S.; Chung, B.M.; Park, M.J. Development of real-time vision-based fabric inspection system. IEEE Trans. Ind. Electron. 2005, 52, 1073–1079. [Google Scholar] [CrossRef]
  7. Li, D.; Xie, Q.; Gong, X.; Yu, Z.; Xu, J.; Sun, Y.; Wang, J. Automatic defect detection of metro tunnel surfaces using a vision-based inspection system. Adv. Eng. Inform. 2021, 47, 101206. [Google Scholar] [CrossRef]
  8. Kumar, A. Computer-vision-based fabric defect detection: A survey. IEEE Trans. Ind. Electron. 2008, 55, 348–363. [Google Scholar] [CrossRef]
  9. Gao, Y.; Li, X.; Wang, X.V.; Wang, L.; Gao, L. A review on recent advances in vision-based defect recognition towards industrial intelligence. J. Manuf. Syst. 2022, 62, 753–766. [Google Scholar] [CrossRef]
  10. Tian, Y.H. Artificial Intelligence Image Recognition Method Based on Convolutional Neural Network Algorithm. IEEE Access 2020, 8, 125731–125744. [Google Scholar] [CrossRef]
  11. Sun, Y.A.; Xue, B.; Zhang, M.J.; Yen, G.G. Evolving Deep Convolutional Neural Networks for Image Classification. IEEE Trans. Evol. Comput. 2020, 24, 394–407. [Google Scholar] [CrossRef] [Green Version]
  12. Dong, Y.N.; Liu, Q.W.; Du, B.; Zhang, L.P. Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification. IEEE Trans. Image Process. 2022, 31, 1559–1572. [Google Scholar] [CrossRef] [PubMed]
  13. Bai, L.; Cui, L.X.; Jiao, Y.H.; Rossi, L.; Hancock, E.R. Learning Backtrackless Aligned-Spatial Graph Convolutional Networks for Graph Classification. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 783–798. [Google Scholar] [CrossRef]
  14. Li, X.; Ding, M.L.; Pizurica, A. Deep Feature Fusion via Two-Stream Convolutional Neural Network for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2020, 58, 2615–2629. [Google Scholar] [CrossRef] [Green Version]
  15. Hou, Q.B.; Cheng, M.M.; Hu, X.W.; Borji, A.; Tu, Z.W.; Torr, P.H.S. Deeply Supervised Salient Object Detection with Short Connections. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 815–828. [Google Scholar] [CrossRef] [Green Version]
  16. Liu, Y.; Sun, P.; Wergeles, N.; Shang, Y. A survey and performance evaluation of deep learning methods for small object detection. Expert Syst. Appl. 2021, 172, 114602. [Google Scholar] [CrossRef]
  17. Satpathy, S.; Sahoo, B.; Turuk, A.K. Sensing and actuation as a service delivery model in cloud edge centric Internet of Things. Future Gener. Comput. Syst. 2018, 86, 281–296. [Google Scholar] [CrossRef]
  18. Zhao, Z.Q.; Zheng, P.; Xu, S.T.; Wu, X.D. Object Detection With Deep Learning: A Review. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 3212–3232. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Ren, Y.P.; Huang, J.S.; Hong, Z.Y.; Lu, W.; Yin, J.; Zou, L.J.; Shen, X.H. Image-based concrete crack detection in tunnels using deep fully convolutional networks. Constr. Build. Mater. 2020, 234, 117367. [Google Scholar] [CrossRef]
  20. Shelhamer, E.; Long, J.; Darrell, T. Fully Convolutional Networks for Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 640–651. [Google Scholar] [CrossRef]
  21. Dung, C.V.; Anh, L.D. Autonomous concrete crack detection using deep fully convolutional neural network. Autom. Constr. 2019, 99, 52–58. [Google Scholar] [CrossRef]
  22. Zhou, Z.; Chen, X.; Li, E.; Zeng, L.K.; Luo, K.; Zhang, J.S. Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing. Proc. IEEE 2019, 107, 1738–1762. [Google Scholar] [CrossRef] [Green Version]
  23. Zhou, X.K.; Xu, X.S.; Liang, W.; Zeng, Z.; Yan, Z. Deep-Learning-Enhanced Multitarget Detection for End-Edge-Cloud Surveillance in Smart IoT. IEEE Internet Things J. 2021, 8, 12588–12596. [Google Scholar] [CrossRef]
  24. Shao, Z.F.; Pan, Y.; Diao, C.Y.; Cai, J.J. Cloud Detection in Remote Sensing Images Based on Multiscale Features-Convolutional Neural Network. IEEE Trans. Geosci. Remote Sens. 2019, 57, 4062–4076. [Google Scholar] [CrossRef]
  25. Yang, P.C.; Hu, D.; Wang, C.Y.; Zhang, Y.X.; You, D.Y.; Gao, X.D.; Zhang, N.F. Weld Surface Imperfection Detection by 3D Reconstruction of Laser Displacement Sensing. In Proceedings of the 2020 5th International Conference on Mechanical, Control and Computer Engineering, Harbin, China, 25–27 December 2020; pp. 2102–2105. [Google Scholar]
  26. Vater, J.; Schlaak, P.; Knoll, A. A Modular Edge-/Cloud-Solution for Automated Error Detection of Industrial Hairpin Weldings using Convolutional Neural Networks. In Proceedings of the 2020 IEEE 44th Annual Computers, Software, and Applications Conference, Madrid, Spain, 13–17 July 2020; pp. 505–510. [Google Scholar]
  27. Hu, Q.M.; Hao, K.R.; Wei, B.; Li, H.J. An efficient solder joint defects method for 3D point clouds with double-flow region attention network. Adv. Eng. Inform. 2022, 52, 101608. [Google Scholar] [CrossRef]
  28. Vater, J.; Kirschning, M.; Knoll, A. Closing the loop: Real-time Error Detection and Correction in automotive production using Edge-/Cloud-Architecture and a CNN. In Proceedings of the 2020 International Conference on Omni-Layer Intelligent Systems, Barcelona, Spain, 31 August–2 September 2020; pp. 42–48. [Google Scholar]
  29. Lee, H.; Sung, K.; Park, H.; Rhee, S.H. Measurement of Weld Bead Defect for Shock Absorber using Laser Vision Sensor. Key Eng. Mater. 2004, 270–273, 2332–2337. [Google Scholar] [CrossRef]
  30. Nguyen, H.C.; Lee, B.R. Laser Vision-based Quality Inspection System for Small Bead Laser Welding. Int. J. Precis. Eng. Manuf. 2014, 15, 415–423. [Google Scholar] [CrossRef]
  31. Chu, H.H.; Wang, Z.Y. A vision-based system for post-welding quality measurement and defect detection. Int. J. Adv. Manuf. Technol. 2016, 86, 3007–3014. [Google Scholar] [CrossRef]
  32. Senthil Kumar, G.; Natarajan, U.; Ananthan, S. Vision inspection system for the identification and classification of defects in MIG welding joints. Int. J. Adv. Manuf. Technol. 2012, 61, 923–933. [Google Scholar] [CrossRef] [Green Version]
  33. Weimer, D.; Scholz-Reiter, B.; Shpitalni, M. Design of deep convolutional neural network architectures for automated feature extraction in industrial inspection. CIRP Ann. 2016, 65, 417–420. [Google Scholar] [CrossRef]
  34. Sassi, P.; Tripicchio, P.; Avizzano, C.A. A smart monitoring system for automatic welding defect detection. IEEE Trans. Ind. Electron. 2019, 66, 9641–9650. [Google Scholar] [CrossRef]
  35. Shin, S.; Jin, C.; Yu, J.; Rhee, S. Real-time detection of weld defects for automated welding process base on deep neural network. Metals 2020, 10, 389. [Google Scholar] [CrossRef] [Green Version]
  36. Du, Y.; Pan, N.; Xu, Z.; Deng, F.; Shen, Y.; Kang, H. Pavement distress detection and classification based on YOLO network. Int. J. Pavement Eng. 2021, 22, 1659–1672. [Google Scholar] [CrossRef]
  37. Lijian, Y.; Zhujun, W.; Songwei, G. Pipeline magnetic flux leakage image detection algorithm based on multiscale SSD network. IEEE Trans. Ind. Inf. 2020, 16, 501–509. [Google Scholar]
  38. Bai, T.; Yang, J.; Xu, G.; Yao, D. An optimized railway fastener detection method based on modified faster R-CNN. Measurement 2021, 182, 109742. [Google Scholar] [CrossRef]
  39. Li, C.; Shang, X.; Zhang, L.; Yang, F.; Zheng, J.; Xu, X. Pipeline Defect Detection Cloud System Using Role Encryption and Hybrid Information. Comput. Mater. Contin. 2019, 61, 1245–1260. [Google Scholar] [CrossRef]
  40. Huang, J.; Sergin, N.; Dua, A.; Tavakoli, E.B.; Yan, H.; Ren, F.; Ju, F. Edge computing accelerated defect classification based on deep convolutional neural network with application in rolling image inspection. In International Manufacturing Science and Engineering Conference; American Society of Mechanical Engineers: New York, NY, USA, 2020; Volume 84263, p. V002T07A037. [Google Scholar]
  41. Chu, Y.; Feng, D.; Liu, Z.; Zhao, Z.; Wang, Z.; Xia, X.; Quek, T. Hybrid-Learning-Based Operational Visual Quality Inspection for Edge-Computing-Enabled IoT System. IEEE Internet Things J. 2022, 9, 4958–4972. [Google Scholar] [CrossRef]
  42. Wang, Y.; Liu, M.; Zheng, P.; Yang, H.; Zou, J. A smart surface inspection system using faster R-CNN in cloud-edge computing environment. Adv. Eng. Inform. 2020, 43, 101037. [Google Scholar] [CrossRef]
  43. Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
  44. Redmon, J.; Farhadi, A. YOLO 9000: Better, Faster, Stronger. In Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 6517–6525. [Google Scholar]
  45. Jocher, G.; Nishimura, K.; Mineeva, T.; Vilariño, R. YOLOv5. 2021. Available online: https://github.com/ultralytics/yolov5 (accessed on 1 September 2021).
  46. Gao, S.H.; Cheng, M.M.; Zhao, K.; Zhang, X.Y.; Yang, M.H.; Torr, P. Res2Net: A New Multi-Scale Backbone Architecture. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 652–662. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Raja, S.; Tie, L. No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects. In Proceedings of the European Conference, ECML PKDD 2022, Grenoble, France, 19–23 September 2022; pp. 443–459. [Google Scholar]
  48. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [Green Version]
  49. Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.-Y.; Berg, A.C. Ssd: Single shot multibox detector. In Proceedings of the Computer Vision–ECCV 2016, 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Part I 14. Springer International Publishing: Cham, Switzerland, 2016; pp. 21–37. [Google Scholar]
Figure 1. The framework of a robotic-vision-based defect inspection system for bracket weldments.
Figure 2. Rotation verification (the left image is without image rotation, the right is the rotated image, and the red box is the defect location).
Figure 3. Structure of the YOLOv5s model.
Figure 4. YOLO-SPD-Conv modules in the CBS and CSP structures.
Figure 5. Hardware setup for smart terminals and control devices.
Figure 6. Seven types of weld seams.
Figure 7. Results of Faster R-CNN+ResNet-50 (the ground truth is shown in red).
Figure 8. Results of SSD+ResNet-50 (the ground truth is shown in red).
Figure 9. Results of the proposed approach (the ground truth is shown in red).
Figure 10. Comparison of inference time (ms) and inference speed (FPS).
Figure 11. Results identified before (the first row) and after adding the module (the second row).
Figure 12. The mAP@50 (left) and training loss (right) during 100 training epochs.
Figure 13. Operation process of the cloud–edge collaboration inspection and control software.
Figure 14. A flow chart for the program.
Table 1. The numbers of samples.
Defect Type    | Number
Normal         | 1272
Undercut       | 1296
Welding offset | 1272
Penetration    | 1272
Not beautiful  | 1272
End hole       | 1248
Missing solder | 1272
Table 2. Experimental results of different models.
Models                | Feature Extraction Network          | mAP (@0.5)
Faster RCNN+ResNet-50 | ResNet-50                           | 86.40%
SSD+ResNet-50         | ResNet-50                           | 87.40%
YOLOv5s               | CSPNet                              | 90.40%
YOLOv5s               | CSPNet + YOLO-SPD-Conv (proposed)   | 98.60%
Table 3. Setting of training parameters.
Models           | Faster RCNN + ResNet-50 | SSD + ResNet-50 | YOLOv5s   | Proposed Method
Image size       | 640 × 640               | 640 × 640       | 640 × 640 | 640 × 640
Learning rate    | 0.001                   | 0.001           | 0.01      | 0.01
Optimizer        | Adam                    | Adam            | Adam      | Adam
Weight decay     | 0.0001                  | 0.0001          | 0.0005    | 0.0005
Batch size       | 8                       | 16              | 16        | 16
Number of epochs | 100                     | 100             | 100       | 100
Table 4. Comparison of inference time (ms) and inference speed (FPS).
Models            | Inference Time (ms) | Inference Speed (FPS)
Faster RCNN       | 118                 | 8
SSD               | 68                  | 15
YOLOv5s           | 57                  | 18
Proposed approach | 47                  | 21