Article

An Operational Image-Based Digital Twin for Large-Scale Structures

Hans-Henrik Benzon, Xiao Chen, Lewis Belcher, Oscar Castro, Kim Branner and Jesper Smit

1 Department of Wind Energy, Technical University of Denmark, Frederiksborgvej 399, 4000 Roskilde, Denmark
2 Desupervised ApS, Njalsgade 76, 3, 2300 Copenhagen, Denmark
3 Quali Drone ApS, Mariagervej 3, 9560 Hadsund, Denmark
* Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2022, 12(7), 3216; https://doi.org/10.3390/app12073216
Submission received: 22 February 2022 / Revised: 15 March 2022 / Accepted: 20 March 2022 / Published: 22 March 2022

Abstract

This study presents a novel methodology to create an operational Digital Twin for large-scale structures based on drone inspection images. The Digital Twin is primarily used as a virtualized representation of the structure, which will be updated according to physical changes during the life cycle of the structure. The methodology is demonstrated on a wind turbine transition piece. A three-dimensional geometry reconstruction of a transition piece as manufactured is created using a large number (>500) of RGB images collected from a drone and/or several LiDAR scans. Comparing the reconstruction to the original design will locate and quantify geometric deviations and production tolerances. An artificial intelligence algorithm is used to detect and classify paint defects/damages from images. The detected and classified paint defects/damages are subsequently digitalized and mapped to the three-dimensional geometric reconstruction of the structure. These developed functionalities allow the Digital Twin of the structure to be updated with manufacturing-induced geometric deviations and paint defects/damages using inspection images at regular time intervals. The key enabling technologies to realize the Digital Twin are presented in this study. The proposed methodology can be used in different industrial sectors, such as wind energy, oil and gas, aerospace, the marine and transport sector, and other large infrastructure.

1. Introduction

Digitalization is one of the priorities in transforming industrial research and innovation [1]. In addition, the offshore industry can be significantly transformed by embracing advanced digital technologies such as the Internet of Things, Big Data, and Cloud Computing, together with sensor technologies and high-fidelity structural modeling techniques. All of these technologies come together in what is broadly described as a Digital Twin. In this paper, the development of Digital Twin technologies for large-scale offshore structures is explored.
The term “Digital Twin” has many different, and rather broad, definitions in both academia and industry. A state-of-the-art and state-of-the-use review provides more than 400 definitions of the Digital Twin [2]. In this work, however, we adopt the definition of Bolton et al. [3], that a Digital Twin is a “dynamic virtual representation of a physical object or system across its lifecycle, using real-time data to enable understanding, learning and reasoning.”
The vision for using a Digital Twin for smart, efficient, and reliable large-scale offshore structures is further elaborated in [4]. A Digital Twin can be connected to the real structure through a live feed of data streamed from embedded sensors, meteorology data, and, most importantly, from structural health monitoring and inspection systems. Ideally, therefore, a Digital Twin can be used to make predictions of the development of damage, based on the current damage state that is recorded and updated in a structural health journal maintained for each structural component. Based on physical models, the Digital Twin can then be used to simulate the effect on the structure for different damage modes and different operational scenarios, e.g., during installation or in the event of a storm, grid loss, shutting down, etc. [4].
A Digital Twin of a large-scale structure should start at the early design stage. Within the manufacturing stage, the monitoring of the process parameters is essential for future damage identification. Numerous in-service failures originate from fabrication defects. Therefore, each structure needs to be digitally reconstructed and scanned for flaws before leaving the factory; accordingly, a Digital Twin is updated with information on the manufacturing history and fabrication defects. This should be carried out automatically to limit the workload of such a task.
Technologies such as autonomous robots that adapt to the surface of large-scale offshore structures, such as wind turbine blades, are now being developed to perform three-dimensional (3D) scans and to identify surface manufacturing defects before the structures leave the factory [5]. Although this type of technology shows good potential in terms of accuracy, the long scanning time and high costs, together with the limited robustness of the system, make it difficult to apply on a large scale and to use in other lifecycle stages, such as inspection during operation. Alternative 3D reconstruction technologies, such as LiDAR (light detection and ranging) scanning and drone-based photogrammetry, implemented in the building and construction industry [6,7,8], could also be applied in the offshore industry, not only to perform 3D digital reconstruction of large-scale structures but also to identify surface defects and damages. LiDAR scanning has proven to be a good option, in terms of high extraction speed and high accuracy, for obtaining a 3D digital representation of complex objects, such as bridges, without physical contact [6]. In addition to these two advantages, drone-based photogrammetry has been shown to decrease overall reconstruction time and costs relative to the common manual and direct inspection techniques used in the building and construction industry [7,8]. In this technology, cameras are mounted on unmanned aerial vehicles (UAVs), commonly known as drones, to take high-resolution RGB aerial images of the object from different viewpoints. The images are then used to make a 3D reconstruction of the object through post-processing techniques that match key points using structure from motion (SfM) or multi-view stereopsis (MVS) algorithms [9].
Once the Digital Twin is created with the 3D digital reconstruction of the large-scale structure and information on the manufacturing defects, it must be continuously updated throughout the life cycle of the structure, including transportation, installation, operation, maintenance, and repair. Advanced and robust inspection systems and sensors must be developed, such that the environmental exposure and deterioration of structures and materials may be monitored. Through the Digital Twin, the state of the individual structures can always be assessed, giving valuable information to the operator. Based on this information, the operator can make decisions that affect the lifetime of its assets, e.g., changing the operation mode to reduce the loading on the structures. Furthermore, information about necessary repairs will be available through processing data in the Digital Twin.
Currently, inspections of large-scale offshore structures during operation, such as wind turbine structures, are often carried out manually using, e.g., lifts and rope access. This is a time-consuming, expensive, and potentially dangerous task. Therefore, the use of drones is a potential alternative, as they can carry out remote inspections faster, more safely, and more cheaply [10].
However, drone inspections bring other challenges, such as generating a large volume of images and data that need to be processed. In addition, users have difficulty precisely and correctly locating surface defects and damages because the images are often shown out of context. Therefore, there is a need for a faster and more efficient solution to store, analyze, and report on the vast amount of data the inspections provide. Artificial intelligence (AI) and Digital Twins provide these opportunities. AI can process far more images, much faster, than a human can handle. Observations from multiple images can be grouped by AI into unique issues and, at the same time, their 3D locations can be calculated so that the issues are automatically mapped in the Digital Twin. The result is a single issue that can be seen from multiple images and angles through the Digital Twin. In addition, AI can potentially detect smaller defects/damages that would be missed by the human eye. An example of wind turbine surface damage detection using AI-aided drone inspection is found in [10].
Depending on the application and the objective, the methodologies of creating Digital Twins can be significantly different. In the present study, we propose a novel methodology to create an operational image-based Digital Twin for large-scale offshore structures. We use drone inspection images and/or LiDAR scans to create a 3D geometric reconstruction of the structure, compare the reconstructed model to the original design to locate and quantify geometric deviation, apply AI to identify surface defects/damages, and map the identified surface defects/damages to the geometric reconstruction. The entire process can be done automatically, allowing efficient model updating and status tracking with new sets of inspection images. The proposed methodology is demonstrated on a wind turbine transition piece (TP) and can be applied to other large-scale structures where structural health monitoring and asset management are needed.

2. The Proposed Digital Twin Framework

The process of creating an operational image-based Digital Twin for large-scale offshore structures, as proposed in this study, is shown in Figure 1. The process begins with a description of the physical structure in terms of geometry and position in a georeferenced coordinate system. Drones and/or LiDAR scans are used to obtain a large number of RGB images of the physical structure after the manufacturing process, which are then used to create a three-dimensional geometric reconstruction of the structure. This reconstruction model is produced using photogrammetry. During this step, a detailed meshed CAD model of the structure, containing the details of the geometry, is moved to the same georeferenced coordinate system that is used during the drone flights, for easier comparison with the reconstructed 3D model. In parallel, an artificial intelligence (AI) algorithm is applied to the RGB images to detect and classify paint defects and damages.
A large number of images (>2000) were used to train the convolutional neural network. The state-of-the-art real-time object detection algorithm You Only Look Once (YOLO) version 5 was selected (see reference [11]), and it has proven to be an effective tool for detecting and classifying paint defects/damages. The type, position, and size of the detected paint defects/damages were subsequently extracted from the AI algorithms and mapped to the three-dimensional geometry reconstruction of the transition piece, thereby creating the first version of the Digital Twin. Information on the position, type, and size of paint defects/damages from the Digital Twin can be used to determine if maintenance is needed on the physical structure. After offshore installation of the structure, drones are used to inspect the structure at regular time intervals. The Digital Twin is then updated, based on the new images and scans. In the future, the proposed Digital Twin can be used to facilitate high-fidelity finite element analysis of the structure to evaluate its structural integrity.
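To make the workflow in Figure 1 concrete, the minimal sketch below shows how one inspection cycle could be orchestrated in code. All function and variable names are placeholders introduced for illustration only; they do not correspond to the project's actual software, and the step functions are empty stubs standing in for the methods detailed in Sections 4–6.

```python
def reconstruct_3d(images):
    """Photogrammetry/LiDAR reconstruction of the structure (Section 4). Stub."""
    return None

def compare_to_cad(cad_mesh, reconstruction):
    """Geometric deviation between the CAD model and the reconstruction (Section 4). Stub."""
    return None

def detect_defects(images):
    """YOLOv5-based paint defect/damage detection in the images (Section 5). Stub."""
    return []

def map_defects_to_3d(detections, cad_mesh):
    """Pixel-to-world mapping of the detected defects (Section 6). Stub."""
    return []

def update_twin(twin, images, cad_mesh):
    """One inspection cycle: reconstruct, compare to CAD, detect and map defects."""
    twin["reconstruction"] = reconstruct_3d(images)
    twin["geometric_deviation"] = compare_to_cad(cad_mesh, twin["reconstruction"])
    twin["defects"] += map_defects_to_3d(detect_defects(images), cad_mesh)
    return twin

# The same cycle is re-run on every new set of inspection images (placeholder data).
twin = {"reconstruction": None, "geometric_deviation": None, "defects": []}
twin = update_twin(twin, images=["img_0001.jpg"], cad_mesh=None)
```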

3. Drone-Based Image Acquisition and Pre-Processing

Images of a specific transition piece were collected at the factory of Bladt Industries using a DJI Zenmuse P1 45-megapixel camera onboard a DJI M300 RTK drone. Real-time kinematic positioning (RTK) was used to improve the precision of the GPS information saved in the metadata section of the images. The selected TP had sufficient space around it, making it easier to plan a good flight path in which the drone keeps a safe distance from both the selected TP and the surrounding TPs. Figure 2a,b show the drone flight paths and the actual drone and transition piece, respectively. The drone flight path was planned in a local Cartesian coordinate system with its origin at the center of the CAD model of the tower. The number of flight points was chosen to be high enough to ensure the necessary overlap between images in both the horizontal and vertical directions. As a rule of thumb, the minimum overlap between photos should be 60%, while the maximum angle difference between consecutive photos should be 15 degrees [12]. The flight points were converted to a georeferenced coordinate system and uploaded to the drone. In Figure 2a, the image acquisition positions are represented by the blue dots.
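To illustrate the overlap and angle-step rules of thumb quoted above, the sketch below computes candidate waypoints on circular orbits around a cylindrical structure in the local Cartesian frame. All numerical values (tower radius, stand-off distance, field of view, flight levels) are illustrative assumptions, not the parameters of the actual survey.

```python
import numpy as np

# Illustrative parameters (assumptions, not the values from the actual flights).
tower_radius = 3.5          # m, radius of the TP cylinder
standoff = 10.0             # m, distance from the TP surface to the drone
hfov_deg = 63.0             # deg, horizontal field of view of the camera
min_overlap = 0.60          # rule-of-thumb minimum overlap between photos
max_angle_step_deg = 15.0   # rule-of-thumb maximum angle between consecutive photos
heights = np.arange(2.0, 22.0, 4.0)   # m, flight levels (vertical overlap handled likewise)

orbit_radius = tower_radius + standoff
# Width of the imaged strip on the surface at the stand-off distance.
footprint = 2.0 * standoff * np.tan(np.radians(hfov_deg) / 2.0)
# Allowed spacing between consecutive images for 60% overlap
# (using the orbit radius for the angular step is a conservative simplification).
max_step_arc = (1.0 - min_overlap) * footprint
step_deg = min(np.degrees(max_step_arc / orbit_radius), max_angle_step_deg)
n_points = int(np.ceil(360.0 / step_deg))

angles = np.radians(np.linspace(0.0, 360.0, n_points, endpoint=False))
# Waypoints in local Cartesian coordinates with the origin at the tower center;
# these would subsequently be converted to the georeferenced frame.
waypoints = [(orbit_radius * np.cos(a), orbit_radius * np.sin(a), h)
             for h in heights for a in angles]
print(f"{len(waypoints)} waypoints, {step_deg:.1f} deg between consecutive images")
```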
Images of improved quality were obtained when the camera was mounted in a gimbal stabilizer and kept pointing toward the TP during the flight. The drone images were used both for paint defect/damage detection and for the 3D model reconstruction. The TP is made of steel, so specular reflections from the tower could produce shiny patches in the images with a color different from the surrounding areas. The TP also had very little texture in the images; there were large areas with only slight differences in color.
These issues made the 3D reconstruction of the TP very challenging. A possible solution to this challenge could be to optically project a pattern onto the tower. Image keypoints can be used to predict whether the drone images can successfully be used in the model reconstruction process.
Image keypoints are used in photogrammetry software, such as ContextCapture from the Bentley Institute [12], to find the “interesting” points in an image and to establish the connections between images. The calculation of these keypoints for all of the images is one of the first steps in the model reconstruction process. Keypoints are spatial locations in the image that stand out; they can be an edge or a defect in a surface. Figure 3a,b and Figure 4 show the keypoints, represented by green + signs, in one of the images of the TP. These image keypoints were calculated using the speeded-up robust features (SURF) algorithm and the scale-invariant feature transform (SIFT) algorithm, as seen in Figure 3 and Figure 4, respectively. Both of these robust feature descriptors are invariant to scale changes, blur, rotation, illumination changes, and affine transformations (see reference [13]). SIFT performs better than SURF across images with different scales, while SURF is the faster algorithm (see reference [14]). A good reconstruction is expected when the image keypoints are uniformly distributed; however, this is not the case for the example shown in Figure 3a.
There were very few SIFT and SURF keypoints on the TP, making the reconstruction more challenging. There were many keypoints on the highly textured ground close to the tower, which meant that these areas of the scene would be well captured in the reconstructed model. Both the SIFT and SURF descriptors could find the areas with tower paint defects/damages, as seen in Figure 3b and Figure 4b. These figures show a smaller section of the tower where the SIFT and SURF keypoints, respectively, were positioned at a section with paint defects/damages. The SIFT algorithm also placed keypoints on the welding seam. Figure 3c shows the 100 strongest-matched SURF points in two images captured consecutively during the drone flight.
The possibility of tracking keypoints from one image to the next is very important for a good reconstruction. The calculation of image keypoints does not take a long time. This calculation makes it possible to estimate, in the field, how well suited the images are for performing 3D reconstruction; few keypoints on the tower and in the image would hinder a good 3D reconstruction of the tower. Image keypoints, such as SIFT and SURF, can also be used in the field to estimate where the paint defects/damages are placed on the tower. Images that only show the tower, and not the surroundings, would have image keypoints mostly in the areas of the paint defects/damage. The SURF algorithm was better suited for our purposes because it finds the paint damages and not other non-essential features, such as the welding seams.
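As an illustration of the keypoint screening described above, the sketch below detects SIFT keypoints in two consecutive drone images with plain OpenCV and matches them with Lowe's ratio test. The file names are placeholders; SURF is only available in the opencv-contrib package because of patent restrictions, so SIFT is used here.

```python
import cv2

# Placeholder file names; any two consecutive drone images of the TP would do.
img1 = cv2.imread("drone_image_0001.jpg", cv2.IMREAD_GRAYSCALE)
img2 = cv2.imread("drone_image_0002.jpg", cv2.IMREAD_GRAYSCALE)

sift = cv2.SIFT_create()                       # SIFT is in the main OpenCV module (>= 4.4)
kp1, des1 = sift.detectAndCompute(img1, None)  # keypoints and 128-d descriptors
kp2, des2 = sift.detectAndCompute(img2, None)
print(f"{len(kp1)} and {len(kp2)} keypoints detected")

# Match descriptors and keep only distinctive matches (Lowe's ratio test).
matcher = cv2.BFMatcher(cv2.NORM_L2)
pairs = [p for p in matcher.knnMatch(des1, des2, k=2) if len(p) == 2]
good = [m for m, n in pairs if m.distance < 0.75 * n.distance]
print(f"{len(good)} matched keypoints between the two images")

# Few keypoints on the tower itself is an early warning, already in the field,
# that the 3D reconstruction of those areas may contain holes.
vis = cv2.drawMatches(img1, kp1, img2, kp2,
                      sorted(good, key=lambda m: m.distance)[:100], None)
cv2.imwrite("matches.jpg", vis)
```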

4. 3D Geometry Reconstruction

The 3D reconstructed model of the transition piece is the visual part of the Digital Twin. This reconstruction model was based on both images and LiDAR scans.
The images were collected during the drone flights; ContextCapture from the Bentley Institute was used to generate the 3D reconstruction models. These 3D models are textured meshes with a large number of faces and vertices. Figure 5a shows the CAD model of the TP from Bladt Industries, and Figure 5b shows the corresponding reconstructed 3D model. This reconstruction is based on 445 images. Small holes can be observed on the cylindrical sides of the reconstructed model; these holes appear in areas with a low number of keypoints. A larger number of images can, in some cases, reduce the number of small holes in the reconstruction. The overlap between the CAD model and the reconstructed model is shown in Figure 6. The critical step of accurately aligning the reconstructed and CAD model meshes was done with the MeshLab tool [15]. The grey areas come from the CAD model, while the yellow areas come from the reconstructed model.
In general, the reconstructed model is a very good representation of the CAD model, with a few exceptions. The biggest difference between the two models is the position of the crane. Another difference is the height of the posts on the upper platform: the posts on the as-built tower are lower than specified in the CAD model. Figure 7a shows the distance differences between the models as colors superimposed on the CAD model. The green color corresponds to small distance differences, while the red and blue colors correspond to larger positive and negative differences, respectively. The histogram in Figure 7b shows the relative probability of the different distance differences, together with the corresponding colors. The probability axis of the histogram uses a logarithmic scale, which makes it possible to see the very small probability values and the corresponding colors used in Figure 7a. The distance differences range from −1.1 to 3.7 m; fortunately, the probability of these large values is very low. The histogram has a relatively low standard deviation of 0.30 m and is centered on a mean value very close to zero. The error of the reconstructed model relative to the CAD model is therefore, in general, relatively low. The red colors on the crane, corresponding to large distance differences, are due to the different positions of the crane in the two models. The different crane position is also the reason for the second peak in the histogram at high values of geometric difference, as shown in Figure 7b. Appendix A shows how the dimensions of objects can be measured from the model reconstruction.
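A comparable distance analysis to the one shown in Figure 7 can be scripted outside MeshLab, for instance with the trimesh library as sketched below. The meshes are assumed to be already aligned in the same georeferenced frame, and the file names and sample count are placeholders.

```python
import numpy as np
import trimesh

# Placeholder file names; both meshes are assumed to be already aligned in the
# same georeferenced frame (the alignment itself was done in MeshLab).
cad = trimesh.load("tp_cad.obj")
recon = trimesh.load("tp_reconstruction.obj")

# Sample points on the reconstructed surface and measure their signed distance
# to the CAD surface (trimesh convention: points inside the mesh are positive).
points, _ = trimesh.sample.sample_surface(recon, 50_000)
distances = trimesh.proximity.signed_distance(cad, points)

print(f"mean = {distances.mean():.3f} m, std = {distances.std():.3f} m, "
      f"range = [{distances.min():.2f}, {distances.max():.2f}] m")

# Histogram of the deviations, analogous to Figure 7b.
counts, edges = np.histogram(distances, bins=60)
probability = counts / counts.sum()
```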
The reconstructed 3D model of the TP can also be based on LiDAR scans; these results are shown in Figure 8 and Figure 9. The reconstructed model in Figure 8 is based on six static LiDAR scans of the TP, corresponding to a point cloud of 63 million points. Shiny surfaces can cause problems not only for image-based photogrammetry but also for reconstructions based on LiDAR scans. Blockage of the scanning LiDAR beams is another problem that reduces the quality of the reconstruction. We encountered all of these obstacles during the inspection of the TPs. A lack of information in a certain area of the tower can cause holes to appear in the reconstructed 3D model; smaller holes can be mitigated in the photogrammetry software ContextCapture from the Bentley Institute. The comparison between the CAD model and the reconstructed model based on LiDAR scans is shown in Figure 9. Figure 9a again shows the distance differences between the CAD model and the reconstructed 3D model as colors superimposed on the CAD model. The probabilities of the distance differences are shown in Figure 9b, together with the corresponding color distribution. The distance differences range from −2.8 to 4.1 m. The histogram has a relatively low standard deviation of 0.57 m and is centered on a mean value very close to zero.
Therefore, the error of the reconstructed model relative to the CAD model was, in general, relatively low. Some parts of the ladders on the platform were not reconstructed due to blockage of the LiDAR beams. This, together with the small holes in the reconstruction, increased the width of the histogram. The quality of the model reconstruction based on RGB images is better than that based on LiDAR scans, as is evident from a comparison of the histograms in Figure 7b and Figure 9b. The ladders on the platform were better captured in the image-based reconstruction because the platform itself blocks the LiDAR beams. Mounting the LiDAR on the drone could solve this problem.

5. AI-Based Surface Defect/Damage Detection

The domain of defect/damage detection on TP surfaces falls under the category of either semantic segmentation or object detection. Given recent advancements in object detection with YOLO-based algorithms, which include benefits such as high classification accuracy and real-time throughput [11], we applied object detection to the task at hand. Incoming images originated from drone footage using a high-resolution camera (~45 MP), reinforcing the requirement for highly efficient neural networks.
A YOLO-based architecture was chosen due to its high network throughput and the availability of open-source implementations in common machine-learning libraries. Given the large image sizes in this project (approx. 45 MP), high throughput was a key requirement, and the availability of implementations in common machine-learning libraries allowed easier deployment on a wider range of GPU-ready machines.
The neural network architecture used was YOLOv5 (see reference [11]), which, in turn, is based on the seminal YOLO (You Only Look Once) network introduced by Joseph Redmon and co-workers. In this family of architectures, input images are effectively divided into a grid of cells, where each cell outputs a likelihood, constrained to the interval [0, 1], for the presence of one or more objects (called “objectness”), values describing the corresponding bounding boxes (height, width, and box center), and the associated class likelihoods. The loss function contains terms for each of these values, where the values from cells with an objectness lower than a given threshold are ignored and not penalized during loss calculation.
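For reference, YOLOv5 models can be loaded and run through PyTorch Hub as in the sketch below. The public 'yolov5s' checkpoint and the image path are placeholders; the study's own trained weights would be loaded through the 'custom' entry point instead.

```python
import torch

# Load a YOLOv5 model through PyTorch Hub. 'yolov5s' is a public pretrained
# checkpoint used here only as a placeholder; the project's own weights would
# be loaded with the 'custom' entry point and a local .pt file instead.
model = torch.hub.load("ultralytics/yolov5", "yolov5s")
# model = torch.hub.load("ultralytics/yolov5", "custom", path="tp_defects.pt")

results = model("drone_image_0001.jpg")   # placeholder image path
detections = results.pandas().xyxy[0]     # one row per box: xmin, ymin, xmax, ymax, confidence, class, name
print(detections.head())
```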
All of the available labeled data amounted to ~1200 images from handheld cameras and manual drone flights, totaling ~3500 objects of interest. Of these images, ~230 were reserved as a validation set. We expanded the training set by applying augmentations, including rotation, translation, HSV value shifts, perspective shifts, horizontal/vertical flips, and more.
A widely used performance metric for object detection is the mean average precision (mAP). A target performance of 0.7 mAP was defined in the early stages of this study, which corresponded to a high-performing model on open-source datasets at that time. The best-performing model reached only 0.31 mAP on a hold-out set. The stark contrast between the expected and realized performance is attributed to subjectivity in labeling. Contrary to most publicly used datasets, where object granularity is rarely an issue, the data in this work contain cases where a single object of interest may constitute many smaller objects (see Figure 10 and Figure 11) or, conversely, where many smaller objects may constitute a single larger object. This decision-making is somewhat subjective and depends on the pragmatism of the labeler. As an example, the class “Flying Rust” often comprises 100 or more small flecks of rust on the TP surface, grouped in heterogeneous clusters. Given that object granularity bore little consequence in our application field, we developed a performance metric that calculates the mean intersection over union (IoU) for each given class, irrespective of the number of bounding boxes that intersect with a ground-truth bounding box (or vice versa). With this metric (abbreviated mIoU), our model performed at 0.59. For comparison, a high-performing model evaluated on the COCO 2017 dataset reached 0.69 mIoU.
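One way to implement the granularity-agnostic metric described above is to rasterize all ground-truth and predicted boxes of a class into binary masks and compute a single IoU per class, as in the sketch below. This is a simplified reading of the mIoU metric, not the exact implementation used in the study.

```python
import numpy as np

def class_iou(gt_boxes, pred_boxes, image_shape):
    """IoU between the union of ground-truth boxes and the union of predicted
    boxes of one class, irrespective of how many boxes cover each region.
    Boxes are (xmin, ymin, xmax, ymax) in pixels; image_shape is (height, width)."""
    gt_mask = np.zeros(image_shape, dtype=bool)
    pred_mask = np.zeros(image_shape, dtype=bool)
    for x0, y0, x1, y1 in gt_boxes:
        gt_mask[int(y0):int(y1), int(x0):int(x1)] = True
    for x0, y0, x1, y1 in pred_boxes:
        pred_mask[int(y0):int(y1), int(x0):int(x1)] = True
    union = np.logical_or(gt_mask, pred_mask).sum()
    return np.logical_and(gt_mask, pred_mask).sum() / union if union else 1.0

def mean_iou(gt_by_class, pred_by_class, image_shape):
    """Average the per-class IoU over all classes present in the ground truth."""
    return np.mean([class_iou(gt_by_class[c], pred_by_class.get(c, []), image_shape)
                    for c in gt_by_class])

# Toy example: one "Flying Rust" cluster labeled as a single large box but
# predicted as two smaller boxes still yields a perfect IoU.
gt = {"flying_rust": [(100, 100, 300, 200)]}
pred = {"flying_rust": [(100, 100, 200, 200), (200, 100, 300, 200)]}
print(mean_iou(gt, pred, image_shape=(1080, 1920)))   # -> 1.0
```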
Training to convergence on the validation dataset took approximately 30 h on a P6000 GPU. Once training was completed, the network weights were stored, and the neural network could be deployed with these weights in a REST server on a GPU-enabled machine to make real-time detections over a network. Examples of ground truth labels and network detections are shown in Figure 10 and Figure 11.
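A REST deployment of the stored weights could look like the following FastAPI sketch. The web framework, route name, and weight file are assumptions made for illustration; they do not describe the service actually used in the project.

```python
# Hypothetical REST wrapper around the trained detector (FastAPI chosen only
# for illustration; route name and weight file are placeholders).
import io

import torch
from fastapi import FastAPI, File, UploadFile
from PIL import Image

app = FastAPI()
model = torch.hub.load("ultralytics/yolov5", "custom", path="tp_defects.pt")

@app.post("/detect")
async def detect(image: UploadFile = File(...)):
    img = Image.open(io.BytesIO(await image.read()))
    results = model(img)
    # Return one record per detected defect: bounding box, confidence, class name.
    return results.pandas().xyxy[0].to_dict(orient="records")

# Run with, e.g.: uvicorn server:app --host 0.0.0.0 --port 8000
```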

6. Defect/Damage Mapping

The paint defects/damages were mapped, using a novel approach, onto the reconstructed model or, alternatively, onto the CAD model. The reconstructed/CAD model with all the defects/damages can be used to provide an overview of the positions of the defects/damages in 3D, making it possible to identify any systematic pattern in where the defects/damages occur. This identification could be used to optimize production. The AI algorithm identified the pixels in an image that corresponded to paint defects/damages; this 2D information was then mapped onto the 3D CAD or reconstructed model. The same paint defect/damage could often be seen from slightly different angles in a number of different images. Each of these pixels with paint defects/damages was mapped to the reconstructed model, thereby enlarging the paint defect/damage area on the tower; some of the mapped defect/damage points therefore overlapped each other. Three different coordinate systems were used to perform the mapping (see Figure 12a). The world coordinate system (x, y, z) is shown in the figure, together with the three colored axes of the local camera coordinate system. Moving this coordinate system to the center of the camera sensor results in the third coordinate system.
A meshed version of the CAD or reconstructed model was placed in the georeferenced coordinate system used during model reconstruction. The local camera coordinate system, in which the x-, y-, and z-axes are colored blue, red, and black, respectively, is shown in Figure 12a. The camera angle, measured from the optical axis of the camera, determines the direction in which the camera is pointing; the green ray in Figure 12 shows this direction. A simple pinhole camera model provides the relation between an object in the “real world” and the corresponding image captured by the camera. The effects of the light-collecting lens in the camera were added to the pinhole camera model, and this “extended” pinhole camera model was used as shown in Equation (1) below. If (x′, y′) are the pixel coordinates in the image, then the 3D world coordinates (x, y, z) of the object can be calculated using the following equation:
$$
\begin{pmatrix} x \\ y \\ z \end{pmatrix}
= \mathrm{RotM}
\begin{pmatrix}
\dfrac{(x' - c_x)\,D_{CT}}{f_k} \\[6pt]
\dfrac{(y' - c_y)\,D_{CT}}{f_l} \\[6pt]
D_{CT}
\end{pmatrix}
+
\begin{pmatrix} c_{pos,x} \\ c_{pos,y} \\ c_{pos,z} \end{pmatrix}
\tag{1}
$$
Here, (cposx, cposy, cposz) is the position of the camera in world coordinates. DCT is the distance from the camera to the tower in the direction in which the camera is pointing, while fk and fl are the focal lengths of the camera expressed in pixels along the horizontal and vertical directions of the camera sensor, respectively. The two values cx and cy move the pixel values from the upper left corner to the center of the camera sensor. RotM is the rotation matrix that transforms local camera coordinates to world coordinates. In many applications, the image pixels are calculated from the 3D world coordinates of the object; this calculation is the reverse of Equation (1). Parameters such as the camera positions and rotation matrices can often be extracted from the photogrammetry software for every image used in the reconstruction. The simple pinhole camera model and the “extended” pinhole camera model, which calculate the image pixel values from the 3D world coordinates, are explained in the course notes found in reference [16].
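Equation (1) translates directly into a few lines of NumPy, as sketched below. The pose, intrinsics, and distance values are placeholders; in practice they are exported per image from the photogrammetry software, as noted above.

```python
import numpy as np

def pixel_to_world(px, py, rot_m, cam_pos, fk, fl, cx, cy, dct):
    """Equation (1): back-project an (undistorted) image pixel to 3D world
    coordinates, given the camera pose and the camera-to-tower distance DCT."""
    cam_coords = np.array([(px - cx) * dct / fk,
                           (py - cy) * dct / fl,
                           dct])
    return rot_m @ cam_coords + cam_pos

# Placeholder values; RotM, the camera position, and the intrinsics
# (fk, fl, cx, cy) would normally come from the photogrammetry software.
rot_m = np.eye(3)                        # local camera axes -> world axes
cam_pos = np.array([10.0, -25.0, 18.0])  # camera position in world coordinates (m)
fk = fl = 11000.0                        # focal lengths in pixels
cx, cy = 4096.0, 2730.0                  # sensor center in pixels
dct = 12.0                               # distance from camera to tower (m)

print(pixel_to_world(5120.0, 3100.0, rot_m, cam_pos, fk, fl, cx, cy, dct))
```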
The effect of distortion can be seen in the images from the drone: vertical structures placed in the center of the image are reproduced as vertical, but near the edges of the images they tend to be slightly curved, showing signs of distortion. Radial and tangential lens distortion can be added to the “extended” pinhole model in Equation (1). Small lenses in particular suffer from radial distortion, because light rays bend more near the edges of the lens than near its center. Tangential distortion occurs when the lens and the image plane are not parallel. To account for radial and tangential distortions, their contributions to the pixel values must be calculated. If the dimensionless pixel values dlx and dly used in Equation (1) are given by
$$
d_{lx} = \frac{x' - c_x}{f_k}, \qquad d_{ly} = \frac{y' - c_y}{f_l}
\tag{2}
$$
then the radial (xrdis, yrdis) and tangential distortions (xtdis, ytdis) are provided by the following set of equations:
$$
\begin{aligned}
x_{rdis} &= d_{lx}\left(k_1 r^2 + k_2 r^4 + k_3 r^6\right), &
y_{rdis} &= d_{ly}\left(k_1 r^2 + k_2 r^4 + k_3 r^6\right),\\
x_{tdis} &= 2 p_1 d_{lx} d_{ly} + p_2\left(r^2 + 2 d_{lx}^2\right), &
y_{tdis} &= p_1\left(r^2 + 2 d_{ly}^2\right) + 2 p_2 d_{lx} d_{ly}
\end{aligned}
\tag{3}
$$
Here, $r = \sqrt{d_{lx}^2 + d_{ly}^2}$, k1, k2, and k3 are the radial distortion coefficients of the lens, and p1 and p2 are the tangential distortion coefficients of the lens. These distortion parameters can normally also be extracted from the photogrammetry software. The magnitude of the distortion correction in Equations (2) and (3) increases with the radius r. The distance DCT from the camera to the tower, needed in the equations above, can be calculated because both the camera and tower positions are known. The distortion corrections for the paint damage pixels found by the AI routine are given by Equation (3). The dimensionless pixel values dlx and dly from Equation (2) are corrected by these values before they are used in Equation (1) to calculate the 3D world coordinates of the paint damages. The red points in Figure 12a,b near and on the tower are the world coordinates corresponding to the paint defect/damage pixels calculated using Equations (1)–(3). Not all of the points lie on the tower (see Figure 12b), because depth information for all of the pixel values in the image would be needed to perform a precise image-pixel-to-world-coordinate transformation.
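The distortion handling of Equations (2) and (3) can be sketched in the same way. The correction is applied to the dimensionless pixel values before they enter Equation (1); subtracting the distortion terms is assumed here, and all lens coefficients are placeholder values of the kind normally exported from the photogrammetry software.

```python
import numpy as np

def undistort_pixel(px, py, fk, fl, cx, cy, k1, k2, k3, p1, p2):
    """Equations (2)-(3): remove radial and tangential lens distortion from a
    pixel and return corrected dimensionless coordinates (dlx, dly).
    Subtracting the distortion terms is an assumption made for this sketch."""
    dlx = (px - cx) / fk                     # Equation (2)
    dly = (py - cy) / fl
    r2 = dlx**2 + dly**2                     # r^2, with r = sqrt(dlx^2 + dly^2)
    radial = k1 * r2 + k2 * r2**2 + k3 * r2**3
    x_dis = dlx * radial + 2 * p1 * dlx * dly + p2 * (r2 + 2 * dlx**2)
    y_dis = dly * radial + p1 * (r2 + 2 * dly**2) + 2 * p2 * dlx * dly
    return dlx - x_dis, dly - y_dis          # corrected values used in Equation (1)

# Placeholder lens coefficients and intrinsics, matching the earlier sketch.
dlx, dly = undistort_pixel(5120.0, 3100.0, fk=11000.0, fl=11000.0,
                           cx=4096.0, cy=2730.0,
                           k1=-0.05, k2=0.01, k3=0.0, p1=1e-4, p2=-5e-5)
print(dlx, dly)
```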
The green points in the figure are calculated as a projection of the red points onto the tower. These green points represent the mapping of the paint defect/damage pixels found in one of the drone images onto the CAD model of the transition piece. Figure 13a,b show the CAD model superimposed with the paint defects/damages found in all the drone images from this particular flight. The defects/damages are not distributed uniformly; there is a higher concentration of paint defects/damages near the bottom of the tower than in the middle and upper sections.
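The projection of the back-projected 3D points onto the tower surface (the green points in Figure 12) can be sketched with trimesh's proximity query, which returns the closest surface point for each query point. Nearest-point projection is used here as one plausible reading of the projection step, and the mesh file and point values are placeholders.

```python
import numpy as np
import trimesh

# Placeholder mesh; in the study this is the meshed CAD (or reconstructed)
# model of the TP, placed in the georeferenced coordinate system.
tp_mesh = trimesh.load("tp_cad.obj")

# 3D points obtained from the paint defect/damage pixels via Equations (1)-(3)
# (the "red points" in Figure 12); dummy values used here.
defect_points = np.array([[10.2, -24.1, 17.5],
                          [10.4, -24.0, 16.9]])

# Project each point onto the closest point of the tower surface
# (the "green points" in Figure 12).
query = trimesh.proximity.ProximityQuery(tp_mesh)
surface_points, distances, triangle_ids = query.on_surface(defect_points)
print(surface_points)
```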

7. Concluding Remarks

In this study, we proposed and demonstrated a novel concept and developed a working methodology to create an operational Digital Twin of large-scale structures based on drone inspection images and LiDAR scans. The Digital Twin presented in this study is a 3D visual representation of a physical structure, e.g., the wind turbine transition piece demonstrated here, on which both drone inspections and LiDAR scans were conducted. The Digital Twin can (1) visualize and quantify the geometric deviation of the real structure from the design, (2) map the surface defects/damages, including their sizes, locations, and types detected by AI, onto the 3D geometric reconstruction, and (3) be updated whenever new sets of inspection data become available.
The Digital Twin concept presented in this study is operational and opens many opportunities for preventive maintenance and optimal asset management of large-scale structures and infrastructure. Further improvement of the Digital Twin is necessary to make it more accurate, automated, robust, and faster. The remaining challenges include improving the accuracy of 3D geometric reconstruction when inspection images contain shiny surfaces, automating and streamlining the workflow for handling very large amounts of data, and enhancing the robustness of the Digital Twin for other applications on different large-scale structures. The proposed framework is modular and has the flexibility to adapt and upgrade. For example, the current study uses AI to train the damage detection network; this module can be replaced by other methods, such as wavelet and contourlet transforms, which can also detect damages from images. In addition, the damage-related information obtained from the drone-based images could be complemented with data from sensors installed in the structure, such as strain gauges and accelerometers, and from other inspection techniques, such as thermography. Advanced numerical tools can be further developed to interpret these sensor signals and inspection data, and to simulate the consequences of structural and material deterioration. These challenges will be the focus of our future studies, based on the foundation provided by the current work.

Author Contributions

H.-H.B. implemented software for geometry reconstruction, developed the concept for defect/damage mapping, and wrote the original manuscript; X.C. developed the research idea, supervised and managed the research, analyzed results, and wrote the original manuscript; L.B. implemented AI algorithms for defect/damage detection and contributed to writing the manuscript; K.B. contributed to the overall idea on Digital Twin and to writing the manuscript; O.C. contributed to data structure, working flow, testing, and writing the manuscript; J.S. provided all drone images and LiDAR measurements and contributed to writing the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the QualiDrone Project (Intelligent, autonomous drone inspection of large structures within the energy industry, 64020-2099) through the Danish Energy Technology Development and Demonstration Program (EUDP).

Institutional Review Board Statement

Not applicable to this study.

Informed Consent Statement

Not applicable to this study.

Data Availability Statement

Not applicable to this study.

Acknowledgments

We would like to thank BLADT INDUSTRIES A/S for supporting this study. The authors are grateful to Christian Boysen from Energy Cluster Denmark, Teddy Jensen and Jeppe Hebsgaard Laursen from Zebicon A/S, and Jens Skoustrup-Jacobsen from Desupervised for their constructive comments and useful discussions in the QualiDrone project.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

This appendix discusses a research model of a transition piece. Figure A1 shows an image of the transition piece model. This research model and the reconstruction can help us better understand the limiting factors in the reconstruction process based on photogrammetry. A small yellow magnet is placed on the TP. The height of this magnet can be estimated from the largest difference between the CAD model of the TP and the reconstructed model based on photogrammetry.
Figure A1. (a) The picture shows a simple model of a transition piece; a magnet has been placed on the TP model. (b) The color on the CAD model represents the distance between the CAD model and the reconstructed model. The histogram shows this relation, together with the frequency of the distance differences measured in mm. Green color: distances close to zero; red and blue colors correspond to positive and negative distances, respectively. The histogram peaks at higher values of geometric difference correspond to real differences between the two models, because the magnet is not included in the CAD model of the TP. The largest distance values in the histogram, corresponding to the dark red values, are therefore estimates of the height of the magnet.
The reconstruction is based on 221 images. There is approximately an 85% overlap between the images. The images have been framed so they include both the tower and the surrounding areas, which helps the reconstruction software. Based on the geometric differences between the reconstruction and the CAD model, the height of the magnet can be estimated to be 6.6 mm. The height of the magnet is measured to be 6.7 mm, so the estimated value is relatively accurate.

References

  1. European Commission. Industrial Research and Innovation: Why the EU Supports Industrial Research and Innovation. 2020. Available online: https://ec.europa.eu/info/research-and-innovation/research-area/industrial-research-and-innovation_en (accessed on 1 December 2021).
  2. Barricelli, B.R.; Casiraghi, E.; Fogli, D. A survey on Digital Twin: Definitions, characteristics, applications, and design implications. IEEE Access 2019, 7, 167653–167671.
  3. Bolton, R.N.; McColl-Kennedy, J.R.; Cheung, L.; Gallan, A.; Orsingher, C.; Witell, L.; Zaki, M. Customer experience challenges: Bringing together digital, physical and social realms. J. Serv. Manag. 2018, 29, 776–808.
  4. Branner, K.; Eder, M.A.; Danielsen, H.K.; Chen, X.; McGugan, M. Towards more smart, efficient and reliable wind-turbine structures. In DTU International Energy Report 2021: Perspectives on Wind Energy; Jørgensen, B.H., Madsen, P.H., Giebel, G., Martí, I., Thomsen, K., Eds.; DTU Wind Energy: Roskilde, Denmark, 2021; pp. 115–124. Available online: https://backend.orbit.dtu.dk/ws/portalfiles/portal/264478705/Chapter_12_DTU_International_Energy_Report_2021.pdf (accessed on 1 December 2021).
  5. FORCE Technology. Autonomous Robot 3D Scans Wind Turbine Blades. 2019. Available online: https://forcetechnology.com/en/about-force-technology/news/2019/autonomous-robot-3d-scans-wind-turbine-blades (accessed on 1 December 2021).
  6. Rashidi, M.; Mohammadi, M.; Sadeghlou Kivi, S.; Abdolvand, M.M.; Truong-Hong, L.; Samali, B. A decade of modern bridge monitoring using terrestrial laser scanning: Review and future directions. Remote Sens. 2020, 12, 3796.
  7. Rashidi, M.; Samali, B. Health monitoring of bridges using RPAs. In EASEC16. Lecture Notes in Civil Engineering; Wang, C.M., Dao, V., Kitipornchai, S., Eds.; Springer: Singapore, 2021; Volume 101, pp. 209–218.
  8. Masoud, M.; Vahid Mousavi, M.R.; Yang Yu, L.K.; Samali, B. Quality evaluation of Digital Twins generated based on UAV photogrammetry and TLS: Bridge case study. Remote Sens. 2021, 13, 3499.
  9. Szeliski, R. Computer Vision: Algorithms and Applications; Springer: London, UK, 2011.
  10. Shihavuddin, A.S.M.; Chen, X.; Fedorov, V.; Christensen, A.N.; Riis, N.A.B.; Branner, K.; Dahl, A.B.; Paulsen, R.R. Wind turbine surface damage detection by deep learning aided drone inspection analysis. Energies 2019, 12, 676.
  11. Jocher, G.; Stoken, A.; Chaurasia, A.; Borovec, J.; NanoCode012; Xie, T.; Kwon, Y.; Michael, K.; Changyu, L.; Fang, J.; et al. Ultralytics/yolov5: V4.0-nn.SiLU() Activations, Weights & Biases Logging, PyTorch Hub Integration (v4.0). 2021. Available online: https://github.com/ultralytics/yolov5 (accessed on 13 February 2021).
  12. Bentley Institute Inc. ContextCapture: 4D Digital Context for Digital Twins; Bentley Institute Inc.: Exton, PA, USA, 2021. Available online: https://www.bentley.com/en/products/product-line/reality-modeling-software/contextcapture (accessed on 1 December 2021).
  13. Jain, S.; Kumar, B.L.S.; Shettigar, R. Comparative study on SIFT and SURF face feature descriptors. In Proceedings of the 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India, 10–11 March 2017; pp. 200–205.
  14. Mistry, D.; Banerjee, A. Comparison of feature detection and matching approaches: SIFT and SURF. GRD J. Glob. Res. Dev. J. Eng. 2017, 2, 2455–5703.
  15. MeshLab. Available online: https://www.meshlab.net/#description (accessed on 1 December 2021).
  16. Hata, K.; Savarese, S. Notes from Stanford Course CS231A: Computer Vision, From 3D Reconstruction to Recognition; Stanford University: Stanford, CA, USA, 2021. Available online: https://web.stanford.edu/class/cs231a/course_notes.html (accessed on 1 December 2021).
Figure 1. The flowchart shows the different steps needed to calculate the Digital Twin of a large-scale structure and subsequently update the physical structure based on this information.
Figure 2. Drone-based image acquisition. (a) Drone flight planning, with blue dots showing the drone locations for image acquisition. (b) The actual drone flight around the transition piece. Information that possibly leads to the identification of the TP owner is removed from the images.
Figure 3. (a) SURF keypoints in the image shown as green plus signs. There are a large number of image keypoints on the highly textured ground but only a few on the TP. These keypoints are placed on edges, light shadows, and paint defects/damages. (b) A smaller section of the image where paint defects and damages are located. The green SURF keypoints are placed on the paint defects/damages. (c) The 100 strongest-matched SURF points in two images. The yellow lines connect the matched SURF image keypoints in the two images.
Figure 4. (a) The green plus signs show the SIFT keypoints in the image. Compared to the SURF keypoints, a higher number of points are placed on the TP. (b) A smaller section of the image where paint defects/damages are located. The SIFT algorithm places keypoints both on the paint defects/damages and on the welding seams. This explains the higher number of SIFT keypoints.
Figure 5. (a) The CAD model of a transition piece. (b) The reconstructed 3D model based on 445 drone images. Small holes can be observed on the cylindrical surfaces of the TP. These holes are the result of large cylindrical surfaces with only small changes in the color of the tower. Information that possibly leads to the identification of the TP owner is removed and the local design details are masked in the images.
Figure 6. Comparison between design and reconstruction. The mesh of the CAD model has been aligned with the mesh of the reconstructed model. The yellow areas come from the reconstructed model, while the grey areas come from the CAD model. It is seen that the crane is not placed in the same position in the two models. Information that possibly leads to the identification of the TP owner is removed from the images.
Figure 7. Quantification of geometric deviation. (a) The color contour represents the distance between the CAD model and the reconstructed model. (b) The histogram shows this relation, together with the probability of the distance differences measured in meters. Green color: distances close to zero. Red and blue colors correspond to positive and negative distances, respectively. Note that the probability is expressed using a logarithmic scale, thus making it possible to see the color bars for the distances with a very small relative probability. The histogram peaks for higher values of geometric differences correspond to real differences between the two models. The dark red peak is the result of the different positions of the crane in the two models.
Figure 8. The reconstructed 3D model based on six static LiDAR scans. The ladders on the platform are not perfectly captured, due to a blockade of the scanning beams by the platform itself. A LiDAR scanner aboard a drone should not have this problem. Information that possibly leads to the identification of the TP owner is removed from the images.
Figure 9. (a) The color on the CAD model represents the geometric distance between the CAD model and the LiDAR-based reconstructed model. (b) The histogram shows this relation, together with probabilities of the distance differences measured in meters. Green color: distances close to zero. Red and blue colors correspond to positive and negative distances, respectively. A logarithmic probability scale is used. Note that the histogram for the first peak is relatively narrow, corresponding to a low standard deviation of the geometric differences.
Figure 10. The ground truth objects (left) and associated outputs from the network (right). The example shown is an out-of-sample image that was not presented to the network during training time.
Figure 11. In this example, an object labeled using two separate bounding boxes in the ground truth case was detected as one object by the neural network. Bounding box granularity is a subjective matter of little relative consequence in our application area, where the presence of objects is far more important. We are currently developing a metric that is more agnostic to this subjectivity than the traditional mean average precision (mAP) used in most open source projects.
Figure 12. (a) The CAD model of the TP has been meshed with a triangular mesh and placed in a georeferenced coordinate system. This 3D world coordinate system is shown in the figure, together with a local camera coordinate system in which the x-, y-, and z-axes have the colors blue, red, and black, respectively. The z-axis of this local camera coordinate system is the optical axis of the camera. The green ray shows the direction in which the camera is pointing. The red points near and on the tower are the 3D points corresponding to the paint defect/damage pixels, while the green points are the projected points onto the tower. (b) The figure zooms in on the projected points.
Figure 13. (a) The meshed CAD model is superimposed with the positions of all the paint defects/damages found in the images collected during one flight. (b) The green points show the paint defects/damage in a small section of the tower.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
