Article

True2 Orthoimage Map Generation

1 College of Earth Sciences, Guilin University of Technology, Guilin 541004, China
2 Guangxi Key Laboratory of Spatial Information and Geomatics, Guilin University of Technology, Guilin 541004, China
3 College of Geomatics and Geoinformation, Guilin University of Technology, Guilin 541004, China
4 Guangxi Zhuang Autonomous Region Natural Resources Remote Sensing Institute, Nanning 530219, China
* Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(17), 4396; https://doi.org/10.3390/rs14174396
Submission received: 6 August 2022 / Revised: 28 August 2022 / Accepted: 1 September 2022 / Published: 4 September 2022

Abstract

Digital/true orthoimage maps (D/TOMs) are one of the most important forms of national spatial data infrastructure (NSDI). Traditional D/TOM generation orthorectifies an aerial image into its upright, correct position by removing relief displacements and image distortions. As a result, the generated D/TOM has no building façade texture when it is superimposed on a digital building model (DBM). This limitation is no longer tolerable for certain applications, such as micro-climate investigation. For this reason, this paper presents the generation of a true2 orthoimage map (T2OM), which is radically different from the traditional D/TOM. The basic idea of single-building T2OM generation is to orthorectify the DBM-based building roof from top to bottom and the building façades from front to back, from back to front, from left to right, and from right to left, and to complete a digital terrain model (DTM)-based T2OM, for which a "superpixel" is proposed to store the building ID, texture ID, elevation, and gray value of each pixel. Two study areas are used to verify the method. The experimental results demonstrate that the T2OM not only maintains the traditional characteristics of the D/TOM, but also displays building façade textures and makes three-dimensional (3D) coordinates (XYZ) measurable at any point; the accuracy of 3D measurement on a T2OM reaches 0.025 m (0.3 pixel).

1. Introduction

Digital orthophotomaps (DOMs) are a critical component of national spatial data infrastructure (NSDI) [1,2,3,4]. DOMs (1) serve as a geospatial foundation upon which to add detail and attach attribute information; (2) provide a base on which to accurately register and compile other themes of data; and (3) orient and link the results of an application to the landscape [5,6]. In particular, a highly detailed DOM can serve as a source for locating the features to be mapped and measured [7].
Many investigations have demonstrated that generating high-resolution urban DOMs with the existing procedures and algorithms, proposed in 1990 by the USA National Digital Orthophoto Program (NDOP), encounters many problems, such as incomplete orthorectification, occlusion, ghost images, and shadows. A comprehensive discussion of these problems can be found in [8,9]. Thus, the generation of so-called true orthophoto maps (TOMs) became obligatory and was researched by many studies at the end of the 20th century and the beginning of the 21st century, e.g., [10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39].
However, when a TOM is superimposed onto a corresponding digital terrain model (DTM) in a flat and/or hilly area, the terrestrial texture is clearly visible (Figure 1a), but when a TOM is superimposed onto a digital building model (DBM), the detailed façade textures of a building are not visible (Figure 1b). These problems significantly limit the usefulness of TOMs in fields such as micro-climate monitoring, micro-environment analysis, and cellphone transmission station siting, since incomplete building façade information cannot be tolerated in these applications. Therefore, this paper proposes a true2 orthoimage map (T2OM) generation method, which provides three-dimensional (3D) coordinates and detailed textures of a building's roof and facades.

2. Related Works

The study of TOMs began in the 1980s and has continued for decades. A successful and complete TOM generation method consists of an orthorectification algorithm, occlusion detection and compensation, shadow detection and recovery, seamless mosaicking, etc. The review of related works is therefore organized accordingly.
Occlusion detection and compensation: Occlusion detection is one of the core components of TOM generation. Amhar et al. [10] and Schickler and Thorpe [11] considered the hidden effects introduced by abrupt changes in surface height (e.g., buildings and bridges). Jauregui et al. [20] presented a procedure for orthorectifying aerial photographs to produce and update terrain surface maps. Vassilopoulou et al. [40] used IKONOS images to generate orthoimages for monitoring volcanic hazards on Nisyros Island, Greece, and Siachalou [21] used IKONOS images to generate urban orthoimages. Cameron et al. [41] analyzed orthorectified aerial photographs to measure changes in the native pinewood of Scotland, and Passini and Jacobsen [42] analyzed the accuracy of orthoimages from very-high-resolution imagery. Biasion et al. [22] further explored the automatic generation of true orthoimages. Piatti and Lerma [43] addressed the problem of image orthorectification through photogrammetric simulation based on digital elevation/building/surface models, as well as the interior and exterior orientation parameters of the image sensors (i.e., digital cameras); this method appears able to create the high-resolution 3D models needed for accurate orthophotos. Zhou et al. [29] proposed a new urban orthophoto occlusion detection method, which first establishes a model describing the relationship between each ghost image and the corresponding building occlusion boundary, and then applies an algorithm that uses building displacement to identify the occlusion region in the ghost image; the method effectively avoids pseudo-occlusion detection and the drawback of simultaneous occlusion detection and orthophoto generation, providing a key technique for the DBM-based T2OM. Yoo and Lee [44] proposed a facet-based method for generating realistic orthophotos of building surface facets; it identifies occluded areas based on the unit surfaces of a building and uses multiple images and highly detailed digital building model (i.e., DBM) data to mutually recover the occluded areas. Oliveira et al. [45] proposed a new occlusion detection method for true orthoimage mosaic generation that uses irregularly spaced point clouds to identify occluded regions, avoiding interpolation as an initial step of occlusion detection and thus avoiding the insertion of additional errors into the surface representation. Zhou and Sha [46] proposed a method to simultaneously detect building roof and ground shadows using the DBM as an overlay model; it determines the solar zenith and altitude angles from the geographic information of corner points on the shadow boundary in the aerial image, and then displays the actual shadow area determined in the DBM on the ghost image. The method is independent of ground reflectivity and illumination conditions, and provides technical support for producing high-quality true orthoimages. Marsetič [47] proposed a method to automatically generate true orthophotos from very-high-resolution optical satellite images; the automatic workflow consists of five modules, starting with the extraction of ground control points, followed by the geometric processing of image patches, occlusion detection, orthorectification, and finally the generation of the true orthophoto. The quality of the true orthoimages produced by this method depends on the accuracy of the geometric correction and the number of images.
For occlusion compensation in TOM generation, Skarlatos [13] and Greenfeld [15] demonstrated that building occlusions significantly influenced not only the image quality but also the accuracy of the orthoimages. Rau et al. [18] treated enhancements in image radiometry, demonstrating a suitable enhancement technique for restoring information within building shadow areas. Sheng et al. [48] used a model-based method to reconstruct a canopy surface model (CSM) to replace the DEM when generating true orthophotos of forest scenes, focusing mainly on the occlusion and distortion caused by trees in forested areas. Zhou et al. [9] compensated by conjugating blocks of orthoimages, i.e., by refilling the masked areas from adjacent orthoimages. Zhou et al. [29] used adjacent overlapping "slave" orthophotos to fill the occluded regions of the "master" orthophoto with the filling method proposed in [9]. With such occlusion compensation, a complete true orthophoto can be created for the study area.
For shadow detection and recovery, many efforts have been made. For example, Leone and Distante [49] performed image shadow detection by improving the classification, segmentation, and localization of detected objects, which improved the effect of shadow detection. Makarau et al. [50] proposed an alternative robust method for shadow detection, which adaptively calculates the parameters of a specific scene and can be applied to many different sensors and to images obtained under different lighting conditions, improving the accuracy of shadow detection. Tiwari et al. [51] proposed an improved algorithm that obtains rough shadows by changing the ratio of intensity to hue and then performs shadow compensation using local thresholding; their experimental results show that the method is better suited to shadow detection in low- and medium-intensity images, while the shadow compensation algorithm is suitable for all test images.
For the mosaicking of multiple TOMs, many researchers have worked toward high-quality TOM generation by improving TOM mosaicking, e.g., [11,17,26,36,37,38,52,53]. Their studies resulted in clearer features in the shadows and a more continuous and natural grayscale between the filled areas and the surrounding images. For example, Pan and Wang [53] adopted a multi-scale processing strategy that can automatically locate the positions of seam lines and transition areas and improve image quality after mosaicking. Gharibi and Habib [36] proposed a weighted averaging method to mitigate the seam line effects and spectral differences that may occur in true orthophoto mosaics.
Despite these previous efforts, the traditional TOM provides only 2D (XY) coordinates and building roof textures, while 3D attributes and building facade textures cannot be provided at all. For this reason, the generation of the T2OM is presented in this paper. The organization of this paper is as follows: the principle of T2OM generation is presented in Section 3; Section 4 presents the experimental results and analysis; and the conclusions are drawn in Section 5.

3. Principles of True2 Orthoimage Map (T2OM) Generation

The T2OM is defined as a DOM that provides measurable 3D (XYZ) coordinates and textures for both the roof and the facades of a building. This means that the T2OM not only has the traditional TOM characteristics, but also provides 3D geometric information (X, Y, Z) and the facade textures of a building. The method of generating a T2OM consists of the four basic steps below:
(1) DBM-based single-building T2OM generation, which consists of orthorectifying both the building roof and the building facades; a concept named "superpixel" is proposed for the storage of the building texture, building ID, and related information.
(2) DBM-based multiple-building T2OM generation: merging the DBM-based single-building T2OMs, including the organization of the building IDs, building façades, building corner coordinates, etc.
(3) DTM-based T2OM generation, for the orthorectification of gently and continuously elevated hilly areas.
(4) DTM- and DBM-based T2OM merging, for the creation of an entire T2OM.

3.1. Generation of a DBM-Based Single-Building T2OM

In order to clearly describe the process of generating a T2OM for buildings, a single building is first taken as an example (see Figure 2), presuming that the DBM for the single building and the exterior/interior orientation parameters (EOPs/IOPs) of an image are known. The steps for generating a T2OM for a single building are as follows.
Step 1: DBM-based building roof orthorectification, which consists of:
(1) Determining the size of the T2OM: The resulting DBM-based single-building T2OM is expressed as a raster image with pixels arranged in rows and columns. Since the resulting orthoimage is orthorectified from a raster image input (called the original image) using the DBM data, the size of the output image is defined [9] as

$$X_0 = \max\{\min\{X_D\}, \min\{X_I\}\}, \quad Y_0 = \max\{\min\{Y_D\}, \min\{Y_I\}\}$$
$$X_1 = \min\{\max\{X_D\}, \max\{X_I\}\}, \quad Y_1 = \min\{\max\{Y_D\}, \max\{Y_I\}\}$$

where X0 and Y0 are the coordinates of the lower-left corner of the output image; X1 and Y1 are the coordinates of the upper-right corner of the output image; XD and YD are the X and Y coordinates of the DBM; XI and YI are the X and Y coordinates of the original image; and max and min denote the maximum and minimum of the elements in the brackets. All coordinates here refer to the geodetic coordinate system required for the resulting T2OM.
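For clarity, the following is a minimal C++ sketch of this extent computation, assuming the DBM and image footprints are given as axis-aligned geodetic extents (the Extent type and function name are illustrative, not part of the original implementation):

```cpp
#include <algorithm>

// Axis-aligned geodetic extent (illustrative helper type).
struct Extent {
    double xMin, yMin, xMax, yMax;
};

// Output T2OM extent = intersection of the DBM footprint and the image
// footprint: (xMin, yMin) corresponds to (X0, Y0) and (xMax, yMax) to (X1, Y1).
Extent ComputeOutputExtent(const Extent& dbm, const Extent& image) {
    Extent out;
    out.xMin = std::max(dbm.xMin, image.xMin);  // X0
    out.yMin = std::max(dbm.yMin, image.yMin);  // Y0
    out.xMax = std::min(dbm.xMax, image.xMax);  // X1
    out.yMax = std::min(dbm.yMax, image.yMax);  // Y1
    return out;  // caller should verify xMin < xMax and yMin < yMax
}
```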
(2) Computing the X, Y coordinates of each pixel: In Figure 2, P(I, J) is a given pixel on the building roof in the T2OM, and its raster row and column can be transformed into the coordinates of the output T2OM, i.e.,

$$X = X_0 + I \times P_x, \quad Y = Y_0 + J \times P_y$$

where X and Y are the coordinates of the pixel; X0 and Y0 are the coordinates of the lower-left corner of the output roof T2OM; Px and Py are the pixel sizes in the X and Y directions, respectively; and I and J are the row and column of point P, respectively.
(3) Computing the Z coordinate of P(I, J): In order to perform orthorectification, we also need the Z coordinate of the pixel P(I, J) in the output roof T2OM, which is obtained from the DBM. However, the DBM data only have vector coordinates at corner points, so it is necessary to interpolate an elevation for each roof pixel of the building. As shown in Figure 3, the elevation (height) is available only for pixels containing corner points (blue pixels in Figure 3), while the other pixels (orange in Figure 3) are calculated by

$$h = a_0 j + a_1 i + a_2$$

where h is the raster height value; i and j are the row and column numbers of the raster cell; and a0, a1, and a2 are the plane coefficients defined through

$$A j + B i + C h + D = 0 \quad (C \neq 0)$$
$$h = -\frac{A}{C} j - \frac{B}{C} i - \frac{D}{C}$$
$$a_0 = -\frac{A}{C}, \quad a_1 = -\frac{B}{C}, \quad a_2 = -\frac{D}{C}$$

where the coefficients of each triangular roof facet can be calculated from the plane equation above when the three vertices of the triangle are known; A, B, C, and D are the plane coefficients.
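A minimal C++ sketch of this plane fit is given below, assuming the three roof corners are available in raster coordinates with known heights (the types and names are illustrative):

```cpp
#include <array>

// A roof corner in raster coordinates (j = column, i = row) with height h.
struct RasterCorner { double j, i, h; };

// Fit the plane A*j + B*i + C*h + D = 0 through three corners and return
// (a0, a1, a2) such that h = a0*j + a1*i + a2, i.e. a0 = -A/C, a1 = -B/C,
// a2 = -D/C.
std::array<double, 3> FitRoofPlane(const RasterCorner& p1,
                                   const RasterCorner& p2,
                                   const RasterCorner& p3) {
    // Plane normal (A, B, C) = (p2 - p1) x (p3 - p1) in (j, i, h) space.
    double A = (p2.i - p1.i) * (p3.h - p1.h) - (p2.h - p1.h) * (p3.i - p1.i);
    double B = (p2.h - p1.h) * (p3.j - p1.j) - (p2.j - p1.j) * (p3.h - p1.h);
    double C = (p2.j - p1.j) * (p3.i - p1.i) - (p2.i - p1.i) * (p3.j - p1.j);
    double D = -(A * p1.j + B * p1.i + C * p1.h);
    return { -A / C, -B / C, -D / C };  // requires C != 0 (non-vertical facet)
}

// Height of any interior roof pixel (j, i): h = a0*j + a1*i + a2.
inline double RoofHeight(const std::array<double, 3>& a, double j, double i) {
    return a[0] * j + a[1] * i + a[2];
}
```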
When the pixels on the roof of the building have an elevation, each pixel's geodetic coordinates must be converted back into row and column by

$$R_D = (X - X_{0D}) / P_D, \quad C_D = (Y - Y_{0D}) / P_D$$

where PD denotes the pixel size of the DBM image, and X0D and Y0D are the lower-left corner coordinates of the DBM image. RD and CD will generally not be exact integers, so an interpolation must be performed to determine Z. Usually, a bilinear interpolation of the following form is employed (see Figure 2c):

$$Z = \{ [Z_1 \Delta X + (1 - \Delta X) Z_4] + [Z_2 \Delta X + (1 - \Delta X) Z_3] + [Z_1 \Delta Y + (1 - \Delta Y) Z_4] + [Z_2 \Delta Y + (1 - \Delta Y) Z_3] \} / 4$$

where ΔX = RD − Rm and ΔY = CD − Cm, in which Rm is RD rounded down to the nearest integer and Cm is CD rounded down to the nearest integer. After this estimation, the coordinates (X, Y, Z) of the pixel are known.
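The following is a minimal C++ sketch of this interpolation, assuming Z1–Z4 are the four surrounding DBM grid heights (the function name is illustrative):

```cpp
#include <cmath>

// Bilinear-style height interpolation following the averaging form above:
// z1..z4 are the four surrounding DBM grid heights, and (rD, cD) are the
// fractional row/column of the pixel in the DBM grid.
double InterpolateZ(double z1, double z2, double z3, double z4,
                    double rD, double cD) {
    double dX = rD - std::floor(rD);  // ΔX = RD - Rm
    double dY = cD - std::floor(cD);  // ΔY = CD - Cm
    return ((z1 * dX + (1.0 - dX) * z4) + (z2 * dX + (1.0 - dX) * z3) +
            (z1 * dY + (1.0 - dY) * z4) + (z2 * dY + (1.0 - dY) * z3)) / 4.0;
}
```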
(4) Computing the corresponding coordinates in the original image: In order to orthorectify the source image, the coordinates in the source image corresponding to each output pixel are calculated by

$$x_I = x_{0I} - f \frac{a_1 (X - X_s) + b_1 (Y - Y_s) + c_1 (Z - Z_s)}{a_3 (X - X_s) + b_3 (Y - Y_s) + c_3 (Z - Z_s)}, \quad y_I = y_{0I} - f \frac{a_2 (X - X_s) + b_2 (Y - Y_s) + c_2 (Z - Z_s)}{a_3 (X - X_s) + b_3 (Y - Y_s) + c_3 (Z - Z_s)}$$

where xI and yI are the coordinates of the pixel P(X, Y) in the source image; x0I and y0I are the principal point coordinates; Xs, Ys, and Zs are the coordinates of the exposure station; f is the focal length; and ai = {a1, a2, a3}, bi = {b1, b2, b3}, and ci = {c1, c2, c3} are the elements of the rotation matrix, which are functions of the three exterior orientation angles (φ, ω, κ). These elements must be computed using at least three ground control points (GCPs).
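A minimal C++ sketch of this back-projection follows, assuming the rotation matrix is stored row-wise as (a1 a2 a3), (b1 b2 b3), (c1 c2 c3) (the types and names are illustrative):

```cpp
// Back-project a ground point (X, Y, Z) into the source image using the
// collinearity equations. (Xs, Ys, Zs) is the exposure station, f the focal
// length, and (x0, y0) the principal point.
struct ImagePoint { double x, y; };

ImagePoint GroundToImage(double X, double Y, double Z, const double R[3][3],
                         double Xs, double Ys, double Zs,
                         double f, double x0, double y0) {
    double dX = X - Xs, dY = Y - Ys, dZ = Z - Zs;
    double u = R[0][0] * dX + R[1][0] * dY + R[2][0] * dZ;  // a1, b1, c1 terms
    double v = R[0][1] * dX + R[1][1] * dY + R[2][1] * dZ;  // a2, b2, c2 terms
    double w = R[0][2] * dX + R[1][2] * dY + R[2][2] * dZ;  // a3, b3, c3 terms
    return { x0 - f * u / w, y0 - f * v / w };  // w must be non-zero
}
```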
(5) Assigning gray values to pixels: Since the grid of pixels in the source image rarely matches the grid of the output orthoimage, a resampling of the pixels has to be performed in order to assign gray values to the pixels of the output image. Nearest-neighbor resampling is employed because it transfers the original data values directly without averaging them. The computational procedure is illustrated in Figure 2.
(6) Storing the data for the DBM-based building roof T2OM: As can be seen from the above, the T2OM needs to store more information than the traditional TOM does, such as the building roof texture, facade texture, and façade Z coordinates. For this reason, the "superpixel" is introduced, with the following characteristics (see Figure 4): (1) it inherits the gray information of the original image; (2) the gray value, elevation, building ID, and facade texture index ID are stored; and (3) each pixel coordinate is directly interconnected with the building ID and façade texture ID.
The detailed descriptions of G, S, H, and ID in the superpixel are as follows:
(1) G(I, J) stands for the gray value at the I-th row and J-th column in the image coordinate system, ranging from 0 to 255.
(2) S(I, J) stands for the building corner coordinate subdivision grid identification value, which occupies 8 bits. That is, by dividing a single pixel into 256 subdivision cells, the accuracy of the vector-to-grid data conversion is improved. A given point P(XP, YP) is expressed as (ip, jp) after the vector-to-grid conversion, and the lost information is $(X_P - X_0) - i_p \Delta x$. With this information, Sx can be calculated as

$$S_x = \left\lfloor \frac{(X_P - X_0 - \Delta x \, i_p) \times 16}{\Delta x} \right\rfloor$$

where Δx is the image resolution, ⌊·⌋ is the rounding-down (floor) function, and (X0, Y0) is the top-left point of the image. Sy is calculated in the same way as Sx. S is thus expressed as two values, which makes its storage inconvenient; therefore, (Sx, Sy) is converted into a one-dimensional form by means of the Morton code [54], as sketched below.
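The following C++ sketch illustrates the subdivision computation and the Morton interleaving of the two 4-bit values into a single 8-bit S (function names are illustrative; the Morton scheme follows [54]):

```cpp
#include <cmath>
#include <cstdint>

// Subdivision value along one axis: the residual left after vector-to-grid
// conversion, quantized into 16 levels (4 bits). 'cell' is the integer grid
// index (e.g., ip) and 'resolution' the pixel size (Δx).
uint8_t SubdivisionValue(double p, double origin, double resolution, int cell) {
    return static_cast<uint8_t>(
        std::floor((p - origin - resolution * cell) * 16.0 / resolution));
}

// Interleave the 4-bit Sx and Sy into one 8-bit Morton code so the pair can
// be stored in the superpixel's single 8-bit S field.
uint8_t MortonEncode4(uint8_t sx, uint8_t sy) {
    uint8_t code = 0;
    for (int bit = 0; bit < 4; ++bit) {
        code |= static_cast<uint8_t>(((sx >> bit) & 1u) << (2 * bit));      // even bits
        code |= static_cast<uint8_t>(((sy >> bit) & 1u) << (2 * bit + 1));  // odd bits
    }
    return code;
}
```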
(3) H(I, J) stands for the storage of the building height or DTM height, quantized from the floating-point value to an elevation series (see Section 4.1).
(4) ID stands for the storage of the building ID, which can be used to retrieve the facade textures. A large city may have thousands of buildings; therefore, 12 bits are allocated, supporting IDs from 0 to 4095.
The T2OM data for the roof of the building generated by the above steps are stored in superpixels, as shown in Table 1. One possible in-memory layout is sketched below.
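A C++ bit-field layout for the 40-bit superpixel of Figure 4 might look as follows (an illustrative sketch; the on-disk packing is described in Section 4.1):

```cpp
#include <cstdint>

// One way to lay out the 40-bit superpixel: 8-bit gray, 8-bit Morton-coded
// subdivision, 12-bit elevation series, 12-bit building ID. The compiler pads
// the struct to 8 bytes in memory; the .fus file stores a packed binary form.
struct SuperPixel {
    uint64_t G  : 8;   // gray value, 0-255
    uint64_t S  : 8;   // Morton-coded corner subdivision value
    uint64_t H  : 12;  // elevation series, 0-4095
    uint64_t ID : 12;  // building ID, 0-4095 (0 assumed to mean "none")
};
```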
Step 2: DBM-based orthorectification for a single-building facade
In order to obtain the façade textures and 3D coordinates of a building, orthorectification of the building facades in four directions is also performed for the T2OM; the four directions are determined according to the minimum bounding box (a detailed description is given in [55]). The basic idea of building façade orthorectification is to orthorectify four directions: 0° for the front façade (Figure 5a), 90° for the left façade (Figure 5b), 180° for the back façade (Figure 5c), and 270° for the right façade (Figure 5d). For example, the collinearity equation for the 0° facade texture is (see Figure 6a)
$$y_g = -f \frac{a_1 (Y_G - Y_S) + b_1 (Z_G - Z_S) + c_1 (X_G - X_S)}{a_3 (Y_G - Y_S) + b_3 (Z_G - Z_S) + c_3 (X_G - X_S)} + y_0, \quad z_g = -f \frac{a_2 (Y_G - Y_S) + b_2 (Z_G - Z_S) + c_2 (X_G - X_S)}{a_3 (Y_G - Y_S) + b_3 (Z_G - Z_S) + c_3 (X_G - X_S)} + z_0$$
The facade texture in the 0° direction is orthorectified into the ZOY plane (see Figure 6a). Similarly, the facade textures in the 90°, 180°, and 270° directions are orthorectified into the ZOX (see Figure 6b), YOZ (see Figure 6c), and YOX (see Figure 6d) planes, respectively.
The orthorectification of a single building can be described in more detail as follows. As shown in Figure 7, the buildings in Figure 7a,b are 25.8 m and 46.4 m tall, respectively, with an elevation resolution of 0.2 m. The building facade in Figure 7a has four planes (a1b1, b1c1, c1d1, and d1a1), with corresponding texture index values of 65, 66, 67, and 68; the building facade in Figure 7b has four planes (a2b2, b2c2, c2d2, and d2a2), with corresponding texture index values of 809, 810, 811, and 812 (see Table 2 and Table 3).
The buildings in Figure 7a,b have 37 and 47 façade superpixels, respectively, and a superpixel is stored for each pixel as shown in Table 2 and Table 3. The corresponding pixels, textures, and elevations of each facade of the buildings in Figure 7 can thus be obtained; the detailed storage contents are shown in Table 4 and Table 5.

3.2. Generation for DBM-Based Multiple-Building T2OMs

There are usually many buildings in a city. Therefore, the next step is to generate a multiple-building T2OM on the basis of single-building T2OM generation. Part 2 in Figure 8 shows the generation process for multiple-building T2OMs, in which each building is assigned a unique identifier (BuildingID) that is used to control the display and hiding of each building (see Table 6). In addition, the information of a single-building model is divided into top surface, facade, and bottom surface, and each roof and facade is assigned a separate identifier (RoofID, WallID) (see Table 7 and Table 8). The face IDs are associated with the building top-surface table and the building wall table; the point IDs of each face (Points) are recorded in the roof and wall tables and are associated with the building corner point information table (see Table 9). The corner point information is expressed using multiple horizontal projection polygons, and each corner point carries 3D vector information in the corner point information table. Building textures are divided into top-surface textures and wall textures. The top-surface table does not need a separate top-surface texture identifier: its information is linked through the BuildingID in the 2D T2OM multiple-building data table, from which the top-surface texture values and 3D coordinate information of the building are obtained, and the real building top-surface texture is rendered pixel by pixel at the corresponding vector points. For the wall textures, the texture ID of each wall is associated with the texture data in the database, which records the texture name (TextureName), the address of the file on the computer (FileAddress), the texture data saved in binary (Binary), the texture format (Format), the texture size (Size) in bytes (Byte), and the date the texture was saved (Date) (see Table 10). Parts 1, 3, and 4 in Figure 8 are described in detail in Section 3.3 and Section 3.4. A sketch of this organization is given below.
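The following C++ structs give an illustrative in-memory mirror of Tables 6–10 (field names follow the text; this is a sketch, not the paper's actual schema):

```cpp
#include <cstdint>
#include <string>
#include <vector>

struct CornerPoint { uint32_t pointId; double x, y, z; };  // Table 9: 3D corner info

struct Roof {                      // Table 7: building top-surface table
    uint32_t roofId;
    std::vector<uint32_t> points;  // corner point IDs (Points)
};

struct Wall {                      // Table 8: building wall table
    uint32_t wallId;
    std::vector<uint32_t> points;  // corner point IDs (Points)
    uint32_t textureId;            // links to the texture table below
};

struct Texture {                   // Table 10: wall texture data
    uint32_t    textureId;
    std::string textureName;       // TextureName
    std::string fileAddress;       // FileAddress
    std::vector<uint8_t> binary;   // Binary: texture saved in binary
    std::string format;            // Format
    uint64_t    sizeBytes;         // Size, recorded in bytes (Byte)
    std::string date;              // Date the texture was saved
};

struct Building {                  // Table 6: multiple-building data
    uint32_t buildingId;           // BuildingID, controls display/hiding
    bool     visible;
    std::vector<Roof> roofs;
    std::vector<Wall> walls;
    std::vector<CornerPoint> corners;
};
```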

3.3. Generation of DTM-Based T2OM

Part 1 in Figure 8 is DTM-based T2OM generation, which orthorectifies the displacement caused by terrain elevation, i.e., rectifies the terrain into an upright position in a given map coordinate system. The digital differential orthorectification method is applied for this purpose; the details of this method can be found in [8,37]. The given DTM data, data structure, and per-pixel data storage of the DTM-based T2OM are similar to those of the DBM-based T2OM, but the building ID is assigned as "none".

3.4. Merging DTM- and DBM-Based T2OMs

In view of the different structures of the DBM- and DTM-based T2OMs, generating an entire T2OM requires merging algorithms (see Figure 8). To do this, a logical <or> operation is performed on the superpixel IDs (zero or non-zero). In order to eliminate possible boundary confusion during merging, the following rule is applied: with the DTM-based T2OM as the base map (see Part 1 in Figure 8), when the same grid cell appears in both the DTM- and DBM-based T2OMs, only the DBM-based T2OM is retained (see Part 2 in Figure 8). This is because the building area determined by the horizontal projection polygon is not a regular rectangle and is not divided along the grid direction, so there are actually more grid cells at the building boundary than at the real building boundary. Retaining the DBM-based T2OM grid ensures the accuracy of the building location to the greatest extent, which is helpful for the 3D T2OM display. The DTM-based and DBM-based T2OMs are merged to obtain the complete image (see Part 3 in Figure 8). Linking the DBM model and wall textures to the building data enables the display of building façade textures and the measurement of three-dimensional (3D) coordinates (XYZ) at any point (see Part 4 in Figure 8).
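A minimal C++ sketch of the merge rule follows, reusing the SuperPixel type from Section 3.1 and assuming both T2OMs are co-registered rasters of equal size (names are illustrative):

```cpp
#include <vector>

// The DTM-based T2OM is the base map; wherever a cell also carries a
// DBM-based superpixel (non-zero building ID), the DBM pixel is retained.
std::vector<SuperPixel> MergeT2OM(const std::vector<SuperPixel>& dtm,
                                  const std::vector<SuperPixel>& dbm) {
    std::vector<SuperPixel> out(dtm.size());
    for (std::size_t k = 0; k < dtm.size(); ++k)
        out[k] = (dbm[k].ID != 0) ? dbm[k] : dtm[k];  // keep DBM at buildings
    return out;
}
```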

4. Experiments and Analysis

Figure 9 is a flowchart of T2OM generation, divided into five parts. Part 1 shows the two experimental datasets (high-resolution images, control points, orientation parameters, DBM, and DTM) from Denver, CO, USA, and Nanning, China, which are used as input data. Part 2 shows DBM-based T2OM generation, which consists of orthorectifying both the building roofs and the building facades, with the "superpixel" used for the storage of building texture, building ID, etc. Part 3 shows DTM-based T2OM generation, which orthorectifies gently and continuously elevated hilly areas. Part 4 shows the merging of the DBM-based and DTM-based T2OMs into the output data, i.e., the T2OM. Part 5 shows the accuracy evaluation of the generated T2OM using ground control points.

4.1. Metadata of T2OM

The experiments were implemented in C++. To store the T2OM in binary form, a dedicated file format (.fus) had to be designed for storing superpixels, because conventional computer data types cannot hold the 40-bit superpixel directly. Table 11 shows the entire file format, which consists of a file flag block, image header information, and image pixel information; these three parts are written to the .fus file in binary form.
In the superpixel data structure, the elevation level H and the building identification ID each need 12 bits of memory. However, no built-in data type has this size, so the 12-bit values are saved by bit operations. As shown in Figure 10, the method first allocates two unsigned short (16-bit) variables, TEM_Height and TEM_Index, to record H and ID, respectively. The data are then packed into the variables T2OM_height, T2OM_hi, and T2OM_index through bit operations. Finally, the information in the superpixel is combined, as shown on the right side of the figure.
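Figure 10's exact bit layout is not reproduced here; the following C++ sketch shows one plausible packing consistent with the description, using the variable names from the text (the specific bit split is an assumption):

```cpp
#include <cstdint>

// Pack the two 12-bit values H and ID into three bytes: the upper 8 bits of
// H go to T2OM_height, the low 4 bits of H and the high 4 bits of ID share
// T2OM_hi, and the low 8 bits of ID go to T2OM_index. (Assumed bit split.)
void PackHeightId(unsigned short TEM_Height, unsigned short TEM_Index,
                  uint8_t& T2OM_height, uint8_t& T2OM_hi, uint8_t& T2OM_index) {
    T2OM_height = static_cast<uint8_t>((TEM_Height >> 4) & 0xFF);
    T2OM_hi     = static_cast<uint8_t>(((TEM_Height & 0x0F) << 4) |
                                       ((TEM_Index >> 8) & 0x0F));
    T2OM_index  = static_cast<uint8_t>(TEM_Index & 0xFF);
}
```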

4.2. T2OM Generation

In this section, Dataset 1 and Dataset 2 are used to describe the T2OM generation process and to verify the feasibility of the proposed method.

4.2.1. Experimental Result with Dataset 1

Experimental Dataset 1 includes digital terrain model data, aerial imagery, and digital building model data, briefly described as follows:
(1) DTM data: Figure 11a shows the DTM data from Denver, CO, USA, represented as a height map in which darker colors indicate lower elevations and vice versa. Because the topography of the city is relatively flat, the elevations shown are similar (the colors are similar). The accuracies of the horizontal and vertical coordinates are about 0.1 m and 0.2 m, respectively. The horizontal datum is GRS 1980, and the vertical datum is NAD83.
(2) Aerial image data: Figure 11b shows the original aerial image acquired with an RC30 aerial camera over Denver. The flight altitude was 1650 m above the mean ground elevation of the imaged area. The aerial photographs were initially recorded on film and then scanned into digital format at a pixel resolution of 25 μm.
(3) DBM data: Figure 11c shows the Denver DBM data; buildings were identified at a ground resolution of about 25.4 cm per pixel. Each building model contains building corner point information and facade texture information.
Step 1. Generation of DBM-based T2OM
(1)
DBM-based building roof orthorectification
DBM-based roof orthorectification corrects only the displacements caused by the buildings and does not take the displacements caused by the terrain into account. Therefore, the generated DBM-based T2OM (see Figure 12) corrects only the building textures and not the terrain textures, so the terrain appears black (background value). Figure 12a,c show two buildings in the T2OM; it can be seen that the roof textures of the buildings are obtained accurately.
(2)
DBM-based building façade orthorectification
In order to obtain the facade textures and 3D coordinates of the buildings, orthorectification of the building facade textures was also performed for the T2OM. The building facade textures for Dataset 1 were selected from an existing texture library, and the same textures were used in all four directions; the results of the facade texture correction are shown in Figure 13. Figure 13a,b show the facade texture correction results for the buildings in Figure 12a,c, respectively, and the angles marked in Figure 13 are consistent with those in Figure 12.
Step 2. Generation of DTM-based T2OM
The correction of buildings must be followed by the orthorectification of non-building areas. DTM-based differential correction of the non-building areas is performed to obtain the corrected image texture. At the same time, superpixels are generated by overlaying the corresponding data layers, and finally the DTM-based T2OM is obtained (see Figure 14). Because this step corrects only the terrain texture and not the building textures, the buildings appear black (background value). The DTM-based T2OM still contains ghosting and shadows in the textured areas because Dataset 1 lacks complementary images, so occlusion detection and compensation and shadow detection and compensation were not performed in this step.
Step 3. Merging DTM- and DBM-based T2OMs
Finally, merging the DBM-based and DTM-based T2OMs yields the result shown in Figure 15b, in which the superpixels of the roof textures exactly fill the areas that previously held the black background value. Figure 15a,c show enlarged views of two of the buildings.

4.2.2. Experimental Result with Dataset 2

Experimental Dataset 2 includes digital terrain model data, aerial imagery, and digital building model data, briefly described as follows.
(1) DTM data: Figure 16a shows the DTM data from Nanning, China, represented as a height map in which darker colors indicate lower elevations and vice versa. Because the topography of the city is relatively flat, the elevations shown are similar (the colors are similar). The accuracies of the horizontal and vertical coordinates are about 0.1 m and 0.2 m, respectively. The horizontal datum is GRS 1980, and the vertical datum is NAD83.
(2) Aerial image data: Figure 16b shows the original aerial image acquired with a CMOS camera over Nanning. The flight altitude was 200 m above the mean ground elevation of the imaged area.
(3) DBM data: Figure 16c shows the Nanning DBM data; buildings were identified at a ground resolution of about 25.4 cm per pixel. Each building model contains building corner point information and facade texture information.
Step 1. Generation of DBM-based T2OM
(1)
DBM-based building roof orthorectification
The DBM-based roof orthorectification corrects only the displacements caused by the buildings and does not take the displacements caused by the terrain into account. Therefore, the generated DBM-based T2OM (see Figure 17) corrects only the building textures and not the terrain textures, so the terrain appears black (background value). Four buildings of the T2OM are denoted (a–d); their roof textures are obtained accurately.
(2)
DBM-based building façade orthorectification
In order to obtain the building facade textures and 3D coordinates, the building facade textures are also orthorectified for the T2OM. The orthorectification results are shown in Figure 18. Figure 18a–d show the facade texture correction results for the buildings in Figure 17a–d, respectively, and the angles marked in Figure 18 are consistent with those in Figure 17.
Step 2. Generation of DTM-based T2OM
The correction of buildings must be followed by the orthorectification of non-building areas. DTM-based differential correction of the non-building areas is performed to obtain the corrected image texture. At the same time, superpixels are generated by overlaying the corresponding data layers, and finally the DTM-based T2OM is obtained (see Figure 19). Because this step corrects only the terrain texture and not the building textures, the buildings appear black (background value). The DTM-based T2OM contains ghosting and shadows in the textured regions; the occlusion detection and compensation and shadow detection and compensation operations adopted in this paper follow [9].
Step 3. Merging DTM- and DBM-based T2OMs
Finally, merging the DBM-based and DTM-based T2OMs yields the result shown in Figure 20, in which the superpixels of the roof textures exactly fill the areas that previously held the black background value. Figure 20a–d show enlarged views of four of the buildings.

4.2.3. T2OM 3D Measurement

With the 3D measurement function, the elevation of any point in the scene can be obtained. Figure 21 shows elevation information for a point on a selected facade. Figure 22 shows the color information, true 3D coordinates, and attribute information of each queried pixel. "Selective Hide" can also be used to display individual buildings, as shown in Figure 23. In addition, because the horizontal projection polygon of a building completely records the building information, complete corner point and facade information can be obtained through the "3D Building Information Display" function, as shown in Figure 24. The "Building Distance Measurement" function calculates the minimum and maximum distances between two buildings using the building corner information, as shown in Figure 25.
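The corner-based distance computation can be sketched in C++ as a brute-force search over the corner points of the two buildings (reusing the CornerPoint type from the earlier sketch; this is illustrative, not the paper's exact routine):

```cpp
#include <algorithm>
#include <cmath>
#include <limits>
#include <vector>

// Minimum and maximum horizontal distances between the footprint corner
// points of two buildings.
void CornerDistances(const std::vector<CornerPoint>& a,
                     const std::vector<CornerPoint>& b,
                     double& dMin, double& dMax) {
    dMin = std::numeric_limits<double>::max();
    dMax = 0.0;
    for (const CornerPoint& p : a)
        for (const CornerPoint& q : b) {
            double d = std::hypot(p.x - q.x, p.y - q.y);
            dMin = std::min(dMin, d);
            dMax = std::max(dMax, d);
        }
}
```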

4.3. Accuracy Evaluation and Analysis

In our method, the T2OM is obtained by merging the DTM- and DBM-based T2OMs, and errors can arise in multiple steps of the process. In addition, because the superpixel uses an elevation series instead of the true elevation value stored as a double, errors exist in the heights recorded in the superpixels. Based on the generated T2OM, this section evaluates the matching accuracy of the building horizontal projection polygons.
In order to ensure the accuracy of the building location information in the T2OM, it is necessary to evaluate the construction accuracy of the horizontal projection polygons. In Figure 26, grid cells detected as building edges are shown in blue, building corners in green, and the horizontal projection polygons in red. As can be seen from the enlarged views in Figure 26e–g, the extracted blue edges are completely consistent with the green building corners, which demonstrates that the accuracy of the horizontal projection polygon construction is sufficient.
Four buildings identified in the T2OM data were randomly selected to evaluate the accuracy of the information recorded in the T2OM, as shown in Figure 27. First, the roof corner points of each building were extracted from the DBM. Then, the coordinates of the corner points in the DBM were compared with the grid number, subdivision, and elevation series recorded in the superpixels. Table 12 presents the 3D coordinates before and after coding, where xori, yori, and zori are the 3D coordinates before coding; S(r,c) and H are the subdivision and elevation series, respectively; and Xbc, Ybc, and Zbc are the 3D coordinates calculated from the superpixels. The average errors of the X, Y, and Z coordinate components were 0.017, 0.025, and 0.09 m, respectively. Compared with the original data resolution of 0.1 m in plane and 0.2 m in elevation, it can be concluded that the superpixel coding method greatly reduces the error introduced when converting vector data into grid data.

4.4. Discussion

From the above two sets of experimental results, it is concluded that the proposed T2OM enables switching between a two-dimensional flat TOM display and a three-dimensional building display, and that using superpixels to store the three-dimensional information keeps the accuracy of 3D measurement within 0.0625 m.
In addition, expanding the traditional pixel storage method to increase the amount of information carried by a raster image proves feasible. In our method, a large number of heterogeneous data are first unified to achieve centralized management, which makes the reconstruction and display of a 3D model easier, because the superpixels accurately store elevation information at each location. However, the proposed method still has shortcomings. The main problem is that storing a T2OM requires twice as much memory as storing a TOM, which makes the data difficult to store and transfer, especially across large-scale urban areas. One solution is to compress the bit widths of the storage components, such as S, H, and ID, by statistical methods; for example, for H, the height distribution of the whole study area can be analyzed and the elevation series reduced from 4096 to 256 levels. In addition, in areas without buildings, the ID field could be removed to reduce memory consumption. Another non-negligible defect is that the description of the geometric structure of building facades is not sufficiently refined: facades contain balconies protruding from the walls and windows recessed into them, and the geometric correction and line extraction of these facade structures remains difficult at present [56].
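As an illustration of the suggested H compression, the 12-bit elevation series could be remapped to 8 bits using the observed height range of the study area (a hypothetical sketch, not part of the implementation):

```cpp
#include <cstdint>

// Remap a 12-bit elevation series value (0-4095) to 8 bits (0-255) using the
// minimum and maximum series values observed over the study area.
uint8_t CompressElevationSeries(uint16_t h12, uint16_t hMin, uint16_t hMax) {
    if (hMax <= hMin) return 0;  // degenerate or flat height range
    return static_cast<uint8_t>(
        (static_cast<uint32_t>(h12 - hMin) * 255u) / (hMax - hMin));
}
```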

5. Conclusions

In light of the problem that traditional DOMs/TOMs provide only the 2D (X, Y) coordinates and gray information of building roofs, and cannot provide 3D information or building facade textures at all, this paper proposes the generation of the T2OM, which is radically different from traditional DOM/TOM generation, since the T2OM provides three-dimensional (3D) coordinates and detailed textures of building roofs and facades.
The major innovation of this manuscript lies in the new T2OM generation method, in which a data structure is developed that simultaneously stores the 2D and 3D information of a building, its roof, and its façade. The proposed superpixel data structure takes the grid cell as the basic unit and successfully integrates a variety of data types by expanding the pixel storage space. The application of the subdivision value S and the elevation series H greatly improves the accuracy of the 3D model. The proposed superpixel model promotes the fusion of multi-source heterogeneous data, so that a single image can display both 2D plane information and the 3D real scene. Moreover, the superpixel model can be applied to facade texture images, so that 3D measurement of any point in a scene can be achieved. These contributions are valuable for large-scale urban DOM generation and applications.
Two sets of experimental results demonstrate that the proposed T2OM generation method not only maintains the traditional DOM/TOM characteristics, i.e., providing 2D XY coordinates and orthorectified building textures, but also provides the 3D XYZ coordinates of building roofs and facades. The accuracy of 3D measurement on a T2OM can reach 0.025 m (0.3 pixel).
Nevertheless, the proposed method needs improvement; for example, when the number of buildings in a city is large, loading all of the original texture data into memory for 3D display may occupy a large amount of memory and reduce the refresh speed. Therefore, loading only the visible area into memory and compressing the loaded data are needed to reduce memory occupancy and improve the refresh speed.

Author Contributions

Conceptualization, G.Z. and Q.W.; methodology, G.Z.; software, Y.H.; validation, Q.W., Y.H. and J.T.; formal analysis, H.L.; investigation, Y.W.; resources, Q.W.; data curation, Q.W.; writing—original draft preparation, Q.W.; writing—review and editing, G.Z.; visualization, Y.H.; supervision, Y.W.; project administration, G.Z.; funding acquisition, G.Z. and Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Project No. 41961065); the Guangxi Innovative Development Grand Program (Project Nos. Guike AD19254002, GuikeAA18118038, and GuikeAA18242048); the Guangxi Natural Science Foundation for Innovation Research Team (Project No. 2019GXNSFGA245001); the BaGui Scholars Program of Guangxi (Guoqing Zhou); the Innovation Project of Guangxi Graduate Education (Project No. YCBZ2021061); and the Guangxi Key Laboratory of Spatial Information and Geomatics Program (Project No. 19-050-11-14).

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the reviewers for their constructive comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Federal Geographic Data Committee. Fact Sheet: National Digital Geospatial Data Framework: A Status Report; Federal Geographic Data Committee: Reston, VA, USA, July 1997; 37p.
  2. Liu, Y.; Zheng, X.; Ai, G.; Zhang, Y.; Zuo, Y. Generating a High-Precision True Digital Orthophoto Map Based on UAV Images. ISPRS Int. J. Geo-Inf. 2018, 7, 333. [Google Scholar] [CrossRef]
  3. Maitra, J.B. The National Spatial Data Infrastructure in the United States: Standards; Metadata, Clearinghouse, and Data Access; Federal Geographic Data Committee c/o US Geological Survey: Reston, VA, USA, 1998. [Google Scholar]
  4. Yang, M.; Liu, J.; Zhang, Y.; Li, X. Design and Construction of Massive Digital Orthophoto Map Database in China. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, 41, 103–106. [Google Scholar] [CrossRef]
  5. Zhou, G. Onboard Processing for Satellite Remote Sensing Images; CRC Press: Boca Raton, FL, USA, 2022; ISBN 978-10-32-329642. [Google Scholar]
  6. Federal Geographic Data Committee. Development of a National Digital Geospatial Data Framework; Federal Geographic Data Committee: Reston, VA, USA, 1995. [CrossRef]
  7. Jamil, A.; Bayram, B. Tree Species Extraction and Land Use/Cover Classification From High-Resolution Digital Orthophoto Maps. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 11, 89–94. [Google Scholar] [CrossRef]
  8. Zhou, G.; Schickler, W.; Thorpe, A.; Song, P.; Chen, W.; Song, C. True orthoimage generation in urban areas with very tall buildings. Int. J. Remote Sens. 2004, 25, 5163–5180. [Google Scholar] [CrossRef]
  9. Zhou, G.; Chen, W.; Kelmelis, J.A.; Zhang, D. A comprehensive study on urban true orthorectification. IEEE Trans. Geosci. Remote Sens. 2005, 43, 2138–2147. [Google Scholar] [CrossRef]
  10. Amhar, F.; Jansa, J.; Ries, C. The generation of true orthophotos using a 3D building model in conjunction with a conventional DTM. Int. Arch. Photogramm. Remote Sens. 1998, 32, 16–22. [Google Scholar]
  11. Schickler, W.; Thorpe, A. Operational procedure for automatic true orthophoto generation. Int. Arch. Photogramm. Remote Sens. 1998, 32, 527–532. [Google Scholar]
  12. Di, K.; Jia, M.; Xin, X.; Wang, J.; Liu, B.; Li, J.; Xie, J.; Liu, Z.; Peng, M.; Yue, Z.; et al. High-Resolution Large-Area Digital Orthophoto Map Generation Using LROC NAC Images. Photogramm. Eng. Remote Sens. 2019, 85, 481–491. [Google Scholar] [CrossRef]
  13. Skarlatos, D. Orthophotograph Production in Urban Areas. Photogramm. Rec. 1999, 16, 643–650. [Google Scholar] [CrossRef]
  14. Zhou, G.; Li, H.; Song, R.; Wang, Q.; Xu, J.; Song, B. Orthorectification of Fisheye Image under Equidistant Projection Model. Remote Sens. 2022, 14, 4175. [Google Scholar] [CrossRef]
  15. Greenfeld, J. Evaluating the accuracy of digital orthophoto quadrangles (DOQ) in the context of parcel-based GIS. Photogramm. Eng. Remote Sens. 2001, 67, 199–206. [Google Scholar]
  16. Haggag, M.; Zahran, M.; Salah, M. Towards automated generation of true orthoimages for urban areas. Am. J. Geogr. Inf. Syst. 2018, 7, 67–74. [Google Scholar] [CrossRef]
  17. Mayr, W. True orthoimages. GIM Int. 2002, 37, 37–39. [Google Scholar]
  18. Rau, J.-Y.; Chen, N.-Y.; Chen, L.-C. True orthophoto generation of built-up areas using multi-view images. Photogramm. Eng. Remote Sens. 2002, 68, 581–588. [Google Scholar]
  19. Shoab, M.; Singh, V.K.; Ravibabu, M.V. High-Precise True Digital Orthoimage Generation and Accuracy Assessment based on UAV Images. J. Indian Soc. Remote Sens. 2021, 50, 613–622. [Google Scholar] [CrossRef]
  20. Jauregui, M.; Vílchez, J.; Chacón, L. A procedure for map updating using digital mono-plotting. Comput. Geosci. 2002, 28, 513–523. [Google Scholar] [CrossRef]
  21. Siachalou, S. Urban orthoimage analysis generated from IKONOS data. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2004, 35, 12–23. [Google Scholar]
  22. Biasion, A.; Dequal, S.; Lingua, A. A new procedure for the automatic production of true orthophotos. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2004, 35, 1682–1777. [Google Scholar]
  23. Shin, Y.H.; Lee, D.-C. True Orthoimage Generation Using Airborne LiDAR Data with Generative Adversarial Network-Based Deep Learning Model. J. Sensors 2021, 2021, 4304548. [Google Scholar] [CrossRef]
  24. Yao, J.; Zhang, Z.M. Hierarchical shadow detection for color aerial images. Comput. Vis. Image Underst. 2006, 102, 60–69. [Google Scholar] [CrossRef]
  25. Xie, W.; Zhou, G. Experimental realization of urban large-scale true orthoimage generation. In Proceedings of the ISPRS Congress, Beijing, China, 3–11 July 2008; pp. 3–11. [Google Scholar]
  26. Zhou, G.; Jezek, K.C. Satellite photograph mosaics of Greenland from the 1960s era. Int. J. Remote Sens. 2002, 23, 1143–1159. [Google Scholar] [CrossRef]
  27. Zhou, G.; Schickler, W. True orthoimage generation in extremely tall building urban areas. Int. J. Remote Sens. 2004, 25, 5161–5178. [Google Scholar] [CrossRef]
  28. Zhou, G. Near Real-Time Orthorectification and Mosaic of Small UAV Video Flow for Time-Critical Event Response. IEEE Trans. Geosci. Remote Sens. 2009, 47, 739–747. [Google Scholar] [CrossRef]
  29. Zhou, G.; Wang, Y.; Yue, T.; Ye, S.; Wang, W. Building occlusion detection from ghost images. IEEE Trans. Geosci. Remote Sens. 2016, 55, 1074–1084. [Google Scholar] [CrossRef]
  30. Zhang, R.; Liu, N.; Huang, J.; Zhou, X. On-Board Ortho-Rectification for Images Based on an FPGA. Remote Sens. 2017, 9, 874. [Google Scholar] [CrossRef]
  31. Zhou, G.; Zhang, R.; Zhang, D.; Huang, J.; Baysal, O. Real-time ortho-rectification for remote-sensing images. Int. J. Remote Sens. 2018, 40, 2451–2465. [Google Scholar] [CrossRef]
  32. Zhou, G.; Bao, X.; Ye, S.; Wang, H.; Yan, H. Selection of Optimal Building Facade Texture Images From UAV-Based Multiple Oblique Image Flows. IEEE Trans. Geosci. Remote Sens. 2020, 59, 1534–1552. [Google Scholar] [CrossRef]
  33. Jensen, L.B.; Per, S.; Nielsen; Alexander, T.; Mikkelsen, P.S. The Potential of the Technical University of Denmark in the Light of Sustainable Livable Cities. Des. Civ. Environ. Eng. 2014, 90. [Google Scholar] [CrossRef]
  34. Huang, X.; Zhang, L. Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery over Urban Areas. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2011, 5, 161–172. [Google Scholar] [CrossRef]
  35. Yu, B.; Wang, L.; Niu, Z. A novel algorithm in buildings/shadow detection based on Harris detector. Optik 2014, 125, 741–744. [Google Scholar] [CrossRef]
  36. Gharibi, H.; Habib, A. True Orthophoto Generation from Aerial Frame Images and LiDAR Data: An Update. Remote Sens. 2018, 10, 581. [Google Scholar] [CrossRef]
  37. Zhou, G. Urban High-Resolution Remote Sensing: Algorithms and Modeling; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar] [CrossRef]
  38. Liu, X.; Zhou, G.; Zhang, W.; Luo, S. Study on Local to Global Radiometric Balance for Remotely Sensed Imagery. Remote Sens. 2021, 13, 2068. [Google Scholar] [CrossRef]
  39. Wang, Q.; Zhou, G.; Song, R.; Xie, Y.; Luo, M.; Yue, T. Continuous space ant colony algorithm for automatic selection of orthophoto mosaic seamline network. ISPRS J. Photogramm. Remote Sens. 2022, 186, 201–217. [Google Scholar] [CrossRef]
  40. Vassilopoulou, S.; Hurni, L.; Dietrich, V.; Baltsavias, E.; Pateraki, M.; Lagios, E.; Parcharidis, I. Orthophoto generation using IKONOS imagery and high-resolution DEM: A case study on volcanic hazard monitoring of Nisyros Island (Greece). ISPRS J. Photogramm. Remote Sens. 2002, 57, 24–38. [Google Scholar] [CrossRef]
  41. Cameron, A.; Miller, D.; Ramsay, F.; Nikolaou, I.; Clarke, G. Temporal measurement of the loss of native pinewood in Scotland through the analysis of orthorectified aerial photographs. J. Environ. Manag. 2000, 58, 33–43. [Google Scholar] [CrossRef]
  42. Passini, R.; Jacobsen, K. Accuracy analysis of digital orthophotos from very high resolution imagery. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2004, 35 Pt B4, 695–700. [Google Scholar] [CrossRef]
  43. Piatti, E.J.; Lerma, J.L. Generation of True Ortho-Images Based On Virtual Worlds: Learning Aspects. Photogramm. Rec. 2014, 29, 49–67. [Google Scholar] [CrossRef]
  44. Yoo, E.J.; Lee, D.-C. True orthoimage generation by mutual recovery of occlusion areas. GIScience Remote Sens. 2015, 53, 227–246. [Google Scholar] [CrossRef]
  45. De Oliveira, H.C.; Dal Poz, A.P.; Galo, M.; Habib, A.F. Surface gradient approach for occlusion detection based on triangulated irregular network for true orthophoto generation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 443–457. [Google Scholar] [CrossRef]
  46. Zhou, G.; Sha, H. Building Shadow Detection on Ghost Images. Remote Sens. 2020, 12, 679. [Google Scholar] [CrossRef]
  47. Marsetič, A. Robust Automatic Generation of True Orthoimages from Very High-Resolution Panchromatic Satellite Imagery Based on Image Incidence Angle for Occlusion Detection. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 3733–3749. [Google Scholar] [CrossRef]
  48. Sheng, Y.; Gong, P.; Biging, G.S. True Orthoimage Production for Forested Areas from Large-Scale Aerial Photographs. Photogramm. Eng. Remote Sens. 2003, 69, 259–266. [Google Scholar] [CrossRef]
  49. Leone, A.; Distante, C. Shadow detection for moving objects based on texture analysis. Pattern Recognit. 2007, 40, 1222–1233. [Google Scholar] [CrossRef]
  50. Makarau, A.; Richter, R.; Muller, R.; Reinartz, P. Adaptive Shadow Detection Using a Blackbody Radiator Model. IEEE Trans. Geosci. Remote Sens. 2011, 49, 2049–2059. [Google Scholar] [CrossRef]
  51. Tiwari, S.; Chauhan, K.; Kurmi, Y. Shadow Detection and Compensation in Aerial Images using MATLAB. Int. J. Comput. Appl. 2015, 119, 5–9. [Google Scholar] [CrossRef]
  52. Li, D.; Wang, M.; Pan, J. Auto-dodging processing and its application for optical RS images. Geomat. Inf. Sci. Wuhan Univ. 2006, 31, 753–756. [Google Scholar]
  53. Pan, J.; Wang, M. A Multi-scale Radiometric Re-processing Approach for Color Composite DMC Images. Geomat. Infor. Sci. Wuhan Univ. 2007, 32, 800–803. [Google Scholar]
  54. Zhou, G.; Pan, Q.; Yue, T.; Wang, Q.; Sha, H.; Huang, S.; Liu, X. Vector and Raster Data Storage based on Morton Code. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2018, XLII-3, 2523–2526. [Google Scholar] [CrossRef]
  55. Chan, C.; Tan, S. Determination of the minimum bounding box of an arbitrary solid: An iterative approach. Comput. Struct. 2001, 79, 1433–1449. [Google Scholar] [CrossRef]
  56. Fan, H.; Wang, Y.; Gong, J. Layout graph model for semantic façade reconstruction using laser point clouds. Geo Spatial Inf. Sci. 2021, 24, 403–421. [Google Scholar] [CrossRef]
Figure 1. (a) Terrestrial textures visible in a flat and hilly area when the TOM is superimposed on the DTM (http://www.pcvr.com.cn/html/software/softwarei.html (accessed on 5 August 2022)); (b) building façade textures not visible when the D/TOM is superimposed on the DBM.
Figure 2. The procedure of DBM-based building roof orthorectification. (a) Rectified T2OM; (b) original image; (c) resampling; (d) DBM-based pixel.
Figure 3. Assigning an elevation value to the building's roof. (a) Before the elevation values are filled; (b) after the elevation values are filled, where a, b, c, and d represent the corners of the DBM.
Figure 4. A superpixel occupies a total of 40 bits in the computer, where I and J represent the row and column; G(I, J) denotes the gray value; S(I, J) denotes the corner coordinate subdivision grid identification value; H(I, J) denotes the elevation; and ID(I, J) represents the identification of a building.
Figure 5. True three-dimensional (360°) full-circle T2OM generation. (a) Orthorectification of a building facade in the 270° direction; (b) orthorectification of a building facade in the 180° direction; (c) orthorectification of a building facade in the 90° direction; (d) orthorectification of a building facade in the 0° direction; (e) orthorectification of the building's roof; (f) the generated T2OM for a building after full-circle (360°) orthorectification; (g) explanation of the superpixel data structure.
Figure 6. Orthorectification method for the building facade in four different directions. (a) Orthorectification of a building facade in the 0° direction; (b) orthorectification of a building facade in the 90° direction; (c) orthorectification of a building facade in the 180° direction; (d) orthorectification of a building facade in the 270° direction.
Figure 6. Orthorectification method for the building facade through 4 different directions. (a) Orthorectification for a building facade in 0° direction; (b) orthorectification for a building facade in 90° direction; (c) orthorectification for a building facade in 180° direction; (d) orthorectification for a building facade in 270° direction.
Remotesensing 14 04396 g006
Figure 7. Building 3D spaghetti data structure, where TID represents the wall texture ID (e.g., TID = 809); a1, b1, c1, d1, a2, b2, c2, and d2 represent the buildings’ roof corners. (a) 3D data structure of a building with a height of 25.8 m; (b) 3D data structure of a building with a height of 46.4 m.
Figure 8. DBM-based T2OM generation for multiple buildings, and the merging of the DTM-based and DBM-based T2OMs.
Figure 9. The proposed flowchart for T2OM generation.
Figure 10. Saving pixel information by bit manipulation.
Figure 11. The experimental Dataset 1. (a) DTM data, where areas ① and ② are areas without elevation data; (b) Aerial image data; (c) DBM data.
Figure 12. Generation results of the DBM-based T2OM on Dataset 1. (b) DBM-based T2OM; (a,c) are enlarged windows of the two marked regions.
Figure 13. Façade texture orthorectification results. (a,b) are the orthorectification results of the building façade textures in Figure 12a,c.
Figure 14. Generation results of the DTM-based T2OM on Dataset 1. (b) DTM-based T2OM; (a,c) are enlarged windows of the two marked regions.
Figure 15. Merging the DTM-based and DBM-based T2OMs on Dataset 1. (b) T2OM; (a,c) are enlarged windows of the two marked regions.
Figure 16. The experimental Dataset 2. (a) DTM data; (b) Aerial image data; (c) DBM data.
Figure 17. Generation results of the DBM-based T2OM on Dataset 2. (a–d) are enlarged windows of the four marked regions.
Figure 18. Façade texture orthorectification results. (a–d) are the orthorectification results of the building façade textures in Figure 17a–d.
Figure 19. Generation results of the DTM-based T2OM on Dataset 2; (a–d) are enlarged windows of the four marked regions.
Figure 20. Merging the DTM-based and DBM-based T2OMs on Dataset 2; (a–d) are enlarged windows of the four marked regions.
Figure 21. Measuring the elevation of any point on the façade.
Figure 22. View of the superpixel information.
Figure 23. Selectively showing or hiding information.
Figure 24. Viewing information about the building.
Figure 25. Measuring the maximum and minimum distances between two buildings.
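The measurement in Figure 25 can be reproduced from footprint corner points alone: the maximum distance between two buildings is attained at a pair of corners, while the minimum distance may fall on a corner-to-edge pair. A minimal C sketch under that assumption, using hypothetical rectangular footprints:

```c
#include <math.h>
#include <stdio.h>

typedef struct { double x, y; } Pt;

/* Distance from point p to segment ab. */
static double pt_seg(Pt p, Pt a, Pt b)
{
    double vx = b.x - a.x, vy = b.y - a.y;
    double wx = p.x - a.x, wy = p.y - a.y;
    double c2 = vx * vx + vy * vy;
    double t  = (c2 > 0.0) ? fmax(0.0, fmin(1.0, (vx * wx + vy * wy) / c2)) : 0.0;
    double dx = p.x - (a.x + t * vx), dy = p.y - (a.y + t * vy);
    return sqrt(dx * dx + dy * dy);
}

/* Min corner-to-edge and max corner-to-corner distance between two
   closed building footprints given as corner rings. */
static void footprint_dist(const Pt *A, int na, const Pt *B, int nb,
                           double *dmin, double *dmax)
{
    *dmin = 1e300; *dmax = 0.0;
    for (int i = 0; i < na; i++) {
        for (int j = 0; j < nb; j++) {
            double d = hypot(A[i].x - B[j].x, A[i].y - B[j].y);
            if (d > *dmax) *dmax = d;
            double e1 = pt_seg(A[i], B[j], B[(j + 1) % nb]);
            double e2 = pt_seg(B[j], A[i], A[(i + 1) % na]);
            if (e1 < *dmin) *dmin = e1;
            if (e2 < *dmin) *dmin = e2;
        }
    }
}

int main(void)
{
    /* Hypothetical rectangular footprints (meters). */
    Pt A[] = {{0, 0}, {10, 0}, {10, 8}, {0, 8}};
    Pt B[] = {{18, 3}, {26, 3}, {26, 12}, {18, 12}};
    double dmin, dmax;
    footprint_dist(A, 4, B, 4, &dmin, &dmax);
    printf("min = %.3f m, max = %.3f m\n", dmin, dmax);
    return 0;
}
```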
Figure 26. Acquisition of horizontal projection polygon corner points: (a) building model consisting of two voxels; (b) graph of the corner point detection results; (c) building model consisting of a single voxel; (d–f) local enlargements of (a); (g) local enlargement of (c).
Figure 27. Analysis of T2OM generation results. (a) Output orthophoto; (b) orthophoto superimposed on the DBM; (c) extracted roof texture; (d) red areas representing the superpixels generated with the DBM, where a1–a4, b1–b4, c1–c4, and d1–d4 each mark a region at the same geographical position across (a–d).
Table 1. The superpixel data for the DBM-based building roof texture.

DBM-Based Pixel | Row | Column | Gray Value | Sub Ordinal | Height | BuildingID
BP1 | I(BP1) | J(BP1) | G1 | id(BP1)(r,c) | h(BP1) | B1
BP2 | I(BP2) | J(BP2) | G2 | 0 | h(BP2) | B1
BPi | I(BPi) | J(BPi) | Gi | id(BPi)(r,c) | h(BPi) | B1
BPi+1 | I(BPi+1) | J(BPi+1) | Gi+1 | 0 | h(BPi+1) | B1
…
Table 2. The building wall texture superpixel data in Figure 7a.

Pixel | Gray (8 bit) | Height (H, 12 bit) | TextureID (TID, 12 bit) | Notes
1 | 11000110 | 000100000010 | 000001000001 | Gray = 198; Wall 1: H = 25.8, TID = 65
2 | 11010011 | 000100000010 | 000001000001 | Gray = 211
3 | 11010000 | 000100000010 | 000001000001 | Gray = 208
…
8 | 11010011 | 000100000010 | 000001000001 | Gray = 211
9 | 11001000 | 000100000010 | 000001000010 | Gray = 200; Wall 2: H = 25.8, TID = 66
10 | 11101000 | 000100000010 | 000001000010 | Gray = 232
…
18 | 11011101 | 000100000010 | 000001000010 | Gray = 221
19 | 11011101 | 000100000010 | 000001000011 | Gray = 221; Wall 3: H = 25.8, TID = 67
20 | 11011000 | 000100000010 | 000001000011 | Gray = 216
…
27 | 11011000 | 000100000010 | 000001000011 | Gray = 216
28 | 11010001 | 000100000010 | 000001000100 | Gray = 209; Wall 4: H = 25.8, TID = 68
29 | 11011010 | 000100000010 | 000001000100 | Gray = 218
…
37 | 11011010 | 000100000010 | 000001000100 | Gray = 218
Table 3. The building façade texture superpixel data in Figure 7b.

Pixel | Gray (8 bit) | Height (H, 12 bit) | TextureID (TID, 12 bit) | Notes
1 | 11010101 | 000111010000 | 001100101001 | Gray = 213; Wall 1: H = 46.4, TID = 809
2 | 11111101 | 000111010000 | 001100101001 | Gray = 253
3 | 11110001 | 000111010000 | 001100101001 | Gray = 241
…
9 | 11010100 | 000111010000 | 001100101001 | Gray = 212
10 | 11010111 | 000111010000 | 001100101010 | Gray = 215; Wall 2: H = 46.4, TID = 810
11 | 11010101 | 000111010000 | 001100101010 | Gray = 213
…
22 | 11010101 | 000111010000 | 001100101010 | Gray = 213
23 | 11011111 | 000111010000 | 001100101011 | Gray = 223; Wall 3: H = 46.4, TID = 811
24 | 11010111 | 000111010000 | 001100101011 | Gray = 215
…
30 | 11010111 | 000111010000 | 001100101011 | Gray = 215
31 | 11010101 | 000111010000 | 001100101100 | Gray = 213; Wall 4: H = 46.4, TID = 812
32 | 11010111 | 000111010000 | 001100101100 | Gray = 215
…
47 | 11010101 | 000111010000 | 001100101100 | Gray = 213
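The 32-bit strings in Tables 2 and 3 split mechanically: the high 8 bits are the gray value, the next 12 bits the elevation level, and the low 12 bits the texture ID. A minimal C decoder, assuming the 0.1 m elevation level implied by the table values (258 → 25.8 m, 464 → 46.4 m):

```c
#include <stdint.h>
#include <stdio.h>

/* Split a 32-bit wall-texture superpixel word into its fields:
   gray (8 bit) | height level H (12 bit) | texture ID TID (12 bit). */
static void decode_wall_pixel(uint32_t w)
{
    unsigned gray = (w >> 24) & 0xFF;
    unsigned h    = (w >> 12) & 0xFFF;
    unsigned tid  =  w        & 0xFFF;
    /* A 0.1 m elevation level is assumed from the table values. */
    printf("Gray = %u, H = %.1f m, TID = %u\n", gray, h / 10.0, tid);
}

int main(void)
{
    /* Pixel 1 of Table 2: 11000110 000100000010 000001000001 */
    decode_wall_pixel(0xC6102041u);  /* -> Gray = 198, H = 25.8, TID = 65 */
    /* Pixel 1 of Table 3: 11010101 000111010000 001100101001 */
    decode_wall_pixel(0xD51D0329u);  /* -> Gray = 213, H = 46.4, TID = 809 */
    return 0;
}
```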
Table 4. The building wall texture index in Figure 7a.

WallID | Wall Pixel Index | Texture Index | Height
a1b1 | 1, 2, 3, 4, 5, 6, 7, 8 | 65 | 25.8
b1c1 | 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 | 66 | 25.8
c1d1 | 19, 20, 21, 22, 23, 24, 25, 26, 27 | 67 | 25.8
d1a1 | 28, 29, 30, 31, 32, 33, 34, 35, 36, 37 | 68 | 25.8
Table 5. The building wall texture index in Figure 7b.

WallID | Wall Pixel Index | Texture Index | Height
a2b2 | 1, 2, 3, 4, 5, 6, 7, 8, 9 | 809 | 46.4
b2c2 | 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 | 810 | 46.4
c2d2 | 23, 24, 25, 26, 27, 28, 29, 30 | 811 | 46.4
d2a2 | 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47 | 812 | 46.4
Table 6. Building relationships.

BuildingID | Type | RoofID | WallID | Properties | Others
B1 | Volume | R11 | W11, W12, W13 | Brick structure |
B2 | Volume | R12 | W21, W22, W23 | Reinforced concrete structure |
B3 | Volume | R13 | W31, W32, W33 | Steel structure |
Table 7. The relationship of building roof textures.

RoofID | Type | TextureID | PointID | Others
R11 | Polygon | TR11 | PR11, PR12, PR13 |
R12 | Polygon | TR12 | PR21, PR22, PR23 |
R13 | Polygon | TR13 | PR31, PR32, PR33 |
Table 8. The relationship of building wall textures.

WallID | Type | TextureID | PointID | Others
W11 | Polygon | TW11 | Pw11, Pw12, Pw13 |
W12 | Polygon | TW12 | Pw21, Pw22, Pw23 |
W13 | Polygon | TW13 | Pw31, Pw32, Pw33 |
Table 9. Building vertex relationships.

PointID | X, Y, Z Coord. | Pixel Coord. | Others
Pw11 | XW11, YW11, ZW11 | IW11, JW11 |
Pw12 | XW12, YW12, ZW12 | IW12, JW12 |
Pw13 | XW13, YW13, ZW13 | IW13, JW13 |
PR11 | XR11, YR11, ZR11 | IR11, JR11 |
PR12 | XR12, YR12, ZR12 | IR12, JR12 |
PR13 | XR13, YR13, ZR13 | IR13, JR13 |
Table 10. Data texture table.

TextureID | TextureName | FileAddress | Date | Format | Others
TW11 | WTN11 | WFA11 | WD11 | WF11 |
TW12 | WTN12 | WFA12 | WD12 | WF12 |
TW13 | WTN13 | WFA13 | WD13 | WF13 |
TR11 | PTN11 | PFA11 | PD11 | PF11 |
TR12 | PTN12 | PFA12 | PD12 | PF12 |
TR13 | PTN13 | PFA13 | PD13 | PF13 |
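Tables 6–10 form a small relational schema: buildings reference roofs and walls, roofs and walls reference textures and corner points, and textures reference the files holding the image patches. The C sketch below shows how the links can be traversed; all names, string sizes, and the in-memory layout are illustrative assumptions, not the paper’s implementation.

```c
#include <stdio.h>
#include <string.h>

/* Illustrative in-memory mirror of Tables 6-10. */
typedef struct { char id[8]; double x, y, z; int i, j; } Point;      /* Table 9  */
typedef struct { char id[8]; char name[16]; char file[64]; } Texture; /* Table 10 */
typedef struct { char id[8]; char tex[8]; char pts[3][8]; } Face;     /* Tables 7/8 */
typedef struct { char id[8]; char roof[8]; char walls[3][8];
                 char props[40]; } Building;                          /* Table 6  */

static const Texture *find_tex(const Texture *t, int n, const char *id)
{
    for (int k = 0; k < n; k++)
        if (!strcmp(t[k].id, id)) return &t[k];
    return NULL;
}

int main(void)
{
    /* Hypothetical records using the placeholder IDs of Tables 6-10. */
    Texture textures[] = {{"TW11", "WTN11", "/tex/WFA11.jpg"}};
    Face wall = {"W11", "TW11", {"Pw11", "Pw12", "Pw13"}};
    Building b1 = {"B1", "R11", {"W11", "W12", "W13"}, "Brick structure"};

    /* Walk Building -> Wall -> Texture -> file address. */
    const Texture *t = find_tex(textures, 1, wall.tex);
    printf("%s (%s): wall %s uses texture %s stored at %s\n",
           b1.id, b1.props, wall.id, t ? t->id : "?", t ? t->file : "?");
    return 0;
}
```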
Table 11. The file format of the T2OM fus.

File Section | Properties | Description
File flag block | m_FileProperty | Identifier “fus” (char type)
File flag block | m_Version | Version number (int type)
Image header information | m_UpleftCoordinateX | Upper-left X coordinate of the image (double type, in meters)
Image header information | m_UpleftCoordinateY | Upper-left Y coordinate of the image (double type, in meters)
Image header information | m_TMaxZ | The highest point in the DTM file (double type, in meters)
Image header information | m_TMinZ | The lowest point in the DTM file (double type, in meters)
Image header information | m_BMaxZ | Maximum building height in the DBM (double type, in meters)
Image header information | m_BMinZ | Minimum building height in the DBM (double type, in meters)
Image header information | m_IntervalX | Unit interval in the X-axis direction (double type, in meters)
Image header information | m_IntervalY | Unit interval in the Y-axis direction (double type, in meters)
Image header information | m_FileHigh | Image height (int type)
Image header information | m_FileWidth | Image width (int type)
Image header information | Z_Tresolution | Topographic data unit elevation level (double type, in meters)
Image header information | Z_Bresolution | Building data unit elevation level (double type, in meters)
Image header information | Build_Num | Number of building objects (int type)
Image pixel information | T2OM_Grey | Pixel gray component (unsigned char type)
Image pixel information | T2OM_Ordinal | Subdivision grid ordinal (unsigned char type)
Image pixel information | T2OM_Height | Elevation level, high 8 bits (unsigned char type)
Image pixel information | T2OM_HI | Elevation level low 4 bits and marker high 4 bits (unsigned char type)
Image pixel information | T2OM_Index | Marker data, low 8 bits (unsigned char type)
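Read back as C, Table 11 might be mirrored by the declarations below. The field order follows the table and the per-pixel record reproduces the 40-bit superpixel, but the packing, byte order, and exact on-disk layout are assumptions of this sketch rather than the published specification.

```c
#include <stdint.h>
#include <stdio.h>

#pragma pack(push, 1)
/* Header of a T2OM "fus" file, mirroring Table 11. */
typedef struct {
    char    m_FileProperty[3];   /* identifier "fus"                    */
    int32_t m_Version;           /* version number                      */
    double  m_UpleftCoordinateX; /* upper-left X of the image (m)       */
    double  m_UpleftCoordinateY; /* upper-left Y of the image (m)       */
    double  m_TMaxZ, m_TMinZ;    /* highest/lowest point in the DTM (m) */
    double  m_BMaxZ, m_BMinZ;    /* max/min building height in DBM (m)  */
    double  m_IntervalX;         /* unit interval along X (m)           */
    double  m_IntervalY;         /* unit interval along Y (m)           */
    int32_t m_FileHigh;          /* image height (rows)                 */
    int32_t m_FileWidth;         /* image width (columns)               */
    double  Z_Tresolution;       /* terrain unit elevation level (m)    */
    double  Z_Bresolution;       /* building unit elevation level (m)   */
    int32_t Build_Num;           /* number of building objects          */
} FusHeader;

/* Per-pixel record: the five 8-bit fields of Table 11 (40 bits total). */
typedef struct {
    uint8_t T2OM_Grey;     /* gray component                           */
    uint8_t T2OM_Ordinal;  /* subdivision grid ordinal                 */
    uint8_t T2OM_Height;   /* elevation level, high 8 bits             */
    uint8_t T2OM_HI;       /* elevation low 4 bits + marker high 4 bits */
    uint8_t T2OM_Index;    /* marker data, low 8 bits                  */
} FusPixel;
#pragma pack(pop)

int main(void)
{
    printf("header: %zu bytes, pixel record: %zu bytes (40 bits)\n",
           sizeof(FusHeader), sizeof(FusPixel));
    return 0;
}
```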
Table 12. 3D coordinate values before and after encoding.

Building | Xori | Yori | Zori | id(r,c) | h | Xbc | Ybc | Zbc
a | 1286.850 | 1306.000 | 5551.700 | 14 | 1973 | 1286.840 | 1306.030 | 5551.600
a | 1346.850 | 1245.000 | 5551.700 | 14 | 1973 | 1346.840 | 1245.030 | 5551.600
a | 1355.000 | 1237.975 | 5551.700 | 241 | 1973 | 1355.030 | 1237.970 | 5551.600
a | 1356.000 | 1237.850 | 5551.700 | 209 | 1973 | 1356.030 | 1237.840 | 5551.600
b | 1931.937 | 1424.000 | 5533.400 | 15 | 1876 | 1931.910 | 1424.030 | 5533.330
b | 1932.150 | 1440.000 | 5533.400 | 3 | 1876 | 1932.160 | 1440.030 | 5533.330
b | 1910.850 | 1462.000 | 5533.400 | 14 | 1876 | 1910.840 | 1462.030 | 5533.330
b | 1906.000 | 1464.150 | 5533.400 | 33 | 1876 | 1906.030 | 1464.160 | 5533.330
c | 2990.850 | 1244.000 | 5462.500 | 14 | 1499 | 2990.840 | 1244.030 | 5462.330
c | 2912.850 | 1151.000 | 5462.500 | 14 | 1499 | 2912.840 | 1151.030 | 5462.330
c | 2911.850 | 1148.000 | 5462.500 | 14 | 1499 | 2911.840 | 1148.030 | 5462.330
c | 2964.850 | 1093.000 | 5462.500 | 14 | 1499 | 2964.840 | 1093.030 | 5462.330
d | 3339.000 | 1882.850 | 5755.800 | 209 | 3057 | 3339.030 | 1882.840 | 5755.760
d | 3341.096 | 1883.000 | 5755.800 | 2 | 3057 | 3341.090 | 1883.030 | 5755.760
d | 3344.850 | 1881.000 | 5755.800 | 14 | 3057 | 3344.840 | 1881.030 | 5755.760
d | 3356.054 | 1864.000 | 5755.800 | 1 | 3057 | 3356.030 | 1864.030 | 5755.760
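Table 12 illustrates the loss introduced by encoding: each corner (Xori, Yori, Zori) is stored as a pixel position plus a subdivision ordinal id(r,c) and an integer elevation level h, and decoding yields the slightly quantized (Xbc, Ybc, Zbc). The C sketch below shows such a round trip; the constants (0.09 m pixel, 3 × 3 subdivision, 0.1 m elevation level, and the reference elevation) are assumptions for illustration only, so the printed values do not reproduce the table’s id and h entries exactly.

```c
#include <math.h>
#include <stdio.h>

/* Quantization round trip behind Table 12 (illustrative).
   All constants below are assumptions of this sketch. */
#define INTERVAL 0.09   /* pixel size in meters (assumed)            */
#define NSUB     3      /* 3x3 subdivision grid per pixel (assumed)  */
#define ZRES     0.10   /* unit elevation level in meters (assumed)  */
#define ZMIN     5354.0 /* hypothetical reference elevation (m_BMinZ) */

/* Encode: coordinate -> (pixel column, subdivision cell); elevation -> level. */
static void encode(double x, double z, long *J, int *c, long *h)
{
    double cells = x / (INTERVAL / NSUB);   /* position in 0.03 m cells */
    long   cell  = (long)floor(cells);
    *J = cell / NSUB;                       /* pixel column              */
    *c = (int)(cell % NSUB);                /* cell ordinal inside pixel */
    *h = lround((z - ZMIN) / ZRES);         /* integer elevation level   */
}

/* Decode: reconstruct the (slightly quantized) coordinate and elevation. */
static void decode(long J, int c, long h, double *xbc, double *zbc)
{
    *xbc = (J * NSUB + c + 0.5) * (INTERVAL / NSUB); /* cell center */
    *zbc = ZMIN + h * ZRES;
}

int main(void)
{
    long J, h; int c; double xbc, zbc;
    encode(1286.850, 5551.700, &J, &c, &h);
    decode(J, c, h, &xbc, &zbc);
    /* The round-trip error stays below one subdivision cell (0.03 m)
       in X/Y and below one elevation level (0.1 m) in Z. */
    printf("x: %.3f -> %.3f, z: %.3f -> %.3f (h=%ld)\n",
           1286.850, xbc, 5551.700, zbc, h);
    return 0;
}
```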
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
