Article

A Point Cloud Data-Driven Pallet Pose Estimation Method Using an Active Binocular Vision Sensor

1 College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
2 Noblelift Intelligent Equipment Co., Ltd., Huzhou 313100, China
3 Ningbo Fujia Industrial Co., Ltd., Ningbo 330200, China
* Author to whom correspondence should be addressed.
Sensors 2023, 23(3), 1217; https://doi.org/10.3390/s23031217
Submission received: 29 December 2022 / Revised: 17 January 2023 / Accepted: 18 January 2023 / Published: 20 January 2023
(This article belongs to the Special Issue Applications of Manufacturing and Measurement Sensors)

Abstract

Pallet pose estimation is one of the key technologies for the automated fork pickup of driverless industrial trucks. Owing to complex working environments and the enormous amount of data, existing pose estimation approaches cannot meet the requirements of intelligent logistics equipment in terms of accuracy and real-time performance. A point cloud data-driven pallet pose estimation method using an active binocular vision sensor is proposed, which consists of point cloud preprocessing, Adaptive Gaussian Weight-based Fast Point Feature Histogram (AGWF) extraction and point cloud registration. The proposed method overcomes the shortcomings of traditional pose estimation methods, such as poor robustness, high time consumption and low accuracy, and realizes efficient and accurate pallet pose estimation for driverless industrial trucks. Experimental results show that the proposed approach is superior to the traditional Fast Point Feature Histogram (FPFH) and Signature of Histograms of Orientations (SHOT) descriptors, improving the accuracy by over 35% and reducing the feature extraction time by over 30%, thereby verifying its effectiveness and superiority.

1. Introduction

The application of intelligent logistics equipment has played a key role in the transformation and upgrading of the manufacturing industry in recent years. Driverless industrial trucks are a common type of intelligent logistics equipment [1] and are widely used in warehousing, production, medical treatment, the service industry and other fields. They realize automatic material handling, improve production efficiency and lower production costs for intelligent logistics systems [2]. However, as shown in Figure 1, factors such as obstacles, uneven lighting and human intervention cause a deviation between the actual pose of the pallet and its correct pose. As a result, collisions and incomplete fork engagement can occur when driverless industrial trucks fork pallets, leading to dumping, damage and safety problems. Therefore, it is necessary to upgrade traditional driverless industrial trucks and introduce external industrial vision sensors [3] to achieve adaptive and automatic production.
In order to solve the problem shown in Figure 1, based on the data collected by the vision sensor, it is necessary to estimate the pose of the pallet when the driverless industrial truck is driven to a certain distance in front of the pallet, so as to correct the pose deviation of the pallet and ensure the efficiency and safety of the logistics process. There are two main kinds of target pose estimation methods: LIDAR based and vision based. The LIDAR-based methods are mature and have high estimation accuracy. Baglivo et al. [4] proposed an efficient scheme, which combined a laser range-based object localization approach with PC-Sliding. Mohamed et al. [5] presented a novel architecture allowing a robot to detect, localize and track pallets using machine learning techniques based on an onboard 2D laser rangefinder. Zhang et al. [6] improved the matching degree of multi-modal features of the target and achieved accurate target pose estimation by fusing the range of view, aerial view and RGB view of the LIDAR.
However, due to the problems of high cost, limited range and difficulty in eliminating cumulative errors in LIDAR-based methods, vision-based methods are often adopted for pose estimation in indoor environments with sufficient illumination. Vision-based pose estimation methods mainly include 2D vision and 3D vision. In certain work environments, 2D vision-based methods have been widely used for the pose estimation of target objects [7,8,9,10]. Varga et al. [11] obtained the position and pose information of pallets from intensity images and a stereo camera, and the LBP feature descriptor was introduced to realize automatic fork picking of pallets. Monocular images and matching of 2D deformation patterns were used to estimate the pose of known objects in dynamic environments by Casado et al. [12]. With the rapid development of low-cost depth sensors, object pose estimation has been transformed from traditional single-point and segmented measurements to dense point clouds and full-profile measurements [13,14,15,16]. Compared with 2D vision, 3D vision obtains one more dimension of target information, which solves the problem of information loss in the mapping from 3D space to 2D space, and it has gradually become a hot topic in current research [17,18]. The current popular 3D vision solution is to estimate the pose of the target by point cloud registration. Common point cloud registration algorithms include the Normal Distributions Transform (NDT) algorithm [19], the Principal Component Analysis (PCA) algorithm [20], the Iterative Closest Point (ICP) algorithm [21] and many other improved algorithms. The principle of the traditional ICP algorithm is intuitive and easy to understand, and its registration effect is remarkable; thus, it is widely used in point cloud registration. However, the ICP algorithm also has drawbacks, such as a long calculation time and the inability to escape local optima [22]. Wu et al. [23] proposed a novel nearest neighbor search algorithm to improve the iteration speed of point cloud registration. Fotsing et al. [24] proposed a novel region growing-based approach for plane detection in unorganized point clouds, which extracts reliable seeds using the Iterative Closest Point algorithm to enhance the performance and the quality of the results.
In addition, the initial pose of the point cloud to be registered has an important influence on the accuracy of point cloud registration. Coarse registration based on feature matching is used to obtain a better initial pose, which is beneficial for improving the pose estimation accuracy [25]. Rusu et al. [26] proposed the Fast Point Feature Histogram (FPFH) descriptor, which can be used to describe the neighborhood geometry of a query point and is often used to estimate the target pose. Tombari et al. [27] proposed the Signature of Histograms of Orientations (SHOT), which can represent topological features, has rotation invariance and is robust to noise. Li et al. [28] used Partition Viewpoint Feature Histograms (VFH) for 3D object recognition and pose estimation in random bin-picking, which was robust against a large degree of surface noise and missing depth information; however, pose estimation could fail when the object was placed symmetrically with respect to the viewpoint. Compared with other feature descriptors, the FPFH has the characteristics of fast computation and high accuracy, and it is used to describe the geometric features of the pallet point cloud in this study. However, the current FPFH descriptor also has some drawbacks. Firstly, when selecting the weight coefficients of the FPFH, only the Euclidean distance between the query point and the neighborhood points is considered, which can make the weights differ by orders of magnitude and reduces the robustness of the FPFH feature descriptor. In addition, the current calculation approach of the FPFH feature descriptor does not consider a selection criterion for the neighborhood radius, which is usually tuned empirically, reducing the efficiency and accuracy of pose estimation.
In view of the above problems, a pallet pose estimation approach based on an Adaptive Gaussian Weight-based Fast Point Feature Histogram is proposed. On the one hand, when determining the neighborhood radius of the descriptor, the optimal neighborhood radius of each point is obtained based on the minimum rule of the neighborhood feature entropy function, which overcomes the randomness of manually tuning the neighborhood radius parameter. On the other hand, when determining the weights of the neighborhood points, the weight calculation formula is redefined according to the average distance and standard deviation between key points and their neighborhood points, so that the weight of each neighborhood point is kept within a certain range.
The remainder of the paper is organized as follows: In Section 2, an overview of the proposed approach and the specific steps of the method are described. In Section 3, two cases are presented to verify the effectiveness of the proposed approach in engineering application, and the experimental results are analyzed and discussed. Finally, a conclusion is drawn in Section 4.

2. The Proposed Pallet Pose Estimation Method

2.1. Overview of the Proposed Approach

In order to accurately locate pallets for driverless industrial trucks in the storage environment, a pallet pose estimation method based on an Adaptive Gaussian Weight-based Fast Point Feature Histogram is proposed; the procedure involves the following steps, and the flowchart is shown in Figure 2.
Step 1: Point cloud preprocessing. The source point cloud of the pallet and the scene point cloud containing the pallet are collected using an active binocular vision sensor. The redundant scene information in the scene point cloud is removed through pass-through filtering and voxel grid downsampling [29], and the plane segmentation algorithm is used to obtain the target pallet point cloud. The key points in the target point cloud are extracted by the Intrinsic Shape Signatures (ISS) algorithm [30].
Step 2: Adaptive Gaussian Weight-based Fast Point Feature Histogram definition. Adaptive optimal neighborhood radius is used to determine the neighborhood range of each point in the target point cloud. By calculating the mean value and variance of the distance between each key point and its neighborhood, the Adaptive Gaussian weight-based Fast Point Feature Histogram (AGWF) of each key point is obtained.
Step 3: Point cloud registration. According to the AGWF feature descriptor, the SAC-IA (sample consensus initial alignment) algorithm is used to coarsely register the source point cloud with the target point cloud. Then, the ICP algorithm is used to transform the point cloud iteratively and obtain the optimal rigid transformation matrix. The matrix parameters are converted into horizontal deviation and angle to realize pallet pose estimation.

2.2. Point Cloud Preprocessing

2.2.1. Point Cloud Filtering

The original source point cloud P of the pallet and the original scene point cloud Qso (on the order of millions of points) are collected by an active binocular vision sensor. The efficiency and accuracy of point cloud processing would be reduced by the large number of acquired scene points and the large amount of redundant scene information. The filtering interval is determined according to the spatial position relationship between the pallet and the driverless industrial truck in the standard state, and the invalid points and background information in the original scene point cloud Qso are removed by a classical pass-through filter. A large number of redundant points are then removed by voxel grid downsampling while the complete geometric features of the point cloud are retained, yielding the filtered scene point cloud Qs.
The specific steps of voxel grid downsampling are as follows: (1) In point cloud Qso, the maximum and minimum values in the X, Y and Z directions are xmax, xmin, ymax, ymin, zmax and zmin, respectively. Set the voxel grid size d0, so that the grid contains $m = \lceil (x_{\max} - x_{\min})/d_0 \rceil$, $n = \lceil (y_{\max} - y_{\min})/d_0 \rceil$ and $l = \lceil (z_{\max} - z_{\min})/d_0 \rceil$ cells along the three axes, where $\lceil \cdot \rceil$ denotes rounding up. (2) Encode each point Qv(xv, yv, zv) in Qso as (mv, nv, lv) to determine which grid cell it belongs to, where $m_v = \lfloor (x_v - x_{\min})/d_0 \rfloor$, $n_v = \lfloor (y_v - y_{\min})/d_0 \rfloor$, $l_v = \lfloor (z_v - z_{\min})/d_0 \rfloor$, and $\lfloor \cdot \rfloor$ denotes rounding down. (3) If a voxel grid cell contains points, calculate their center of gravity C0 = (x0, y0, z0), with $x_0 = \frac{1}{k}\sum_{i=1}^{k} x_i$, $y_0 = \frac{1}{k}\sum_{i=1}^{k} y_i$ and $z_0 = \frac{1}{k}\sum_{i=1}^{k} z_i$, where k is the number of points in the cell, and replace the points in the cell with the point nearest to the center of gravity to obtain the filtered scene point cloud Qs.
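To make these steps concrete, the following is a minimal NumPy sketch of the pass-through filtering and voxel grid downsampling described above. The function names, the example filtering intervals and the voxel size d0 are illustrative assumptions, not the implementation used in the paper.

```python
import numpy as np

def pass_through_filter(points, x_lim, y_lim, z_lim):
    """Keep only points whose coordinates fall inside the given (min, max) intervals."""
    mask = ((points[:, 0] > x_lim[0]) & (points[:, 0] < x_lim[1]) &
            (points[:, 1] > y_lim[0]) & (points[:, 1] < y_lim[1]) &
            (points[:, 2] > z_lim[0]) & (points[:, 2] < z_lim[1]))
    return points[mask]

def voxel_grid_downsample(points, d0):
    """In every occupied voxel of size d0, keep the point closest to the voxel's
    center of gravity C0, as described in steps (1)-(3) above."""
    mins = points.min(axis=0)
    # Voxel index (m_v, n_v, l_v) of each point, i.e. rounding down
    idx = np.floor((points - mins) / d0).astype(np.int64)
    _, inverse, counts = np.unique(idx, axis=0, return_inverse=True, return_counts=True)
    inverse = inverse.reshape(-1)
    order = np.argsort(inverse)                       # group point indices by voxel
    kept = []
    for grp in np.split(order, np.cumsum(counts)[:-1]):
        pts = points[grp]
        c0 = pts.mean(axis=0)                         # voxel center of gravity
        kept.append(grp[np.argmin(np.linalg.norm(pts - c0, axis=1))])
    return points[np.array(kept)]

# Example usage (filtering intervals as reported for the shelf scene in Section 3.2.2;
# the voxel size 0.01 m is an assumed value):
# Qs = voxel_grid_downsample(pass_through_filter(Qso, (-1.0, 1.0), (-0.5, 0.5), (1.5, 3.0)), d0=0.01)
```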

2.2.2. Plane Segmentation

Because the filtered scene point cloud Qs still contains the pallet, wall, ground and other information, the pallet needs to be separated from the wall and ground. The plane segmentation method based on Random Sample Consensus (RANSAC) is used to iteratively find the points belonging to a plane according to the set plane model. Meanwhile, ground with different degrees of fluctuation can be detected by setting the model distance threshold. The specific steps are as follows: (1) The initial plane model Ax + By + Cz + D = 0 is constructed by selecting any three points from the filtered scene point cloud. (2) The distance di from point qi (a point in the point cloud Qs) to the initial plane and the angle αi between the estimated normal at qi and the normal vector of the initial plane are calculated. A distance threshold de and an angle threshold αe are set; if di < de and αi < αe, then point qi is considered an in-plane point. (3) Iterations are carried out until the number of in-plane points reaches the threshold t; the final fitted plane model representing the wall and ground is removed, yielding the target pallet point cloud Q.
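The following is a minimal sketch of the RANSAC plane fitting described above, using NumPy only. It implements the distance test di < de; the normal-angle test αi < αe would additionally require per-point normals and is omitted here, and the function name and default parameters are assumptions.

```python
import numpy as np

def ransac_plane_inliers(points, d_e, iters=500, seed=0):
    """Return a boolean mask of points within distance d_e of the best-fit plane."""
    rng = np.random.default_rng(seed)
    best_mask, best_count = None, -1
    n_pts = points.shape[0]
    for _ in range(iters):
        p1, p2, p3 = points[rng.choice(n_pts, size=3, replace=False)]
        normal = np.cross(p2 - p1, p3 - p1)
        norm = np.linalg.norm(normal)
        if norm < 1e-12:                        # degenerate (collinear) sample
            continue
        normal /= norm
        d = -normal @ p1                        # plane: normal . x + d = 0
        dist = np.abs(points @ normal + d)
        mask = dist < d_e
        if mask.sum() > best_count:
            best_count, best_mask = mask.sum(), mask
    return best_mask

# Usage: remove the dominant plane (e.g. ground or wall) from the filtered cloud Qs
# inliers = ransac_plane_inliers(Qs, d_e=0.01)   # d_e is an assumed threshold
# Q = Qs[~inliers]
```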

2.2.3. Key Points Extraction

After filtering and plane segmentation, there are still tens of thousands of points in the target pallet point cloud Q, which reduces the pose estimation efficiency and fails to meet the operating requirements of driverless industrial trucks. Therefore, points with obvious geometric features are selected from the target point cloud Q to form the key point set Qk, and only the features of these key points are extracted, which significantly improves the efficiency of point cloud feature extraction. Due to its advantages of high speed, accuracy and robustness, the Intrinsic Shape Signatures (ISS) algorithm is adopted to extract key points, and it is suitable for various applications [31]. The main steps are as follows: (1) Establish a local coordinate frame at each point qv in point cloud Q and set a neighborhood search radius rf. (2) Search for the neighborhood points qj of the query point qv within rf, and calculate the weight $w_{vj} = \| q_j - q_v \|^{-1}$ according to the Euclidean distance between qv and qj. (3) Calculate the weighted neighborhood covariance matrix of qv: $C_v = \sum_{\| q_j - q_v \| < r_f} w_{vj} (q_j - q_v)(q_j - q_v)^{T} \big/ \sum_{\| q_j - q_v \| < r_f} w_{vj}$. (4) The eigenvalues $\lambda_{v1}$, $\lambda_{v2}$ and $\lambda_{v3}$ are obtained by eigenvalue decomposition of the covariance matrix and arranged in descending order. (5) Set the thresholds $\varepsilon_1$ and $\varepsilon_2$ ($\varepsilon_1, \varepsilon_2 < 1$). If $\lambda_{v2}/\lambda_{v1} \le \varepsilon_1$ and $\lambda_{v3}/\lambda_{v2} \le \varepsilon_2$, the point is taken as a key point; otherwise, move on to the next point. Repeat the process until all points have been traversed, finally obtaining the key point set Qk of the target point cloud.
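A compact sketch of the ISS saliency test described above is given below, using NumPy and SciPy's k-d tree. The default thresholds 0.4 and 0.2 mirror the values used in the shelf-scene experiment in Section 3.2.2; the non-maximum suppression step found in full ISS implementations is omitted, so this is an illustrative approximation rather than the authors' code.

```python
import numpy as np
from scipy.spatial import cKDTree

def iss_keypoints(points, r_f, eps1=0.4, eps2=0.2):
    """Keep points whose weighted-covariance eigenvalue ratios
    lambda2/lambda1 and lambda3/lambda2 are both below the thresholds."""
    tree = cKDTree(points)
    keep = []
    for v, qv in enumerate(points):
        nbrs = [j for j in tree.query_ball_point(qv, r_f) if j != v]
        if len(nbrs) < 3:
            continue
        diff = points[nbrs] - qv
        dist = np.linalg.norm(diff, axis=1)
        w = 1.0 / np.maximum(dist, 1e-12)                 # w_vj = ||q_j - q_v||^-1
        cov = (w[:, None, None] * diff[:, :, None] * diff[:, None, :]).sum(0) / w.sum()
        lam = np.sort(np.linalg.eigvalsh(cov))[::-1]      # lambda1 >= lambda2 >= lambda3
        if lam[1] / lam[0] <= eps1 and lam[2] / lam[1] <= eps2:
            keep.append(v)
    return np.array(keep)

# key_idx = iss_keypoints(Q, r_f=0.05)   # 0.05 m is the search radius used in Section 3.2.2
```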

2.3. Adaptive Gaussian Weight-based Fast Point Feature Histogram Definition

2.3.1. Adaptive Neighborhood Radius

The premise of accurate pose estimation is to construct a feature descriptor of the pallet point cloud with high efficiency, strong robustness and high accuracy, and the neighborhood radius is an important factor affecting the performance of feature descriptors. The neighborhood radius of the traditional FPFH is usually set to a fixed value according to experience, which reduces the speed of feature extraction. A neighborhood radius selection criterion based on adaptive neighborhood feature entropy is therefore proposed to obtain the neighborhood radius of each key point qk (a point in the key point set Qk) adaptively. The detailed steps are as follows (a minimal code sketch is given after the list):
  • Set the range of the point cloud neighborhood search radius rj from a lower limit rmin to an upper limit rmax with a radius interval rd. The upper and lower limits of the radius range are determined by the average point cloud distance dp, which is defined as follows [32]:
    $d_p = \frac{1}{N} \sum_{m=1}^{N} d_m$
    where N represents the total number of points in the pallet point cloud, and dm represents the distance of each key point qk from its nearest point. Set $r_{\min} = d_p$, $r_{\max} = 2 d_p$ and $r_{j+1} = r_j + r_d$, where $j = 1, 2, \ldots$
  • Calculate the covariance matrix and eigenvalues corresponding to each candidate neighborhood radius rj. The neighborhood covariance matrix M is defined as follows:
    $M = \begin{bmatrix} e_1 & e_2 & e_3 \end{bmatrix} \begin{bmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_3 \end{bmatrix} \begin{bmatrix} e_1^{T} \\ e_2^{T} \\ e_3^{T} \end{bmatrix}$
    where $\lambda_1$, $\lambda_2$, $\lambda_3$ are the eigenvalues of the neighborhood covariance matrix, and $e_1$, $e_2$, $e_3$ are the corresponding eigenvectors.
  • According to the eigenvalues, the neighborhood feature entropy function $E_\xi$ is constructed:
    $E_\xi = -\xi_1 \ln(\xi_1) - \xi_2 \ln(\xi_2) - \xi_3 \ln(\xi_3)$
    where $\xi_i = \lambda_i \big/ \sum_{j=1}^{3} \lambda_j$, $i = 1, 2, 3$.
  • The adaptive optimal neighborhood radius ropt of the point cloud is determined based on the minimum criterion of the neighborhood feature entropy function; that is, when $E_\xi$ reaches its minimum value, the corresponding neighborhood radius rj is taken as the optimal neighborhood radius ropt:
    $r_{\mathrm{opt}} = \arg\min_{r_j} E_\xi$
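The adaptive radius selection can be sketched as follows with NumPy and SciPy. The helper shown here is an assumption-level illustration of the entropy-minimum criterion, not the authors' implementation; in practice rmin and rmax would be set from the average point cloud distance dp as described above.

```python
import numpy as np
from scipy.spatial import cKDTree

def adaptive_radius(points, key_idx, r_min, r_max, r_d):
    """For every key point, pick the radius in [r_min, r_max] (step r_d) that
    minimizes the neighborhood feature entropy E_xi."""
    tree = cKDTree(points)
    radii = np.arange(r_min, r_max + 1e-12, r_d)
    best = np.full(len(key_idx), r_min)
    for n, kp in enumerate(key_idx):
        best_e = np.inf
        for r in radii:
            nbrs = points[tree.query_ball_point(points[kp], r)]
            if len(nbrs) < 3:
                continue
            cov = np.cov(nbrs.T)
            lam = np.clip(np.linalg.eigvalsh(cov), 1e-12, None)
            xi = lam / lam.sum()
            e = -(xi * np.log(xi)).sum()          # neighborhood feature entropy E_xi
            if e < best_e:
                best_e, best[n] = e, r
    return best

# r_min and r_max follow the rule above: r_min = d_p and r_max = 2 * d_p, where the
# average point cloud distance d_p can be taken as the mean nearest-neighbour distance:
# d_p = cKDTree(points).query(points, k=2)[0][:, 1].mean()
```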

2.3.2. Gaussian Weight-Based Fast Point Feature Histogram

The features of traditional FPFH are determined by the neighborhood in the radius r of the query point itself and the neighborhood of its neighborhood points, whose maximum range is 2r. The FPFH neighborhood of a query point is shown in Figure 3, where pa1–pa5 are the neighborhood points of the query point Pa within the neighborhood radius r, and pb–pi are the neighborhood points of the points pa1–pa5.
The weight coefficient of the traditional FPFH algorithm is determined only by the Euclidean distance between the query point and the neighborhoods; often, there are huge differences among various weight coefficients. As is shown in Figure 4, the feature descriptor of the red query point qr is mostly influenced by the closest black points qa and qb, and other points only have little effect, which may cause the feature descriptor to be inaccurate.
Therefore, an improved FPFH is proposed; each key point qk in the key point set Qk is taken as the query point, and based on the adaptive optimal neighborhood radius ropt, all the neighborhood points qki within this radius are found. The weight coefficient is redefined with a Gaussian distribution, as follows:
$w_{Gi} = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left( -\frac{(w_i - \mu)^2}{2\sigma^2} \right)$
where wi is the Euclidean distance between qk and its neighborhood point qki, μ is the average of these distances, and σ is their standard deviation.
The weight coefficient variation trend of the original approach and the proposed approach is shown in Figure 5. It can be seen that the weight coefficient of the original approach varies greatly, and the weight tends to infinity with the decrease of the distance. The proposed AGWF can avoid the problem that the weight coefficient difference between points is apparently large, and the weight coefficient reaches the maximum when the distance is close to the average value, which reasonably solves the unstable problem of the FPFH descriptor calculation.
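A small numeric illustration of this behavior is given below: for the same set of neighbor distances, the traditional reciprocal-distance weights span more than an order of magnitude, while the Gaussian weights stay bounded and peak near the average distance. The distance values are invented for illustration only.

```python
import numpy as np

# Distances (m) from a key point to its neighbours; illustrative values only
d = np.array([0.001, 0.004, 0.006, 0.008, 0.012])

w_trad = 1.0 / d                              # traditional FPFH weight: blows up as d -> 0
mu, sigma = d.mean(), d.std()
w_gauss = np.exp(-(d - mu) ** 2 / (2 * sigma ** 2)) / (np.sqrt(2 * np.pi) * sigma)

print(w_trad)    # roughly [1000, 250, 167, 125, 83]: spans over an order of magnitude
print(w_gauss)   # peaks near d = mu and stays within a narrow, bounded range
```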
Combined with the adaptive optimal neighborhood radius selection criterion, an Adaptive Gaussian Weight-based Fast Point Feature Histogram (AGWF) is proposed. It improves not only the efficiency of feature extraction but also the accuracy and robustness of the descriptor. The specific calculation steps are as follows (a code sketch follows the list):
  • For each key point qk in the key point set Qk, search all the neighborhood points qki within its optimal neighborhood radius ropt.
  • Compute the normal vectors ns and nt corresponding to qk and qki, calculate the relative position deviation (α, φ, θ) between ns and nt, and generate the Simplified Point Feature Histogram (SPFH) of qk, SPFH(qk). A local coordinate system (u, v, w), shown in Figure 6, is defined to calculate this deviation:
The calculation formula of relative deviation is as follows:
$\alpha = v \cdot n_t, \qquad \varphi = \frac{u \cdot (q_{ki} - q_k)}{\left\| q_{ki} - q_k \right\|_2}, \qquad \theta = \arctan\left( w \cdot n_t,\; u \cdot n_t \right)$
  • Then, search the neighborhood points of qki based on the adaptive optimal neighborhood radius ropt, and generate the SPFH of qki, SPFH(qki). Based on the Gaussian weight wGi, the SPFH(qki) values are weighted to obtain the Adaptive Gaussian Weight-based Fast Point Feature Histogram of the key point qk, AGWF(qk):
$AGWF(q_k) = SPFH(q_k) + \frac{1}{k} \sum_{i=1}^{k} w_{Gi} \cdot SPFH(q_{ki})$
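Putting the pieces together, the following sketch computes SPFH angle histograms and combines them with the Gaussian weights into an AGWF descriptor, following the equation above. The histogram binning (11 bins per angle), the Darboux-frame construction and the assumption that unit normals are precomputed are implementation choices of this sketch, not details taken from the paper.

```python
import numpy as np
from scipy.spatial import cKDTree

def spfh(points, normals, tree, i, radius, bins=11):
    """Simplified Point Feature Histogram of point i: binned (alpha, phi, theta)
    angles between point i and each neighbour within `radius` (3 * bins dimensions)."""
    hist = np.zeros(3 * bins)
    nbrs = [j for j in tree.query_ball_point(points[i], radius) if j != i]
    for j in nbrs:
        d = points[j] - points[i]
        dn = np.linalg.norm(d)
        if dn < 1e-12:
            continue
        u = normals[i]                         # Darboux frame (u, v, w) at the query point
        v = np.cross(u, d / dn)
        w = np.cross(u, v)
        alpha, phi = v @ normals[j], u @ d / dn
        theta = np.arctan2(w @ normals[j], u @ normals[j])
        for k, (val, lo, hi) in enumerate([(alpha, -1, 1), (phi, -1, 1), (theta, -np.pi, np.pi)]):
            b = int(np.clip((val - lo) / (hi - lo) * bins, 0, bins - 1))
            hist[k * bins + b] += 1
    return hist / max(len(nbrs), 1)

def agwf(points, normals, key_idx, radii):
    """AGWF sketch: SPFH of each key point plus the Gaussian-weighted mean of its
    neighbours' SPFHs, i.e. AGWF(q_k) = SPFH(q_k) + (1/k) * sum_i w_Gi * SPFH(q_ki)."""
    tree = cKDTree(points)
    feats = []
    for kp, r in zip(key_idx, radii):          # r is the adaptive optimal radius r_opt
        nbrs = [j for j in tree.query_ball_point(points[kp], r) if j != kp]
        f = spfh(points, normals, tree, kp, r)
        if nbrs:
            dists = np.linalg.norm(points[nbrs] - points[kp], axis=1)
            mu, sigma = dists.mean(), dists.std() + 1e-12
            w_g = np.exp(-(dists - mu) ** 2 / (2 * sigma ** 2)) / (np.sqrt(2 * np.pi) * sigma)
            for wgi, j in zip(w_g, nbrs):
                f = f + wgi * spfh(points, normals, tree, j, r) / len(nbrs)
        feats.append(f)
    return np.array(feats)
```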

2.4. Point Cloud Registration

2.4.1. Coarse Registration

The purpose of coarse registration is to obtain the initial pose relationship between the source point cloud P and the target point cloud Q, so as to overcome the shortcomings of the ICP algorithm, which requires a good initial pose and easily falls into a local optimum. The SAC-IA algorithm can effectively adjust the initial pose relationship between the source point cloud P and the target point cloud Q and improve the accuracy of pose estimation. The specific steps are as follows (a code sketch follows the list):
  • Compute AGWF feature descriptors for all key points in the source point cloud P and the target point cloud Q.
  • N sample points Pu (u = 1, 2, …, N) are randomly selected from the source point cloud P such that the distance between any two sample points is greater than the preset distance threshold dmin.
  • According to the AGWF, search the closest points Qu (u = 1, 2, …, N) in the target point cloud Q to the sample points Pu, and obtain the initial match point pairs.
  • Obtain the rigid transformation matrix M1 between initial match point pairs by Singular Value Decomposition (SVD). Set a registration threshold el, and calculate the distance function H(li) to evaluate the point cloud registration performance. The expression of the distance function H(li) is as follows:
    $H(l_i) = \begin{cases} \frac{1}{2} l_i^{2}, & l_i < e_l \\ \frac{1}{2} e_l \left( 2 l_i - e_l \right), & l_i \ge e_l \end{cases}$
    where $l_i$ is the average Euclidean distance between the transformed source point cloud and the target point cloud at the i-th iteration.
  • Repeat the above four steps; when H(li) reaches the minimum, the corresponding transformation matrix is the coarse registration rigid transformation matrix Mc. The rigid transformation of the source point cloud P is carried out based on Mc to obtain the point cloud Pr, and coarse registration is completed.
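A minimal sketch of this coarse registration loop is shown below. The descriptor matching, SVD-based transform estimation and Huber-like scoring follow the steps above, while the minimum sample-distance check dmin and the exact stopping rule are simplified; the parameter defaults are assumptions.

```python
import numpy as np
from scipy.spatial import cKDTree

def rigid_from_pairs(src, tgt):
    """Least-squares rigid transform (R, t) mapping src onto tgt via SVD."""
    cs, ct = src.mean(0), tgt.mean(0)
    U, _, Vt = np.linalg.svd((src - cs).T @ (tgt - ct))
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:                   # avoid reflections
        Vt[-1] *= -1
        R = Vt.T @ U.T
    return R, ct - R @ cs

def sac_ia(src_kp, tgt_kp, src_feat, tgt_feat, n_samples=3, iters=200, e_l=0.02, seed=0):
    """SAC-IA sketch: sample source key points, match them to target key points with
    the closest AGWF descriptors, score the induced transform with the Huber-like
    distance function H(l_i), and keep the best transform found."""
    rng = np.random.default_rng(seed)
    feat_tree = cKDTree(tgt_feat)
    tgt_tree = cKDTree(tgt_kp)
    best_R, best_t, best_score = np.eye(3), np.zeros(3), np.inf
    for _ in range(iters):
        s = rng.choice(len(src_kp), size=n_samples, replace=False)
        _, match = feat_tree.query(src_feat[s])            # nearest descriptors in feature space
        R, t = rigid_from_pairs(src_kp[s], tgt_kp[match])
        d, _ = tgt_tree.query(src_kp @ R.T + t)            # residual distances after transform
        score = np.where(d < e_l, 0.5 * d**2, 0.5 * e_l * (2 * d - e_l)).sum()
        if score < best_score:
            best_score, best_R, best_t = score, R, t
    return best_R, best_t
```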

2.4.2. Accurate Registration

After coarse registration, the source point cloud Pr and the target point cloud Q can only roughly coincide, so it is necessary to improve the pose estimation accuracy by further accurate registration. The ICP algorithm is used for accurate registration. The algorithm obtains the nearest Euclidean point through exhaustive search and obtains accurate registration parameters based on the results of the optimal objective function. According to the registration parameters, 6 Degrees of Freedom (6 DOF) pose estimation result can be obtained [33].
  • Set a distance threshold ef and the maximum number of iterations I0. For each point pri in the source point set Pr, search for its closest corresponding point qi in the target point set Q to form the set of corresponding point pairs Cl.
  • Solve the rigid transformation matrix by SVD to obtain the rotation matrix Rn and the translation matrix Tn, where n is the iteration number. Transform the source point cloud Pr by (Rn, Tn) into Prn, and form the corresponding point pairs Cn. Calculate the average Euclidean distance en between the corresponding point pairs:
    $e_n = \frac{1}{k} \sum_{i=1}^{k} \left\| q_i - \left( p_{ri} \cdot R_n + T_n \right) \right\|_2$
    where k is the number of corresponding point pairs.
  • Repeat the above steps until en is smaller than ef or the maximum number of iterations I0 is reached, and finally obtain the optimal transformation matrix R and T.
  • Let Rx, Ry and Rz be the rotation angles of the three coordinate axes, and tx, ty and tz be the translation vectors of the coordinate axes; the 6 DOF pose estimation can be represented as (Rx, Ry, Rz, tx, ty, tz). The optimal transformation matrix Ma can be expressed as
$M_a = T(t_x, t_y, t_z) \times R(R_x, R_y, R_z)$
where $T(t_x, t_y, t_z)$ and $R(R_x, R_y, R_z)$ can be expressed as
$T(t_x, t_y, t_z) = \begin{pmatrix} 1 & 0 & 0 & t_x \\ 0 & 1 & 0 & t_y \\ 0 & 0 & 1 & t_z \\ 0 & 0 & 0 & 1 \end{pmatrix}$
$R(R_x, R_y, R_z) = \begin{pmatrix} r_{11} & r_{12} & r_{13} & 0 \\ r_{21} & r_{22} & r_{23} & 0 \\ r_{31} & r_{32} & r_{33} & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}$
It can be concluded that the 6 DOF pose estimation parameters of the target can be expressed as
$(R_x, R_y, R_z, t_x, t_y, t_z) = \left( \arctan(r_{32}/r_{33}),\; \arcsin(r_{13}),\; \arctan(r_{21}/r_{11}),\; t_x,\; t_y,\; t_z \right)$
According to the actual situation of pallet forking in the storage environment, only the horizontal deviations tx and tz and the rotation angle Ry about the axis perpendicular to the ground need to be obtained. The driverless industrial truck adjusts its fork path according to the deviation parameters (tx, tz, Ry), as illustrated by the sketch below.
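For illustration, the sketch below runs a plain point-to-point ICP loop (the SVD step mirrors the one in the coarse-registration sketch above) and then converts the resulting (R, T) into the 6 DOF parameters of the equations above. Note that the sign and index conventions of the Euler-angle extraction depend on the assumed rotation order; the formulas here simply mirror the ones stated in the text.

```python
import numpy as np
from scipy.spatial import cKDTree

def best_rigid(src, tgt):
    """Least-squares rigid transform (R, t) mapping src onto tgt via SVD."""
    cs, ct = src.mean(0), tgt.mean(0)
    U, _, Vt = np.linalg.svd((src - cs).T @ (tgt - ct))
    if np.linalg.det(Vt.T @ U.T) < 0:          # avoid reflections
        Vt[-1] *= -1
    R = Vt.T @ U.T
    return R, ct - R @ cs

def icp(src, tgt, e_f=1e-4, max_iter=50):
    """Point-to-point ICP sketch: closest-point pairing + SVD update until the
    mean pair distance e_n drops below e_f or max_iter is reached."""
    tree = cKDTree(tgt)
    R, t = np.eye(3), np.zeros(3)
    cur = src.copy()
    for _ in range(max_iter):
        _, idx = tree.query(cur)               # closest-point correspondences
        Rn, tn = best_rigid(cur, tgt[idx])
        cur = cur @ Rn.T + tn
        R, t = Rn @ R, Rn @ t + tn             # accumulate the total transform
        e_n = np.linalg.norm(cur - tgt[idx], axis=1).mean()
        if e_n < e_f:
            break
    return R, t

def pose_parameters(R, t):
    """6 DOF parameters per the closing equations of Section 2.4.2
    (convention-dependent; mirrors the formulas as stated in the text)."""
    Rx = np.degrees(np.arctan2(R[2, 1], R[2, 2]))
    Ry = np.degrees(np.arcsin(np.clip(R[0, 2], -1.0, 1.0)))
    Rz = np.degrees(np.arctan2(R[1, 0], R[0, 0]))
    tx, ty, tz = t
    return Rx, Ry, Rz, tx, ty, tz

# R, T = icp(source_after_coarse_registration, target_cloud)
# Rx, Ry, Rz, tx, ty, tz = pose_parameters(R, T)
# The truck controller only needs (tx, tz, Ry): the horizontal deviations and the
# deflection angle about the axis perpendicular to the ground.
```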

3. Pallet Pose Estimation Experiment

3.1. Data Collection

An industrial vision sensor, the Percipio FM851-E2 3D vision sensor, is adopted to acquire the source point cloud and the scene point cloud; its ranging principle is active binocular vision, and its operating range is 0.7–6.0 m. The structure of the Percipio FM851-E2 vision sensor is shown in Figure 7.
To obtain the source point cloud of the pallet, a standard blue pallet is placed at a fixed position in a laboratory with normal brightness and no obstructions, and the Percipio FM851-E2 vision sensor is fixed on top of the fork frame to capture images, as shown in Figure 8. To meet the operational requirements of the production floor, the front of the fork is placed 500 mm away from the pallet, ensuring that the fork is perpendicular to the front of the pallet and the center of the sensor is aligned with the center of the pallet. The collected point cloud is processed by the point cloud filtering and plane segmentation algorithms, and the remaining points form the source point cloud, which contains the position information of the pallet point cloud in the sensor coordinate system under the standard state after visualization, as shown in Figure 9. The key points of the source point cloud are extracted, the adaptive optimal neighborhood radius is obtained, and the AGWF features are calculated (the results are shown in Figure 10) and saved in the database.

3.2. Multiple Scenario Experiments

In order to verify the effectiveness of the pose estimation algorithm, the pallet pose estimation algorithm is experimentally verified in several scenarios, mainly the common ground scenarios and shelf scenarios.

3.2.1. The Ground Scene

Considering the practical requirements of the pallet pose estimation scenario, only the horizontal lateral translation Δx, the longitudinal translation Δy and the deflection angle φ are considered for the relative pose between the storage pallet and the sensor. Deflection angles of 5°, 10°, 15° and 20° and horizontal deviations of 0.05 m, 0.10 m, 0.15 m and 0.20 m are selected for the experiments. The correction of the deflection angle is completed through the rotation of the driverless industrial truck, so the rotation center of the deflection angle is actually the origin of the camera coordinate system in this experiment. The scene point clouds are captured and preprocessed, the target pallet point clouds are extracted from them, and the adaptive optimal neighborhoods of the key points and the AGWF feature descriptors are calculated to match against the source pallet point cloud. Taking the experiment at a deflection angle of 5° as an example, the visualization results of the scene point cloud, the relative position between the target point cloud and the source point cloud, the key points of the target point cloud, and the coarse and accurate registration are shown in Figure 11.
The number of point clouds and the time consumed for each step of the experiments are shown in Table 1, and the scene point clouds captured and the registration results are shown in Figure 12. The experiments’ pose estimation results and errors are shown in Table 2. The experiments (d) and (h) are the experimental control groups with the out-of-limit deviations (bolded in Table 2), which are not taken into account for calculating the total average deviation and accuracy.
As can be seen from the experimental results in the above tables, the average estimation error in the lateral direction is about 0.0098 m, the average estimation error in the longitudinal direction is about 0.0194 m, and the average estimation error of the deflection angle is about 0.5°, with a total average accuracy of 97.3%. The algorithm therefore has high accuracy and strong robustness when the horizontal deviation is within 0.15 m and the deflection angle is within 15°. However, as shown for experiments (d) and (h) in Figure 12, when the horizontal deviation or the deflection angle is too large, the target point cloud and the source point cloud may fail to be aligned due to the limited field of view of the depth camera and the excessive initial pose deviation, which affects the accuracy of the pallet pose estimation.

3.2.2. The Shelf Scene

In a production workshop that has several shelves with three layers, a blue pallet is placed in the second layer, which has a certain position deviation with the standard state. The driverless industrial truck follows a preset path to a designated location, and the horizontal deflection and the size of the deflection angle must be calculated by the pose estimation algorithm to realize the accurate forklift of pallets for driverless industrial trucks.
The color image of the scene captured by the Percipio active binocular vision sensor is shown in Figure 13, and the scene point cloud is shown in Figure 14; the number of scene points is 2,073,600. Redundant points are eliminated by pass-through filtering. According to the position relationship between the vision sensor and the pallet, the pass-through filtering intervals are set as x ∈ (−1.0, 1.0) m, y ∈ (−0.5, 0.5) m and z ∈ (1.5, 3.0) m, and the number of remaining points is 259,443; the result is shown in Figure 15. After plane segmentation and voxel grid downsampling, the target pallet point cloud in the shelf scene is obtained, with 7438 remaining points, as shown in Figure 16a. The search radius for key point extraction is set as 0.05 m, and the two eigenvalue-ratio thresholds are set as ε1 = 0.4 and ε2 = 0.2. A total of 1476 key points are extracted from the target point cloud, as shown in Figure 16b, where the red points are the key points.
Before computing the AGWF feature descriptors of the target point cloud, the adaptive neighborhood radius of each point needs to be determined. The average point cloud distance of the target point cloud is calculated as dp = 0.006 m, so rmin = 0.006 m, rmax = 0.012 m and rd = 0.001 m are set. The adaptive optimal neighborhood radius of each point is obtained according to the minimum criterion of the neighborhood feature entropy function, and the distribution of the adaptive optimal neighborhood radii is shown in Figure 17. The horizontal coordinate indicates the different neighborhood radii, and the vertical coordinate indicates the number of points corresponding to each neighborhood radius. It can be seen that the optimal neighborhood radii of the key points are concentrated at the given minimum neighborhood radius, which significantly improves the efficiency of the feature descriptor.
The AGWF feature descriptors of the source point cloud and the target pallet point cloud are calculated based on the minimum neighborhood radius, and the AGWF feature of a point is shown in Figure 18.
The SAC-IA algorithm is used for the coarse registration, and the ICP is used to complete the accurate registration; the accurate optimal rigid rotation matrix R and the translation matrix T are obtained as follows:
$R = \begin{bmatrix} 0.99221826 & 0.0056165964 & 0.12438513 \\ 0.0041089170 & 0.99991482 & 0.012374031 \\ 0.12444409 & 0.011766853 & 0.99215692 \end{bmatrix}$
$T = \begin{bmatrix} 0.35200304 \\ 0.083008856 \\ 0.018321276 \end{bmatrix}$
The result of the accurate registration is shown in Figure 19. The left figure shows the registration of the source point cloud and the target pallet point cloud, where the red point cloud is the source point cloud and the blue point cloud is the target pallet point cloud. The figure on the right shows the pose of the source point cloud after transformation by the accurate optimal rigid transformation matrix, where the red point cloud is the source point cloud and the remaining point clouds are the scene point clouds captured by the Percipio vision sensor while the driverless industrial truck is working. As can be seen in Figure 19, after the rigid transformation, the source point cloud and the target pallet point cloud in the scene basically overlap, so the rotation matrix R and translation matrix T obtained from the accurate registration can be considered reliable, and the source and target point clouds achieve a satisfactory registration result, which further verifies the accuracy of the proposed pose estimation algorithm. According to the rotation matrix R and the translation matrix T, the horizontal deviations Δx and Δy are calculated as 0.35 m and 0.018 m, respectively, the deflection angle is 12.43°, and the pose estimation is completed.

3.3. Results and Discussion

To verify the rationality of Adaptive Gaussian Weight-based Fast Point Feature Histogram, a comparison is made with the traditional FPFH feature descriptor. A diagram of the comparison between AGWF and FPFH for a point is shown in Figure 20, with the horizontal coordinates indicating the feature dimensions and the vertical coordinates indicating the feature values on each dimension.
As can be seen in Figure 20, the proposed AGWF descriptor has more balanced values among dimensions and can describe more feature information as compared with the FPFH feature descriptor. The problem that the feature descriptor is overly influenced by a certain neighborhood point is avoided, and the robustness of the feature descriptor is improved. The comparison of coarse registration results between FPFH and AGWF is shown in Figure 21.
The proposed approach is compared with the traditional PFH and FPFH feature descriptors, as well as the more advanced SHOT feature descriptor, in terms of feature extraction time and RMSE. The experimental results of the different descriptors are shown in Table 3, where R is the neighborhood radius used for feature extraction, T is the time consumed for feature extraction, Dt is the percentage reduction in time consumption of the proposed method compared with the traditional methods, DRMSE is the percentage reduction in error of the proposed method compared with the traditional methods, and GWF is the Gaussian Weight-based Fast Point Feature Histogram, which only improves the traditional FPFH with the Gaussian weight and does not use the adaptive neighborhood. In Table 4, the neighborhood radius of the traditional descriptors is set as 0.014 m, and the neighborhood radius of the AGWF is set adaptively. The GWF and AGWF are the feature descriptors proposed in this paper, and they are bolded in Tables 3 and 4.
As shown in Table 3, the time consumption of the GWF is 7.9% and 17.34% longer than that of the SHOT and FPFH, respectively, while it is 56.6% faster than the PFH. Compared with the PFH, FPFH and SHOT, the RMSE of the GWF is reduced by 54.47%, 7.57% and 10.99%, respectively. The GWF does not have an absolute advantage in terms of time, because it needs to consider more information during feature extraction, but it achieves higher registration accuracy. As can be seen in Table 4, when the neighborhood radius of the traditional descriptors is set to 0.014 m, the time consumption of the proposed method is reduced by 46.61%, 79.16% and 48.15%, and the RMSE is reduced by 36.35%, 68.75% and 36.23%, compared with the SHOT, PFH and FPFH, respectively. Comparing Table 3 and Table 4, the choice of the neighborhood radius influences both the time consumption of feature extraction and the accuracy of point cloud registration, and the proposed adaptive radius can significantly reduce the feature extraction time while improving the registration accuracy. Compared with the fixed radius (GWF), the proposed AGWF feature descriptor reduces the time consumption by 43.27% and the RMSE by 27.86%. Overall, the proposed method reduces the time consumption by more than 30% and the error by more than 35%.
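As a quick sanity check, the reduction percentages defined in the notes to Tables 3 and 4 can be reproduced from the tabulated times and RMSE values; small discrepancies (e.g., 54.45% versus the reported 54.47%) come from rounding of the tabulated values. For example, for the PFH row of Table 3:

```python
# Dt and DRMSE of the GWF relative to the PFH row of Table 3
Dt = (3.872 - 1.678) / 3.872 * 100                 # ~56.66 %, reported as 56.6 %
Drmse = (0.035647 - 0.016239) / 0.035647 * 100     # ~54.45 %, reported as 54.47 %
print(round(Dt, 2), round(Drmse, 2))
```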
Compared with the traditional FPFH feature descriptor, the proposed AGWF feature descriptor improves the weight coefficients in the calculation process, which describes the local features of the point cloud better and fully considers the influence of each neighboring point on the key point features, thereby improving the accuracy and efficiency of pose estimation. In addition, methods based on traditional descriptors usually set the search radius of the feature descriptors empirically, which is inefficient. The proposed approach adaptively sets the neighborhood radius according to the minimum criterion of the neighborhood feature entropy function, which greatly reduces the computation time of the feature descriptors and achieves efficient and high-precision pose estimation of the pallet.

4. Conclusions

A point cloud data-driven pallet pose estimation method using an active binocular vision sensor is proposed, which consists of point cloud preprocessing, adaptive Gaussian weight-based fast point feature histogram definition and point cloud registration, and improves the pose estimation accuracy by over 35% and reduces the feature extraction time by over 30%. The main contributions of the proposed method can be summarized as follows:
  • A point cloud-driven method for the driverless industrial trucks to estimate the pose of the pallet in the production shop is proposed, which solves the problem that the pallet cannot be accurately forked due to the position deviation, and improves the security and stability of the logistics system.
  • An adaptive optimal neighborhood radius selection criterion based on the minimum rule of the local neighborhood feature entropy function is proposed to determine the neighborhood radius of each key point adaptively, instead of selecting the parameter manually based on experience, which significantly shortens the feature extraction time and improves the accuracy.
  • Traditional descriptors only consider the Euclidean distance between the query key point and its neighborhood points when computing the weights, whereas the weight of the proposed descriptor is optimized by a Gaussian distribution function. The weights of the neighborhood points vary more smoothly and describe the key point features more accurately and completely, thereby effectively improving the robustness of the feature descriptor.
Nevertheless, since the proposed approach still requires a large amount of computation on the point cloud data, the real-time performance of the approach needs to be further improved. At the same time, due to the complex storage environment, the proposed approach also needs further discussion on problems such as vision occlusion and multi-target overlap caused by dynamic and static obstacles. The subsequent work will focus on improving the real-time performance of the algorithm, as well as research on scenes with different lighting and different degrees of occlusion.

Author Contributions

Conceptualization, J.L.; methodology, Y.S.; software, Z.F.; validation, Z.F.; formal analysis, Y.S.; investigation, B.Z.; resources, J.L. and Y.L.; data curation, Z.F. and Y.L.; writing—original draft preparation, Y.S.; writing—review and editing, Y.S. and Z.F.; visualization, Z.F.; supervision, J.L. and Y.L.; project administration, B.Z. and Y.L.; funding acquisition, Y.S. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Zhejiang Provincial Natural Science Foundation, grant number LQ22E050017, the Zhejiang Science and Technology Plan Project, grant number 2018C01003, the China Postdoctoral Science Foundation, grant number 2021M702894, and the Zhejiang Provincial Postdoctoral Science Foundation, grant number ZJ2021119. All experiments were performed at Noblelift Intelligent Equipment Co., Ltd. We are grateful to Noblelift engineers for their experimental support.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Baglivo, L.; Biasi, N.; Biral, F.; Bellomo, N.; Bertolazzi, E.; Da Lio, M.; De Cecco, M. Autonomous pallet localization and picking for industrial forklifts: A robust range and look method. Meas. Sci. Technol. 2011, 22, 085502. [Google Scholar] [CrossRef]
  2. Hu, H.S.; Wang, L.; Luh, P. Intelligent manufacturing: New advances and challenges. J. Intell. Manuf. 2015, 26, 841–843. [Google Scholar] [CrossRef]
  3. Shuai, L.; Mingkang, X.; Weilin, Z.; Huilin, X. Towards Industrial Scenario Lane Detection: Vision-Based AGV Navigation Methods. In Proceedings of the 2020 IEEE International Conference on Mechatronics and Automation, Beijing, China, 13–16 October 2020; pp. 1101–1106. [Google Scholar]
  4. Baglivo, L.; Bellomo, N.; Marcuzzi, E.; Pertile, M.; Cecco, M. Pallet Pose Estimation with LIDAR and Vision for Autonomous Forklifts. In Proceedings of the IEEE 13th IFAC Symposium on Information Control Problems in Manufacturing IFAC-INCOM ‘09, Moscow, Russia, 3–5 June 2009; pp. 616–621. [Google Scholar]
  5. Mohamed, I.S.; Capitanelli, A.; Mastrogiovanni, F.; Rovetta, S.; Zaccaria, R. Detection, localisation and tracking of pallets using machine learning techniques and 2D range data. Neural Comput. Appl. 2020, 32, 8811–8828. [Google Scholar] [CrossRef] [Green Version]
  6. Zhang, Z.H.; Liang, Z.D.; Zhang, M.; Zhao, X.; Li, H.; Yang, M.; Tan, W.M.; Pu, S.L. RangeLVDet: Boosting 3D Object Detection in LIDAR with Range Image and RGB Image. IEEE Sens. J. 2022, 22, 1391–1403. [Google Scholar] [CrossRef]
  7. Seelinger, M.; Yoder, J.D. Automatic visual guidance of a forklift engaging a pallet. Robot. Auton. Syst. 2006, 54, 1026–1038. [Google Scholar] [CrossRef]
  8. Syu, J.L.; Li, H.T.; Chiang, J.S.; Hsia, C.H.; Wu, P.H.; Hsieh, C.F.; Li, S.A. A computer vision assisted system for autonomous forklift vehicles in real factory environment. Multimed. Tools Appl. 2017, 76, 18387–18407. [Google Scholar] [CrossRef]
  9. Fan, R.Z.; Xu, T.B.; Wei, Z.Z. Estimating 6D Aircraft Pose from Keypoints and Structures. Remote Sens. 2021, 13, 663. [Google Scholar] [CrossRef]
  10. Guo, K.; Ye, H.; Gao, X.; Chen, H.L. An Accurate and Robust Method for Absolute Pose Estimation with UAV Using RANSAC. Sensors 2022, 22, 5925. [Google Scholar] [CrossRef]
  11. Varga, R.; Nedevschi, S. Robust Pallet Detection for Automated Logistics Operations. In Proceedings of the 11th International Conference on Computer Vision Theory and Applications, Rome, Italy, 27–29 February 2016; pp. 470–477. [Google Scholar]
  12. Casado, F.; Lapido, Y.L.; Losada, D.P.; Santana-Alonso, A. Pose estimation and object tracking using 2D images. Procedia Manuf. 2017, 11, 63–71. [Google Scholar] [CrossRef]
  13. Shao, Y.P.; Wang, K.; Du, S.C.; Xi, L.F. High definition metrology enabled three dimensional discontinuous surface filtering by extended tetrolet transform. J. Manuf. Syst. 2018, 49, 75–92. [Google Scholar] [CrossRef]
  14. Zhao, C.; Du, S.C.; Lv, J.; Deng, Y.F.; Li, G.L. A novel parallel classification network for classifying three-dimensional surface with point cloud data. J. Intell. Manuf. 2021, in press. [Google Scholar] [CrossRef]
  15. Zhao, C.; Lv, J.; Du, S.C. Geometrical deviation modeling and monitoring of 3D surface based on multi-output Gaussian process. Measurement 2022, 199, 111569. [Google Scholar] [CrossRef]
  16. Shao, Y.P.; Du, S.C.; Tang, H.T. An extended bi-dimensional empirical wavelet transform based filtering approach for engineering surface separation using high definition metrology. Measurement 2021, 178, 109259. [Google Scholar] [CrossRef]
  17. Zhao, C.; Lui, F.C.; Du, S.C.; Wang, D.; Shao, Y.P. An Earth Mover’s Distance based Multivariate Generalized Likelihood Ratio Control Chart for Effective Monitoring of 3D Point Cloud Surface. Comput. Ind. Eng. 2023, 175, 108911. [Google Scholar] [CrossRef]
  18. Wang, X.D.; Liu, B.; Mei, X.S.; Wang, X.T.; Lian, R.H. A Novel Method for Measuring, Collimating, and Maintaining the Spatial Pose of Terminal Beam in Laser Processing System Based on 3D and 2D Hybrid Vision. IEEE Trans. Ind. Electron. 2022, 69, 10634. [Google Scholar] [CrossRef]
  19. Lee, H.; Park, J.M.; Kim, K.H.; Lee, D.H.; Sohn, M.J. Accuracy evaluation of surface registration algorithm using normal distribution transform in stereotactic body radiotherapy/radiosurgery: A phantom study. J. Appl. Clin. Med. Phys. 2022, 23, e13521. [Google Scholar] [CrossRef]
  20. Xie, X.X.; Wang, X.C.; Wu, Z.K. 3D face dense reconstruction based on sparse points using probabilistic principal component analysis. Multimed. Tools Appl. 2022, 81, 2937–2957. [Google Scholar] [CrossRef]
  21. Liu, W.L. LiDAR-IMU Time Delay Calibration Based on Iterative Closest Point and Iterated Sigma Point Kalman Filter. Sensors 2017, 17, 539. [Google Scholar] [CrossRef] [Green Version]
  22. Yang, J.L.; Li, H.D.; Campbell, D.; Jia, Y.D. Go-ICP: A Globally Optimal Solution to 3D ICP Point-Set Registration. IEEE Trans. Pattern Anal. 2016, 38, 2241–2254. [Google Scholar] [CrossRef] [Green Version]
  23. Wu, P.; Li, W.; Yan, M. 3D scene reconstruction based on improved ICP algorithm. Microprocess. Microsyst. 2020, 75, 103064. [Google Scholar] [CrossRef]
  24. Fotsing, C.; Menadjou, N.; Bobda, C. Iterative closest point for accurate plane detection in unorganized point clouds. Automat. Constr. 2021, 125, 103610. [Google Scholar] [CrossRef]
  25. Guo, N.; Zhang, B.H.; Zhou, J.; Zhan, K.T.; Lai, S. Pose estimation and adaptable grasp configuration with point cloud registration and geometry understanding for fruit grasp planning. Comput. Electron. Agric. 2020, 179, 105818. [Google Scholar] [CrossRef]
  26. Rusu, R.B.; Blodow, N.; Beetz, M. Fast Point Feature Histograms (FPFH) for 3D Registration. In Proceedings of the 2009 IEEE International Conference on Robotics and Automation, Kobe, Japan, 12–17 May 2009; pp. 1848–1853. [Google Scholar]
  27. Tombari, F.; Salti, S.; Di Stefano, L. Unique Signatures of Histograms for Local Surface Description. Lect. Notes Comput. Sci. 2010, 6313, 356–369. [Google Scholar]
  28. Li, D.p.; Liu, N.; Guo, Y.L.; Wang, X.M.; Xu, J. 3D object recognition and pose estimation for random bin-picking using Partition Viewpoint Feature Histograms. Pattern Recogn. Lett. 2019, 128, 148–154. [Google Scholar] [CrossRef]
  29. Toumieh, C.; Lambert, A. Voxel-Grid Based Convex Decomposition of 3D Space for Safe Corridor Generation. J. Intell. Robot. Syst. 2022, 105, 87. [Google Scholar] [CrossRef]
  30. Khanna, N.; Delp, E.J. Intrinsic Signatures for Scanned Documents Forensics: Effect of Font Shape and Size. In Proceedings of the Proceedings of 2010 IEEE International Symposium on Circuits and Systems, Paris, France, 30 May–2 June 2010; pp. 3060–3063. [Google Scholar]
  31. Xu, G.X.; Pang, Y.J.; Bai, Z.X.; Wang, Y.L.; Lu, Z.W. A Fast Point Clouds Registration Algorithm for Laser Scanners. Appl. Sci. 2021, 11, 3426. [Google Scholar] [CrossRef]
  32. Shao, Y.P.; Fan, Z.S.; Zhu, B.C.; Zhou, M.L.; Chen, Z.H.; Lu, J.S. A Novel Pallet Detection Method for Automated Guided Vehicles Based on Point Cloud Data. Sensors 2022, 22, 8019. [Google Scholar] [CrossRef] [PubMed]
  33. Hu, H.P.; Gu, W.K.; Yang, X.S.; Zhang, N.; Lou, Y.J. Fast 6D object pose estimation of shell parts for robotic assembly. Int. J. Adv. Manuf. Technol. 2022, 118, 1383–1396. [Google Scholar] [CrossRef]
Figure 1. Diagram of pallet position deviation. (a) Correct pose. (b) Deviation pose.
Figure 2. Flowchart of the pallet pose estimation method.
Figure 3. Schematic diagram of FPFH neighborhood.
Figure 4. A neighborhood point diagram of a key point.
Figure 5. Weight variation trend of neighborhood points of a key point.
Figure 6. Diagram of local coordinate system (u,v,w).
Figure 7. Structure of the Percipio FM851-E2 vision sensor.
Figure 8. Relative position of unmanned industrial vehicle and pallet in standard state.
Figure 9. (a) The source point cloud. (b) The key points of source point cloud.
Figure 10. Processing results of source point cloud. (a) Adaptive optimal neighborhood distribution. (b) The AGWF feature descriptor.
Figure 11. Point cloud processing results of 5° deflection angle. (a) Scene point cloud. (b) Initial relative position. (c) Target point cloud. (d) Key points of the target point cloud. (e) Coarse registration. (f) Accurate registration.
Figure 12. Scene point cloud and registration result. (a) 5° deflection angle. (b) 10° deflection angle. (c) 15° deflection angle. (d) 20° deflection angle. (e) Deviations of 0.05 m. (f) Deviations of 0.10 m. (g) Deviations of 0.15 m. (h) Deviations of 0.2 m.
Figure 13. Color image of the shelf.
Figure 14. Shelf scene point cloud.
Figure 15. Pass-through filtering.
Figure 16. Key point extraction for target point cloud. (a) Target pallet point cloud. (b) Target pallet key points.
Figure 17. Shelf scene target pallet point cloud adaptive optimal neighborhood radius.
Figure 18. The AGWF feature descriptor for a point.
Figure 19. Visualization of pallet pose estimation results. (a) Registration result. (b) Pallet pose estimation.
Figure 20. Comparison of FPFH and AGWF at a point. (a) The FPFH feature descriptor for a point. (b) The AGWF feature descriptor for a point.
Figure 21. Comparison of coarse registration results between FPFH and AGWF. (a) Coarse registration based on FPFH. (b) Coarse registration based on AGWF.
Table 1. Point cloud processing process.

Serial No. | Number of Scene Points | Number of Target Points | Number of Key Points | Time Consumed/s
a | 55,827 | 6181 | 1154 | 1.6080
b | 58,788 | 6500 | 1327 | 1.8263
c | 64,656 | 7115 | 1442 | 1.9777
d | 53,433 | 7155 | 1548 | 1.3595
e | 50,085 | 5533 | 1014 | 0.7670
f | 47,142 | 5217 | 1272 | 1.2597
g | 44,118 | 4895 | 916 | 1.2393
h | 36,054 | 4001 | 1173 | 1.0132
Table 2. Experimental results of the ground scene. (Δx and Δy in m; φ in °.)

Serial No. | Actual Deviation (Δx, Δy, φ) | Estimated Deviation (Δx, Δy, φ) | Relative Deviation (Δx, Δy, φ) | Accuracy/%
a | 0, 0, 5 | 0.0102, 0.0267, 4.8876 | 0.0102, 0.0267, 0.1124 | 97.76
b | 0, 0, 10 | 0.0109, 0.0340, 9.8656 | 0.0109, 0.0340, 0.1344 | 98.66
c | 0, 0, 15 | 0.0212, 0.0534, 15.09 | 0.0212, 0.0534, 0.1900 | 98.73
d | 0, 0, 20 | 0.0536, 0.0687, 26.53 | 0.0536, 0.0687, 6.5300 | 67.30
e | 0.050, 0.050, 0 | 0.0535, 0.0505, 0.4060 | 0.0035, 0.0005, 0.4060 | 96.00
f | 0.100, 0.100, 0 | 0.1096, 0.0988, 1.0620 | 0.0096, 0.0012, 1.0620 | 94.60
g | 0.150, 0.150, 0 | 0.1536, 0.1477, 1.2014 | 0.0036, 0.0023, 1.2014 | 98.03
h | 0.200, 0.200, 0 | 0.1928, 0.1579, 3.2503 | 0.0072, 0.0421, 3.2503 | 87.68
Total Average | | | 0.0098, 0.0194, 0.5010 | 97.30
Table 3. Comparison of experimental results of different descriptors (R = 0.012 m).

Names | R/m | T/s | RMSE | Dt/% | DRMSE/%
SHOT | 0.012 | 1.564 | 0.018234 | −7.9 | 10.99
PFH | 0.012 | 3.872 | 0.035647 | 56.6 | 54.47
FPFH | 0.012 | 1.430 | 0.017559 | −17.34 | 7.57
GWF | 0.012 | 1.678 | 0.016239 | / | /
The Dt and DRMSE are calculated as Dt = (FT − GWF)/FT, DRMSE = (FR − GWF)/FR, where FT is the feature extraction time of the traditional descriptors, and FR is the RMSE of traditional descriptors.
Table 4. Comparison of experimental results of different descriptors (R = 0.014 m).

Names | R/m | T/s | RMSE | Dt/% | DRMSE/%
SHOT | 0.014 | 1.564 | 0.018234 | 46.61 | 36.35
PFH | 0.014 | 3.872 | 0.035647 | 79.16 | 68.75
FPFH | 0.014 | 1.430 | 0.017559 | 48.15 | 36.23
AGWF | Adaptive radius | 0.952 | 0.011715 | / | /
The Dt and DRMSE are calculated as Dt = (FT − AGWF)/FT, DRMSE = (FR − AGWF)/FR, where FT is the feature extraction time of the traditional descriptors, and FR is the RMSE of traditional descriptors.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
