Next Article in Journal
A Universal-Verification-Methodology-Based Testbench for the Coverage-Driven Functional Verification of an Instruction Cache Controller
Next Article in Special Issue
Tube-Based Event-Triggered Path Tracking for AUV against Disturbances and Parametric Uncertainties
Previous Article in Journal
Pairwise Guided Multilayer Cross-Fusion Network for Bird Image Recognition
Previous Article in Special Issue
Micro-Factors-Aware Scheduling of Multiple Autonomous Trucks in Open-Pit Mining via Enhanced Metaheuristics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Joint Dispatching and Cooperative Trajectory Planning for Multiple Autonomous Forklifts in a Warehouse: A Search-and-Learning-Based Approach

1
College of Mechanical and Vehicle Engineering, Hunan University, Changsha 410082, China
2
Dongfeng USharing Technology Co., Ltd., Wuhan 430056, China
*
Author to whom correspondence should be addressed.
Electronics 2023, 12(18), 3820; https://doi.org/10.3390/electronics12183820
Submission received: 12 July 2023 / Revised: 1 September 2023 / Accepted: 7 September 2023 / Published: 9 September 2023
(This article belongs to the Special Issue Recent Advances in Motion Planning and Control of Autonomous Vehicles)

Abstract

:
Dispatching and cooperative trajectory planning for multiple autonomous forklifts in a warehouse is a widely applied research topic. The conventional methods in this domain regard dispatching and planning as isolated procedures, which render the overall motion quality of the forklift team imperfect. The dispatching and planning problems should be considered simultaneously to achieve optimal cooperative trajectories. However, this approach renders a large-scale nonconvex problem, which is extremely difficult to solve in real time. A joint dispatching and planning method is proposed to balance solution quality and speed. The proposed method is characterized by its fast runtime, light computational burden, and high solution quality. In particular, the candidate goals of each forklift are enumerated. Each candidate dispatch solution is measured after concrete trajectories are generated via an improved hybrid A* search algorithm, which is incorporated with an artificial neural network to improve the cost evaluation process. The proposed joint dispatching and planning method is computationally cheap, kinematically feasible, avoids collisions with obstacles/forklifts, and finds the global optimum quickly. The presented motion planning strategy demonstrates that the integration of a neural network with the dispatching approach leads to a warehouse filling/emptying mission completion time that is 2% shorter than the most efficient strategy lacking machine-learning integration. Notably, the mission completion times across these strategies vary by approximately 15%.

1. Introduction

The increasing demands in the logistics industry all over the world have driven researchers and engineers to focus on developing intelligent transportation systems aimed at enhancing logistics efficiency [1,2]. One prominent application in this domain is unmanned warehouse systems [3]. As a typical component in an unmanned warehouse, an autonomous forklift transports parcels more efficiently than one driven manually because the former does not induce objective mistakes such as fatigue, anxiety, impatience, or anger [4]. Multiple autonomous forklifts should work together when the delivery burden is heavy [5]. Deploying multiple autonomous forklifts enhances delivery efficiency if the inter-forklift cooperation potential can be maximized. The typical modules that influence delivery efficiency include delivery task dispatch [6], cooperative trajectory planning [7,8], and control [9]. This paper is focused on the dispatching and cooperative trajectory planning schemes.

1.1. Related Work

1.1.1. Dispatching Methods for Multiple Forklifts

A complete multi-forklift delivery planning system consists of two functions: delivery task dispatch and cooperative trajectory planning. Before cooperative trajectory planning, the goal point of each forklift is assigned in the dispatching phase [10]. Weidinger et al. [6] proposed a metaheuristic-based method in which assignment candidates are pruned a priori to facilitate the solution process [11]. A similar idea was proposed by Zhang et al. [12] to dispatch multiple automated guided vehicles (AGVs) in a matrix manufacturing workshop. However, both methods run slowly; thus, they cannot meet the real-time computation demand in a warehouse [6,12]. Lin et al. built a multi-AGV dispatching system via network structure together with simplex decision variables; in this system, an evolutionary algorithm minimizes the completion time of all AGVs in a formulated network optimization problem. However, References [6,12,13] shared a common limitation of assuming a uniform speed for AGVs. Moreover, the inter-vehicle collision avoidance problem is reduced to an oversimplified constraint, ensuring solely nonoverlapping time intervals per stopover.
Furthermore, recent studies have focused on the task integration of one AGV instead of considering how multiple AGVs cooperatively operate within the confined area. Bao et al. [14] proposed a heuristic method based on an auction strategy for a multi-AGV task dispatch scheme considering complex factors (such as pod repositioning). The concerned dispatch scheme is inherently an optimization problem with complex cost terms and constraints facilitated by the proposed auction strategy. Lee et al. [15] proposed a two-stage dispatching method. In particular, the first stage deals coarsely with the delivery efficiency and delivery flow balance by solving a bi-objective optimization problem. The result indicates how the parcels to be picked can be clustered. At stage 2, vehicles are dispatched to complete the clustered missions. Dividing the original scheme into two stages largely reduces the number of dispatch candidates without losing the optimum. Machine-learning-based dispatching methods have also been proposed [16]. The formulated reward functions efficiently simplify the dispatch scheme, particularly when complex factors are considered [17,18]. However, few vehicle kinematics is considered in [14,15], and the dispatching phase is fully separated from the trajectory planning strategy. This approach results in difficulties in maximizing overall delivery efficiency.

1.1.2. Cooperative Trajectory Planning Methods for Multiple Forklifts

Cooperative trajectory planning follows the aforementioned dispatching phase. The prevalent cooperative trajectory planners are based on model predictive control [19], which is highlighted by its fast feature while strictly satisfying safety-related constraints. The artificial potential field method is similarly widely applied in trajectory planning, but it may encounter difficulty finding paths through narrow passages [20].
Ma et al. [21] converted constrained time-varying nonlinear programming problems to general unconstrained optimization problems by properly designing a penalty function. Thereafter, a particle swarm optimization method was employed to plan the motion of multiple robots sequentially in a double warehouse with two elevators. However, the optimization phase, along with other methods based on optimization, can significantly increase the computational burden [22,23,24], which can be reduced by forming model-based paths because warehouses are generally structured.
Yang et al. [1] proposed a strategy in which a time-varying dynamic evaluation function is formed based on a network congestion diffusion model to quantify the degree of road congestion. Hereafter, an improved A* search algorithm and a time window algorithm were combined as a hierarchical planning method to search the idle path and avoid collisions. A path planning framework was designed by Zhou et al. [25] to simultaneously reduce the cost of operation and the path for AGVs in airport parcel loading scenarios. An ant colony optimization method was used to optimize the parcel pickup sequencing by ignoring other moving vehicles, while Dijkstra’s algorithm was employed to determine the shortest route of each AGV. Zacharia et al. developed a joint routing and motion planning method for AGVs that addresses uncertainties in demands and travel times. Their approach combines a scheduler for updating destination resources during navigation and integrates a fuzzy-based genetic algorithm with A* search to handle capacity and distance variations [26]. Nonetheless, the vehicle kinematics considered in [1,25,26] remain oversimplified for industrial use even in the trajectory planning phase.
In other studies, the paths of vehicles were assumed to be predefined, and only the longitude trajectories were investigated. Kneissl et al. [27] formulated a method in which potential collision zones are continuously detected. Moreover, the right-of-way is granted to the first arriving vehicle while all the other vehicles involved stop and wait. Dresner et al. proposed confronting the analogous problem of conflict zones with a reservation-based system; in this system, vehicles request and receive time slots from the intersection while they pass [28]. Similarly, a discrete-event logic, which is comparable with a conventional right-handed bidirectional traffic system, was designed by Guney et al. [29] to handle the priorities of the AGVs in a warehouse dynamically. Thus, the need for computationally demanding heuristic searches is eliminated to ease strategy implementation in real-life industrial applications. Furthermore, Digani et al. [30] proposed an obstacle-free path generation method to deal with local deviations from the predetermined path. In the proposed method, new paths are generated via polar spline curves. However, the aisles in a warehouse cannot be fully exploited when certain traffic laws in [27,28,29,30] are strictly enforced.

1.1.3. Joint Dispatching and Planning Methods for Multiple Forklifts

Most of the existing dispatching studies, e.g., [6,10,11,12,13,14,15,16,17,18], cannot accurately evaluate the candidate choices, possibly preventing the downstream planning module from achieving global optimality [1,21,22,23,24,25,26,27,28,29,30]. Thus, combining the dispatching and planning phases is naturally considered. The multi-agent path finding (MAPF) problem in its classical form is an effective approach for simplifying complex warehouse scenarios and facilitating cooperative solutions for dispatching and planning. In the MAPF problem, time is discretized into steps, allowing vehicles to either move or wait during each step [31]. Consequently, it becomes challenging to plan trajectories for vehicles with varying velocities or based on specific kinematic constraints. To address this limitation, researchers have explored extensions of the MAPF problem to accommodate such complexities. Among those extensions, Zhang et al. [32] designed a joint strategy to deal with an automatic valet parking system, in which a travel-distance-related reward function combined with a deep reinforcement learning technique was used to allocate the target parking spaces. The parking lot was segmented into local regions, and a rule-based right-of-way assignment strategy was applied to solve collisions and deadlocks. A simplified trajectory planning algorithm based on the car-following model [33] served as a tool to solve the trajectories of multiple AGVs when no potential collision was involved. A similar strategy was proposed by Lee et al. [34] for a supply-chain-connected warehouse. In their work, a cloud-based semiautomatic warehouse management system assigns tasks to mobile robots to optimize resource allocation. A robot control system executes an improved A* search algorithm to generate the path of each AGV. Then, potential collisions, named stay-on, head-on, and cross-conflict, are identified and solved by following certain priority-based rules. Redispatchment of the AGV with low priority is triggered as the conflict cannot be prevented by those basic rules.
With regard to joint strategies, the studies above [32,34] can deal with large-scale AGV-based scenarios. However, these studies were concerned with vehicles possessing the simple kinematics of unicycle (differential-drive) robots [35] and generally focused on the construction of maneuverable systems, ignoring the overall optimal solutions concerning warehouse operations. Furthermore, during the trajectory planning phase, they initially planned only the paths and ignored other moving obstacles. Such considerations substantially reduce the risk of collisions and simplify the evaluation of the traveling difficulties pertaining to one potential task relative to the corresponding traveling distance. In other words, it remains unknown the specific trajectory pertaining to each AGV as the jointed strategy is finished, and this trajectory is dependent on local scenes when conflict zones are involved. Thus, the results become unreliable when such strategies are applied to a warehouse of automated forklifts with complex kinematics. As a conclusion of this subsection, it deserves to develop a joint dispatching and planning method to balance the forks’ motion quality and reaction speed in an unmanned warehouse.

1.2. Motivations

This study aims to substantially improve the efficiency of cooperative operations among multiple autonomous forklifts by seamlessly integrating the dispatching and cooperative trajectory planning phases. Our primary objective is to address the limitations of existing dispatching methods, which often overlook low-level forklift kinematic capability. To overcome this challenge, we opt for the implementation of a graph search process in this phase. Moreover, to ensure a robust solution that avoids getting trapped in local optima, we chose to incorporate a machine-learning-based technique. In the trajectory planning phase, we recognize that optimization-based methods are computationally expensive. As such, our secondary objective is to develop an alternative search algorithm that employs a model-based approach. This algorithm is designed to be both velocity-aware and sequentially solvable, striking a balance between accuracy and computational efficiency.

1.3. Contributions

The core contribution of this paper is the proposal of a joint framework, which is promising to reduce the computational burden because all formulations involved are explicitly expressed. Concretely, the dispatching stage can enhance the multi-vehicle task solution quality because it considers the future trajectory pertaining to each forklift. Moreover, the kinematically feasible and safe trajectory of each forklift can be quickly generated through our proposed method at the trajectory planning stage, due to the removal of optimization-based methods.

1.4. Organization

In the rest of this paper, Section 2 formulates the in-warehouse delivery problem. Section 3 provides the score-based dispatching technique, in which ANN is applied to avoid deadlocks in evaluating the cost of each candidate dispatching option. Section 4 introduces the trajectory planning method, namely a model-based velocity-aware hybrid A* search algorithm. Section 5 integrates the two aforementioned methods to develop a joint dispatching and cooperative trajectory planning framework, followed by Section 6, where comparative simulation results are present. Conclusions are drawn in Section 7, finally.

2. Problem Statement

Forklifts are used to deliver goods between fixed picking stations and predetermined shelf areas during delivery tasks in warehouses. The passages are generally designed to be narrow, and they merely allow turning maneuvers with a minimum radius and the passing of only two vehicles. Hence, conflicts arise when multiple forklifts cooperatively operate within a single warehouse.
Within one subtask during the filling of one warehouse, there are two stages: first, the initial pose and the final one should be assigned to one forklift as the dispatching stage; second, the trajectory planning stage generates a trajectory by avoiding collisions with any static or moving objects and satisfying the vehicle kinematics.

2.1. Warehouse Layout

A typical small warehouse layout is schematized in Figure 1. In this warehouse, six separated shelf clusters, denoted as s d ( d = 1 , 2 , , 6 ), are placed and provide areas to store goods. Continuous lines indicate shelf walls, through which forklifts cannot move. Two firewalls, represented by rows of grey squares, are present between shelf cluster s 1 and s 3 as well as cluster s 2 and s 4 . Meanwhile, four picking stations, marked with slender solid rectangles and p r ( r = 1 , 2 , 3 , 4 ), are located in both extremities of the vertical and wide passage in Figure 1. Four forklifts can enter the passage of each row of the shelf clusters, as long as neither stored stacks nor other vehicles block the route.
As presented in Figure 2, each shelf cluster has the capacity to accommodate varying numbers of goods stacks with strategically positioned notches in the arrangement designed to suit forklift kinematics during turns. This aspect will be elaborated on in Section 2.2. Each stack is marked with a number in Figure 2 to represent the order in which a shelf cluster can be filled. The shelf filling state and the vehicle state can be effectively expressed by noting the covered grids when the warehouse is divided by the squares outlined by grey dashed lines in Figure 2. In addition, when one forklift is unloading goods within a shelf cluster, its fork side should point to the stack position (cf. upper left forklift schematic and within shelf cluster s 1 in Figure 1). Similarly, when one forklift is picking goods at a picking station, the fork side should point to the station position (cf. upper forklift schematic and picking station p 1 in Figure 1).

2.2. Kinematics of a Forklift Vehicle

As reported in Figure 3, a forklift can be described as a front-steering vehicle if the fork part of the vehicle is treated as the rear side. The corresponding kinematic formulas write:
d x ( t ) d t = v ( t ) cos θ ( t ) d y ( t ) d t = v ( t ) sin θ ( t ) d v ( t ) d t = a ( t ) d θ ( t ) d t = v ( t ) tan ϕ ( t ) l d ϕ ( t ) d t = ω ( t )
where t is time; P , located at coordinate ( x , y ), indicates the mid-point of the rear wheel axis; and θ , v , a , ϕ , and ω respectively denote the orientation angle, linear velocity pertaining to point P , acceleration, steering angle of the front wheels, and steering rate. Furthermore, l stands for the wheelbase length, m denotes the rear overhang length, n refers to the front overhang length, and 2 b is the car width. Given that the initial values as well as ω t and a t are provided, the state variables can be calculated through integration over the dynamic process.
Meanwhile, a few boundaries are imposed on the state profiles over the entire simulation period throughout all dynamic maneuvers:
a ( t ) a max v ( t ) v max ϕ ( t ) ϕ max ω ( t ) ω max
where a m a x , v m a x , ϕ m a x , and ω m a x respectively indicate the upper limits of the corresponding variables.

3. ANN-Combined Score-Based Dispatching Approach

Filling a warehouse in an orderly manner requires several forklifts to perform multiple deliveries and return subtasks. In the current work, subtasks are assigned to different vehicles sequentially. With regard to such subtasks, selecting which vehicle will be used to plan the new trajectory and determining which goal coordinates the forklift is going to should comprise the fundamental initialization. Therefore, a dispatching system is necessary to solve these problems.
The dispatching approach (cf. Algorithm 1) utilizes the planned trajectories T of all vehicles, along with map information m a p , and vector F representing filled stacks for different clusters, as its inputs. The core of this approach lies within a while loop, wherein the potential subtask undergoes iterative updates (with a preset maximum iteration number i t e r d i s p a t c h ) until it reaches an optimal state, as defined by the proposed method.
Outside of the loop, the function rank ( ) sorts all vehicles based on t , arranging them in ascending order from first to last. This sorting process generates a ranking vector R consisting of four vehicle indices. If f a i l = 1 , indicating a failure in the trajectory search between P i x i , y i at instant t i and P f x f , y f , the corresponding vehicle is repositioned at the end in R and flagged as having been selected as n c u r r e n t . In the loop, the algorithm is divided into two parts. The first one (lines 4 to 13) concerns the selection of the current investigated vehicle with an index of n c u r r e n t , and the second part (lines 14 to 30), featuring a scoring system combined with the results of an ANN method, determines the goal coordinate for the current subtask. Notably, Algorithm 1 is applied when the ANN correction system, which is elaborated on in Section 3.2, is enforced.
Algorithm 1: ANN combined score-based dispatching algorithm
n c u r r e n t , P i , P f , t i   Dispatch T , F , f a i l , P i , P f , m a p
1.
Initialize α 0 ;
2.
R rank T , P i , P f , f a i l ;
3.
while  i t e r < i t e r d i s p a t c h , do
4.
if α = 0 , then
5.
   n c u r r e n t , t i SelectInitialState R ;
6.
else
7.
  if CheckSelection R > 0 , then
8.
    n c u r r e n t , t i SelectAlteredState R ;
9.
  else
10.
    n c u r r e n t , t i SelectBackupState R ;
11.
  end if
12.
end if
13.
P i = SetInitialPose ( T , n c u r r e n t , m a p ) ;
14.
if CheckDeliverTask P i is true, then
15.
   S d = PreAstarDeliver P i , F , m a p ;
16.
  if max S d > 0 , then
17.
    P f = SetFinalPose S d , m a p ;
18.
   return;
19.
  else
20.
    α 1 ;
21.
  end if
22.
else
23.
   S r = PreAstarReturn P i , m a p ;
24.
  if max S r > 0 , then
25.
    P f = SetFinalPose S r , m a p ;
26.
   return;
27.
  else
28.
    α 1 ;
29.
  end if
30.
end if
31.
end while
32.
return;

3.1. Vehicle Selection and Initial Pose of a New Subtask

The function SelectInitialState ( ) selects the vehicle, ranking the first one as n c u r r e n t , and sets t i , which is the initial instant of the trajectory to be planned, as the ending instant of the last trajectory pertaining to n c u r r e n t . Failure may occur in the determination of the new trajectory for the fork n c u r r e n t in the new subtask because other forklifts may block the only corresponding route for a considerable time. Under such circumstances, a flag variable α is set to 1, and all vehicles in R are checked to see if they have been selected as n c u r r e n t once for the current subtask by CheckSelection ( ) . The function SelectAlteredState ( ) is then utilized. In this function, the first motion-finished vehicle is discarded, and the other forklifts are subsequently selected in turn as n c u r r e n t on the basis of the rankings in R until the trajectory can be formed. Furthermore, no trajectory can be successfully planned for all forklifts at certain moments. In this case, SelectBackupState ( ) is applied, in which the vehicle that ranks last in R is selected, and the t i of the new trajectory is postponed for a fixed time length of t i relative to the end of the last subtask for vehicle n c u r r e n t . The vehicle n c u r r e n t final stopping pose P i x i ,   y i , θ i is set as the initial pose of the new subtask by function SetInitialPose ( ) .

3.2. Scoring System

A scoring system is applied to decide the goal coordinate of the new subtask. First, the function CheckDeliverTask P i is initially employed to ascertain whether the planned trajectory involves heading to clusters or returning. The functions PreAstarDeliver ( ) and PreAstarReturn ( ) are then applied separately depending on whether the goal is a rack cluster or a picking station. In both functions, the grid networks, outlined by light colors in Figure 2, indicate the nodes used to define the location of a vehicle and stacks. The resolution of such nodes is purposely reduced with the aim of lowering computation costs. Given that the nodes are defined, a time dimension involved preliminary A* search algorithm, whose expansion manner is presented in Figure 4, is used to generate preliminary trajectories that link the starting pose P i x i , y i at instant t i to each potential target coordinate. Five patterns in total for this search algorithm are applied to vaguely indicate the possible maneuvers a vehicle could perform. In particular, manner 5 in Figure 4 expands only in the time dimension, representing the stopping condition of a virtual forklift. The time consumed derived from this algorithm for a virtual vehicle represented by one node is then applied as a parameter to evaluate the difficulty grades of reaching different goal poses. Other vehicles and walls are treated as obstacles during the search. Notably, the orientation angles θ for the initial and final poses are not required to be determined in such a system. Thus, θ is not considered a dimension in this search for the sake of calculation simplification.
In the function PreAstarDeliver ( ) , the function S d , 0 is applied to evaluate the scores pertaining to different target stack locations s d (cf. Figure 1) when a subtask with one stack as the goal is considered. It writes:·
S d , 0 = C 1 G d + J d , 0 J d , 0 = H + C 2 I d + C 3       preliminary   trajectory   is   planned   C 4 preliminary   trajectory   planning   is   failed   within   i t e r p r e H = Δ t c o v e r I d = t f i n d t i + C 5 s d s 1 , s 2 t f i n d t i s d s 3 , s 4 , s 5 , s 6
where G d denotes the number of stacks to be filled/emptied in the target shelf cluster s d in order to balance the warehouse filling/emptying mission in different clusters. J d , 0 stands for the approximate difficulty to reach different goal coordinates in shelf cluster s d . C 4 is a negative constant applied when the goal in practice is not reachable. H indicates time length t c o v e r . t c o v e r counts the time units during which other vehicles occupy the shelf entrance node (cf. the grey grid for the upper left forklift in s 1 in Figure 1) of the goal pose within a fixed time length t e n t r y after the virtual vehicle entering the rack passage. The reason t c o v e r is added as a parameter is that, in practice, the availability of the above-mentioned node is critical during the planning of the final trajectory that considers kinematics. I d is a function of the time length t f i n d t i consumed to arrive at the target in this preliminary search. Notably, this function is built to normalize the results for different shelf clusters because shelf clusters s 1 and s 2 are far from the picking stations, and delivering goods to these locations consumes much time. Furthermore, a negative constant C 4 is assigned to J d , 0 when the target cannot be reached through the search within a predefined maximum iteration number i t e r p r e . Meanwhile, C 1 , C 2 ,…, C 5 are calibration parameters. Among these, a substantial weighting coefficient, C 1 , is allocated to regulate the stacks filled in each shelf cluster; aiming for balance, C 2 , C 3 , C 4 , and C 5 are designed to quantitatively assess scores with respect to time considerations. If S d , 0 0 is derived within i t e r p r e , the vehicle kinematics-considered trajectory is difficult to find. Thus, the flag variable α is set to 1 in the initialization cycle, where ANN is not enforced, and n c u r r e n t should be reassigned.
Similarly, in the function PreAstarReturn ( ) , only the time consumed with respect to different picking stations p r (cf. Figure 1) in the A* search is used in the evaluation of S r , which is expressed as:
S r = I r     preliminary   trajectory   is   planned 0     preliminary   trajectory   planning   is   failed   within   i t e r p r e
When ANN is not enforced, the potential targets are initially scored solely by means of Equations (3) and (4). The greater the functions to be evaluated are, the greater the likelihood of subsequent trajectory planning is and the faster the entire warehouse can be filled. In this case, arg   max   S d , 0 and arg   max   S r are selected as the goal poses for delivery and return subtasks, respectively.

3.3. ANN Correction Method

The function PreAstarDeliver ( ) is employed to refine goal score evaluations through a multilayer perceptron (MLP) network, which is elaborated upon as follows.
Figure 5 presents a typical MLP network of ANN with one hidden layer. Mathematically, with the trajectory planning states as known variables, the MLP network of the type reported in Figure 5 can be expanded step by step as follows:
y ^ ( w , W ) = F ( j = 1 m W j h j ( w ) + W 0 ) = F ( j = 1 m W j f j ( i = 1 n w j i z i + w j 0 ) + W 0 )
where w j i and W j denote the weights assigned to the connection of the neurons. W o and w j 0 are linked to the bias, whose values are simply the constant 1.
The ANN correction in the current study is designed for the scoring system for the goal pose determination of delivery subtasks. In the initialization phase of the ANN correction system, excluding the filling balance parameter of G d in Equation (3), J 1 , 0 , J 2 , 0 , , J 6 , 0 respectively denote the base values of the output elements in J = J 1 , e s t , J 2 , e s t , , J 6 , e s t (cf. y ^ in Figure 5) for six MLP networks. Among the six elements, the one whose corresponding pose is selected as the target for vehicle kinematics-considered trajectory planning is further fixed based on the corresponding trajectory variables. The values of the other elements remain as unchanged as the results in Equation (3). Suppose that s d is the shelf cluster investigated within a subtask. Given that the base value S d , 0 is derived with Equation (3), if s d that corresponds to arg   max   S d , 0 is selected as the goal to determine the trajectory, then the trajectory with the goal of s d will be planned, and the corresponding element in J will be derived with Equation (6) as follows:
J d , e s t = J d , 0 C 6 ( t f t i t ¯ + C 7 ) trajectory   is   planned ,   and   d 1 , 2 J d , 0 C 6 ( t f t i t ¯ + C 8 ) trajectory   is   planned ,   and   d 3 , 4 , 5 , 6 C 5 trajectory   planning   is   failed   within   i t e r p r e J d , 0 trajectory   is   not   planned
where t f is the ending instant of the trajectory, t ¯ stands for the average value of the time length pertaining to all previously derived trajectories, C 6 is a calibrated constant intended to balance J d , 0 , and the latter solely accounts for time consumed to reach a goal without factoring in the distance covered. C 7 and C 8 are respectively used to normalize the difference in distances corresponding to various shelf clusters and the expected moving time period of each subtask.
Similar to the parameter G d in Equation (3), the inputs (cf. z 1 ,   z 2 , ,   z n in Figure 5) of the ANN system are the number of stacks filled within each shelf cluster. A vector U with six elements { u 1 ,   u 2 , ,   u 6 }, which correspond to six shelf clusters s d , is used. Suppose that u d U is considered, it can be formulated as
u d = G d delivery   subtask   is   linked G d return   subtask   or   subtasks   of   both   types   are   linked 0 no   subtask   is   linked
The integers other than 0 can represent the filling states of the shelf clusters, which are associated with the currently moving forklifts. The vehicle motion states are also observed. This input of the MLP network vaguely provides information associated with the possible area the vehicles may be located in, given that each shelf cluster should be filled by following certain orders. Furthermore, at least two shelf clusters are not connected to any subtask because only four forklifts in the warehouse are employed. Evidently, these shelf clusters have no impact on the trajectory planning, and this situation is in line with the circumstances, where the investigated shelf group is filled. Thus, 0 is assigned to u d in this case.
As the number of hidden neurons is set to 12 according to an empirical technique [36], the MLP system used to score shelf cluster s d can be expressed in the form of Equation (5) as
J ^ d , e s t ( w d , W d ) = F ( j = 1 12 W d , j h j ( w d ) + W 0 ) = F ( j = 1 12 W d , j f j ( i = 1 6 w d , j i u d , i + w d , j 0 ) + W 0 )
The Levenberg-Marquardt method is then used to train the MLP and the values assigned to the elements of W and w .
In the following warehouse filling cycles and the final dispatching system, the scoring system described in Equation (3) is replaced by the expression below to determine the optimal goal pose for vehicle n c u r r e n t . Equation (9) is the final scoring equation of the discussed dispatching system, wherein C 9 and C 10 act as the calibration constants. These constants are determined via a trial-and-error approach to yield results. The MLP contributes without excessively disrupting performance concerning warehouse filling/emptying time.
S d = C 1 G d + J d J d = C 9 J d , 0 + C 10 J ^ d , e s t ( w d , W d )
After the initialization cycle of warehouse filling (referring to Figure 6), the training process can continue until C y cycles are finished. In the following cycles, the MLP has already been established depending on the data pertaining to the previous cycles, and the saved values of W and w are used to estimate the values J ^ d , e s t ( ) via Equation (8). Therefore, the base values of the MLP outputs J d , 0 are replaced by J d in Equation (9) when shelf cluster s d is considered. The expression of the corresponding element in J for the following cycles writes:
J d , e s t = J d C 6 ( t f t i t ¯ + C 7 ) trajectory   is   planned ,   and   d 1 , 2 J d C 6 ( t f t i t ¯ + C 8 ) trajectory   is   planned ,   and   d 3 , 4 , 5 , 6 C 5 trajectory   planning   is   failed   within   i t e r p r e J d trajectory   is   not   planned

3.4. Final Pose Selection

The final pose of each subtask is determined using the SetFinalPose ( ) function. For subtasks with the purpose of delivery, the destination coordinate P f x f , y f is chosen based on the arg   max   S d criterion; for return subtasks, the coordinate P f x f , y f corresponding to arg   max   S r is selected. Finally, the corresponding θ information should be added to complete the destination pose.

4. Improved Hybrid A* Search Algorithm

Hybrid A* algorithm [37] is an extension of the conventional 2D A* search algorithm due to its consideration of kinematics during node expansions over time. Different from the original hybrid A* search, the velocity variation is considered, and the node expansion manner is determined based on action purposes in the algorithm (cf. Algorithm 2) applied in this study. Furthermore, this improved heuristic method directly determines trajectory details throughout a subtask without an optimization stage.
Algorithm 2 is used to expand a parent node ( P p , t p ) in one manner N c through the improved hybrid A* search with the target of P f . Focusing on the motion of one vehicle, L i s t o p e n is used to store the data pertaining to all nodes ( P o p e n , t o p e n ), which has been explored and can be further expanded. The information contains the corresponding parent nodes, expansion manner (cf. Section 4.1), time expansion data (cf. Section 4.2), and costs (cf. Section 4.4). On the contrary, the node ( P c l o s e d , t c l o s e d ) cannot be expanded anymore and is stored in L i s t c l o s e d . Furthermore, the function DetectCollision ( ) is discussed in Section 4.3. In addition, the function AddNode ( ) is used to add a node with its affiliating data into L i s t o p e n or L i s t c l o s e d .
Algorithm 2: Improved hybrid A* search algorithm
σ , L i s t o p e n   SearchAStar T , N c , P p , P f , t p , L i s t o p e n , L i s t c l o s e d , m a p
1.
t h i g h , c , t m i d , c , t z e r o , c FixMovingTime L i s t o p e n , P p , t p , N c ;
2.
L i s t o p e n SetCost L i s t o p e n , P p , t p , N c ;
3.
γ DetectCollision T , N c , P p , t p , L i s t o p e n , m a p ;
4.
if P c , t z e r o , c L i s t c l o s e d   , then
5.
return;
6.
end if
7.
if P c , t z e r o , c L i s t o p e n  and  γ = 0 , then
8.
if f c < f p r e , then
9.
   L i s t o p e n ReplaceNode L i s t o p e n , P p , t p ;
10.
end if
11.
else
12.
if γ = 0 , then
13.
   L i s t o p e n AddNode L i s t o p e n , P c , t z e r o , c ;
14.
  if P c = P f , then
15.
    σ 1 ;
16.
   return;
17.
  end if
18.
else if γ = 1 , then
19.
   L i s t c l o s e d AddNode L i s t c l o s e d , P c , t z e r o , c ;
20.
  return;
21.
end if
22.
end if
23.
return;

4.1. Node Expansion Method

This section elaborates on the various possible expansion manners denoted as N a l l , with each individual possibility represented by N c . The drivable area is initially mapped with the above-discussed grid networks (cf. Figure 2). The dimensions of the grid should be skillfully coupled with the size of the vehicle, with the aim of reducing occupied cells during a certain action and enhancing the utility rate of the space. The case demonstrated in Figure 7 is a well-designed example. In this case, one vehicle covers two grid cells. Thus, on a 2D space domain, the orientation angle θ involved in forklift movements can be easily described. In addition, through one maneuver, the vehicle body occupies a small number of grids. This can reduce the possibility of interference with other forklifts during motion planning.
As reported in Figure 7a,b, the nodes expanding manners N a l l in the space domain can be divided into eleven patterns, which are summarized as four categories covering all possible maneuvers a forklift may intend to conduct. These include stopping, going straight (cf. semitransparent solid rectangles in Figure 7a), lane changing (cf. rectangles with diagonal stripes in Figure 7b), and turning (cf. solid rectangles in Figure 7b).
As far as the expanding manners are concerned, when v remains 0, one vehicle stops. As v is other than 0 with steering angle ϕ = 0 , one fork can go or reverse straight. Figure 8 demonstrates one modeled forklift path depicted by consecutive outlines, when the vehicle goes upward and turns to the left from the right lower side to the center left. Through this maneuver, ϕ is varied to gain an identical final location relative to grids as the initial state, despite the change of π / 2 in θ . Under such circumstances, the consecutive motions can be easily established. Meanwhile, the modeled maneuver of reverse turning to the left can also be noted in Figure 8, when the initial and final body outlines are exchanged.
Similar to turning actions, a typical lane-changing maneuver is also modeled and outlined in Figure 9. The forklift goes forward and changes to the left lane, which is shown on the top row in Figure 9. The other possible node-expanding paths of turning and lane changing are modeled by mirroring or rotating the examples reported in Figs. 8 and 9.
The smoothness of the modeled paths can be enhanced by adopting advanced techniques, and the identical heuristic trajectory search rule of this work can also be applied to the newly modeled paths.

4.2. Velocity Planner

The velocity planner determines the node expansion rule in the time domain and the potential node arrival instants. When one maneuver starts or finishes, the vehicle speed solely falls into one of three determined constants ( v h i g h , v m i d and v z e r o ), which correspond to high speed, mid speed, and zero speed. Figure 10 reports all possible velocity selections linked to the start and end of the maneuvers. v h i g h can only be selected with the expansion pattern of going straight to fulfill the kinematic constraints given by Equations (1) and (2). The stop maneuver is an expansion of finite value solely on the time domain, it follows that only v z e r o can be applied to this action.
The velocity level over one maneuver can only be linked to itself or an adjacent one, although all maneuvers can be arbitrarily linked along a trajectory. In other words, during one maneuver, the vehicle speed can maintain or alter among those three velocities, but the direct change between v z e r o and v h i g h is illegal. For instance, if the vehicle finishes a series of going straight actions with speed v z e r o , its ending velocity of the last second action should then be v m i d or v z e r o . Likewise, if turning follows going straight with an original velocity v h i g h , the speed of the last going straight maneuver should reduce to v m i d to gain an initial velocity v m i d for turning.
Following the velocity selection rules, all possible speed variations through one maneuver are shown in Figure 11. It is noteworthy that the link of v h i g h →Going straight→ v z e r o and the opposite link are infeasible. Thus, 16 connection choices in total are applicable. Furthermore, velocity is merely an intermediate state used to determine a time dimension expansion, although it is a vital parameter for motion planning. The detailed velocity time history during one maneuver can be modeled by applying varied techniques to obtain the required initial and finishing speeds. In the current work, deceleration or acceleration along one action between the same velocity levels is modeled with identical time lengths, with the aim of simple calculation. The number of time expansion selections (cf. T e with e 1 , 2 , 12 in Table 1) thus reduces to 12.
The function FixMovingTime ( ) determines time expansion data based on Table 2, which is elaborated as follows. When planning a trajectory, the maximum feasible velocities are consistently selected for all maneuvers with zero speed supplied to the start and end of the trajectory. With regard to one node expansion, three values of t h i g h , p , t m i d , p and t z e r o , p are initially saved in the parent node as the possible start instant of the maneuver, and these respectively correspond to ending velocities of v h i g h , v m i d , and v z e r o , if they exist. In the cases where v h i g h is not reachable, the value stored in t h i g h , p will be the minimum time length during the maneuver, with ending velocity v m i d . Consequently, t h i g h , p = t m i d , p . Similar result of t m i d , p = t z e r o , p is obtained when the maximum realizable ending velocity is v z e r o . As the parent node is initialized, values of t h i g h , c , t m i d , c , and t z e r o , c are derived and saved for the child node with the identical manner for the parent node. Furthermore, the starting velocity of a new expansion may be imposed as v z e r o , combined with the node expansion type of the previous maneuver. A stopping flag μ is then set as 1. Such a case happens when an expansion of stop occurs or moving direction of a vehicle is reversed. In the rest of the working conditions, the flag μ remains 0. Notably, a node is of four dimensions, x , y , θ , and t ; among them, t is an index used in the graph search, and more than one value stored in one index could complicate the problem. Consequently, only the v z e r o -related time instant t z e r o is stored as the node index.
Meanwhile, each node stores three ending time instants. Thus, one algorithm should be applied to select the exact ending instant of each node. This algorithm first chooses the minimum time t h i g h of each maneuver over the path. The deceleration phase is then imposed on the last two nodes of the trajectory, and the corresponding t m i d and t z e r o are in turn assigned to these two nodes as the maneuver-finishing time instants. As the above methods are implemented from the start to the end of the initialized trajectory, a few modifications are imposed to fix the node time indices t in certain circumstances. For instance, when μ = 1 , the maximum achievable ending time t p for the former maneuver node is determined by selecting the value of t m i d , p . In this case, the ending velocity of this maneuver turns to be v m i d . Meanwhile, in the same combination of maneuvers, if a previous maneuver exists and t p p > t m i d , p p , t m i d , p p is assigned to t p p , as the prior maneuver cannot achieve a higher velocity.

4.3. Collision Detection Strategy

The pseudocode of the collision detection function DetectCollision ( ) is recorded in Algorithm 3. A variable γ is used as a flag to indicate the validity of the node expansion as well as the type of collisions that may have occurred. When the current expansion N c is valid, γ is set as 0. γ = 1 means that at least the ending pose of the expansion risks colliding, and γ = 2 signifies that the invalidity is only found in the link between the starting and final locations. This algorithm consists of two parts, which separately refer to collisions that occurred with ending and intermediate pose of the vehicle. As regards both circumstances, collisions should be avoided throughout t t m i d , p , t z e r o , c .
Algorithm 3: Collision detection algorithm
γ DetectCollision T , N c , P p , t p , L i s t o p e n , m a p
1.
Initialize  γ 0 ;
2.
P c FindEndingPose P p , N c ;
3.
P i n FindIntermediatePose P p , N c ;
4.
if  CheckStaticCollision P c , m a p  is truethen
5.
γ 1 ;
6.
return;
7.
end if
8.
t z e r o , c , t m i d , p FixParkingTime P p , t p , L i s t o p e n , N c ;
9.
L i s t c u r , 1 FindFinalGrids P p , N c ;
10.
L i s t o t h e r FindObstaclesGrids T , t z e r o , c , t m i d , p ;
11.
if  CheckDynamicCollision L i s t c u r , 1 , L i s t o t h e r  is truethen
12.
γ 1 ;
13.
return;
14.
end if
15.
if  CheckStaticCollision P i n , m a p  is truethen
16.
γ 2 ;
17.
return;
18.
end if
19.
L i s t c u r , 2 FindIntermediateGrids P p , N c ;
20.
if  CheckDynamicCollision L i s t c u r , 2 , L i s t o t h e r  is truethen
21.
γ 2 ;
22.
return;
23.
end if
24.
return;
Static obstacles correspond to the walls of the shelves and warehouse, and the related collision detection method is expressed on lines 4 to 7 in Algorithm 3. FindEndingPose ( ) is used to obtain the current pose P c x c , y c , θ c based on the parent pose P p x p , y p , θ p and the current expansion manner N c . The function CheckStaticCollision ( ) is applied to check if collisions with static obstacles exist. This check is first judged by identifying the validity of the final pose. When it comes to maneuvers of turning and lane changing, an additional intermediate pose (cf. thick dashed rectangular in Figure 8 and Figure 9), obtained through FindIntermediatePose ( ) , should be further examined (cf. lines 15 to 18 in Algorithm 3). Focusing on the collision avoidance constraints formulation between the point Q j ( j = 1 , , N obs and N obs denotes the obstacle point number) and the current investigated vehicle featuring vertexes A , B , C and D . A collision forms when Q j enters the rectangle A B C D . The restriction that Q j is located outside of the rectangle A B C D can be formulated by applying a triangle-area-based criterion [38],
S P i A B + S P i B C + S P i C D + S P i D A > S A B C D
where S Δ indicates the triangle area, and S refers to the rectangle area. Applying Equation (11) to every node expansion with respect to every obstacle point, the static obstacle collision judgement is yielded.
Dynamic obstacles in the current study are only forklifts, whose motions have been saved in T , and the function FixParkingTime ( ) is applied to find t m i d , p in L i s t o p e n and t z e r o , c by calling Table 2. As reported in Figure 8 and Figure 9, the highlighted grids indicate the approximate area occupied during the movements of turning and lane changing. Similarly, the covered grids pertaining to other maneuvers in the same category can be determined by mirroring or rotating the highlighted grids (cf. Figure 8 and Figure 9). The functions FindFinalGrids ( ) and FindIntermediateGrids ( ) are respectively used to record the covered grids of the final pose P c and the intermediate pose P i n for the current node expansion patterns.
Without loss of generality, let us focus on the collision avoidance constraint formulation between the vehicle n c u r r e n t and the vehicle n k ( n c u r r e n t , n k = 1 , , 4 and n c u r r e n t n k ). The function FindObstaclesGrids ( ) is first utilized to search for the covered grids by the vehicles n k , whose trajectories are not being planned, with respect to time. It is possible that no actions of the forklift n k have been determined during the period when the current maneuver of vehicle n c u r r e n t could occupy. Under such circumstances, the grids that the vehicle n k finally parks are treated as the covered ones.
Finally, CheckDynamicCollision ( ) is used to detect if L i s t c u r , 1 or L i s t c u r , 2 is going to simultaneously cover the grids already stored in L i s t o t h over the valid time. Notably, during the collision detection, the current action is virtually regarded as the final maneuver with v z e r o set as the finishing velocity with t z e r o , c selected, thus an equal or longer time length of this maneuver in actual operation is considered. This treatment can enhance the safety performance to certain extents.
Notably, referring to Algorithm 3, the types of failure through an expansion have been noticed when one expansion has failed. This provides a tool to distinguish if one child node should be closed or skipped because that failure can be simply due to the intermediate trajectory of the action is interfered, while the corresponding child node could be valid in other situations.

4.4. Trajectory Cost Function

The function SetCost ( ) is explained in this section. With regard to one node expansion, the cost function f is the sum of two parts, as reported in Equation (12),
f = g + C 11 h
in which g stands for the cumulative cost from the initial pose to the current pose and h indicates the estimated cost from the child node of the current expansion to the target node of the trajectory being defined. C 11 is a calibrated weighting aimed at achieving a balance between the computational resources used for searching and the resultant trajectory’s quality. In total, it is expected that one forklift complete a delivery or return subtasks subjected to the minimum travelling time of t f t i . Meanwhile, times of turning, lane changing, and speed inverse maneuvers should be minimized in order to reduce unnecessary movements, which may decrease the traffic capacity of the passages.
As far as the function of trajectory to the goal is concerned, Equation (13) is applied. We used the Manhattan distance plus another function of θ c . Both the distance and the angle are evaluated in times of certain characteristic dimensions.
h = x g x c / Δ x + y g y c / Δ y + θ g θ c / ( π / 2 )
where x = y , indicating the grid side length; x g , y g , and θ g correspond to the goal space coordinates; x c , y c , and θ c refer to the coordinates of the current expansion child node.
The passed trajectory cost function that characterizes the time consumed from the starting pose of the trajectory to the current one writes:
g = t z e r o , c t i + p p = p t u r n m t u r n + p l a n e m l a n e + p i n v m i n v
where p t u r n , p l a n e , and p i n v respectively indicate the predefined penalties pertaining to single time of turning, lane changing, and speed inverse. m t u r n , m l a n e , and m i n v denote the cumulative number of corresponding maneuvers from the initial pose to the current child node along the trajectory being defined.
In addition, if one node originally stored in L i s t o p e n is reached a second time with a reduced value of f c during a node expansion compared to the previous saved f p r e for the same nodes, the function ReplaceNode ( ) is used to switch the parent node to the parent node of current expansion, with the aim of reducing calculation time length and optimizing the trajectory being planned.

5. Joint Dispatching and Cooperative Trajectory Planning Framework

The trajectories of forklift vehicles in the warehouse are sequentially determined in the complete cooperative operative algorithm (cf. Algorithm 4). Generally, the dispatching technique is first applied to select the current vehicle index as well as the goal coordinate in the space domain for the current trajectory planning subtask. Subsequently, an improved hybrid A* search algorithm is used to determine all details pertaining to the newly planned trajectory T . The above methods are repeated until all stack locations in the warehouse are filled, as the warehouse filling state F is being updated.
The function CheckFilling ( ) is used to derive the number of unfilled stacks. The function FindInitialPose ( ) is then applied to find the parent node P p , t p for the next expansion with minimum cost f found in L i s t o p e n and to simultaneously remove this node from L i s t o p e n .
A maximum number of iterations i t e r s e a r c h in the improved hybrid A* algorithm is induced to break the endless iterations that may derive a trajectory involving an unacceptable waiting period. The improved hybrid A* search algorithm is finished by satisfying any one of the three criteria, which are, respectively, the goal coordinate reached as a new child node, the iteration number exceeding i t e r s e a r c h , and no node saved in L i s t o p e n . Among them, only the first criterion indicates the trajectory of the current subtask is successfully planned, and a corresponding flag σ is thus set as 1 to declare this success.
The function GenerateTraj ( ) is employed to generate the trajectory with the newly planned trajectory by backtracking from P f to P i with t f , z e r o as the ending instants. During the backtracking, the maximum realizable velocities of the intermediate nodes are selected based on the manner depicted in Figure 11 and Table 2.
Finally, the function UpdateFillingState ( ) updates F as the stack located at P f x f , y f , θ f has been filled.
Algorithm 4: Cooperative operation algorithm
T , F OperateCooperative T , F , i t e r s e a r c h , m a p
1.
while  CheckFilling F > 0
2.
n c u r r e n t , P i , P f , t i Dispatch T , F , f a i l , P i , P f , m a p ;
3.
σ 0 ;
4.
f a i l 0 ;
5.
t h i g h , p t i , t m i d , p t i , t z e r o , p t i ;
6.
L i s t o p e n P i , t i , L i s t c l o s e d ;
7.
while  L i s t o p e n  or  i t e r i t e r s e a r c h  or  σ 1 do
8.
   [ L i s t o p e n , P p , t p ]   FindInitialPose L i s t o p e n ;
9.
   L i s t c l o s e d   AddNode L i s t c l o s e d , P p , t p ;
10.
  for each  N c N a l l do
11.
    σ , L i s t o p e n   SearchAStar T , N c , P p , P f , t p , L i s t o p e n , L i s t c l o s e d , m a p ;
12.
  end for
13.
end while
14.
if  σ = 1 then
15.
   T GenerateTraj P i , P f , t i ;
16.
   F UpdateFillingState T ;
17.
else
18.
   f a i l 1 ;
19.
end if
20.
end while
21.
return;

6. Numerical Experiments

Simulations of filling and emptying missions were conducted on the MATLAB 2021b platform, utilizing the parametric settings reported in Table 3. Each warehouse filling or emptying mission began with fully empty or filled initial conditions, respectively. The motion planning for each forklift in the simulation environment solely takes into account obstacles such as warehouse walls and other forklifts.
The simulation results are divided into two parts. The first part focuses on assessing the consistency of the motion planning strategy by presenting three short-time trajectories for multiple forklifts. In the second part, various dispatching strategies were benchmarked using different scores specifically designed for the current warehouse scenario. This evaluation was conducted to determine the performance and efficiency of different dispatching strategies in the given warehouse scenario.

6.1. On the Performance of the Trajectory Planning Technique

Figure 12 reports how forklift 1 returns to a picking station from shelf cluster s 1 , where it has just unloaded goods. Before the start of the scenario, forklifts 2 and 4 are in the same state of return, and forklift 3 is unloading in shelf cluster s 4 . When the scene in Figure 12 starts, forklift 1 accelerates and decelerates to reverse to the entrance of shelf cluster s 1 . At the meantime, forklift 4 turns and heads to the picking station p 4 , leaving the crossing of the wide passages empty. With the purple square showing the newly filled stack, forklift 3 leaves shelf cluster s 4 and enters s 3 to finish turning round. When it is inside of s 3 , forklift 1 goes through the passage between s 3 and s 4 . Subsequently, forklift 1 reverses and turns to the picking station p 2 , and forklift 3 then enters the wide passages to return to the assigned picking station, which is p 3 . Finally, forklift 2 is the first vehicle that arrives at the crossing area of wide passages with the goods picked. During the scenario, all trajectories are directly determined by means of the approach elaborated on in Section 4. The least priority is dynamically offered to the newly departed vehicle. For instance, in this scenario, forklift 3 initially has the lowest priority. Thus, it should judge if there is enough time for the vehicle to turn around. Therefore, if the time is limited, vehicle 3 would wait at the entrance of s 4 , until the passing of forklift 1.
Figure 13 shows how the last stack is filled. Because there is nowhere to be filled hereafter, forklift 3 is parked in one picking station, and forklift 1 heads to picking station s 4 and stops. Initially, forklift 4 has just finished one unloading process, then it enters shelf cluster s 2 to turn over. After forklift 4 leaves s 2 , forklift 2 enters the shelf cluster to complete the last delivery subtask. Meanwhile, forklift 4 accelerates, decelerates, and turns to picking station s 3 for final parking.
Figure 14 shows a scenario where the trajectory decision for forklift 3 encounters a failure. In this case, forklift 1 reaches shelf s 4 to unload goods, while forklift 3 completes its unloading process earlier than forklift 1. Consequently, forklift 3 should be assigned a higher priority to plan the trajectory based on function rank ( ) . However, at this moment, forklift 1 remains stationed at the entrance of shelf s 4 , obstructing the passage for forklift 3. As a result, a decision failure occurs, leading to an update in the trajectory planning priority stored in R . Only after the completion of the trajectory planning for forklift 1 can the trajectory planning for forklift 3 proceed accordingly.

6.2. On the Performance of Dispatching Strategies

Table 4 benchmarks five distinct dispatching strategies for both filling and emptying tasks. It is worth noting that these dispatching methods are applicable only to specific coupling scenarios between grids and vehicle maneuvers, as they incorporate a preliminary A* search during the dispatching process.
The approaches differ in the assignment methods pertaining to delivering subtasks. While the decision failure times in Table 4 indicate the number of times n c u r r e n t or t i is changed without a solid trajectory decided throughout the entire warehouse filling mission, the end time is the finishing timing of the same task. Among the strategies, ANN combined determines the goal through arg   max S d (cf. S d in Equation (9)) when the coefficients of the MLP are determined. Comprehensive strategy applies S d , 0 in Equation (3), and the target is determined based on arg   max S d , 0 . The other three strategies only concern a few parameters in Equation (3). Greedy, traffic jam removal, and balance strategies select the goal pose through arg   max I d , arg   max J d , 0 , and arg   max G d , respectively.
As reported in Table 4, the end time of the emptying task is approximately 10% longer than that for filling tasks. However, the occurrence of decision failure times is generally lower. This indicates that during emptying tasks, instances of a forklift obstructing routes and blocking other vehicles for extended periods are rare. Yet, with the current configuration of i t e r p r e and i t e r s e a r c h , all vehicles might experience longer wait times on average during each subtask.
The ANN combined strategy is the optimal method in terms of end time for both filling and emptying tasks, requiring approximately 2% less time than the comprehensive strategy. However, it does exhibit comparable or even slightly greater decision failure times when compared to the comprehensive strategy. In general, a reduction in the first parameter can reduce the computation burden of the hardware by performing fewer useless calculations in searching a trajectory. The second parameter directly shows the efficiency of the cooperative operation of multiple forklifts. Notably, the ANN approach applied to train the dispatching system only concerns the efficiency of planning a trajectory for the current vehicle. Therefore, it is possible that some stacks in only one shelf cluster are left to be filled by selecting the best solution multiple times. In this case, multiple forklifts have to cooperatively fill one shelf cluster, and the waiting period of each vehicle must increase. Therefore, both the decision failure times and the end time may sharply increase.
In Figure 15, Figure 16, Figure 17 and Figure 18, the sequence in which the warehouse is filled with different methods is reported. The colored blocks plotted in the schematic warehouse stand for the stacks filled during different periods (red for the first quarter of time, blue for the second, green for the third, and cyan for the last). It is observed that the filling of different shelf clusters for the ANN combined strategy (cf. Figure 15) in different quarters of the period is balanced to a certain extent. There are, respectively, 35, 37, 34, and 30 stacks filled during each quarter. Particularly in the last quarter of the period, similar numbers of stacks filled are distributed within all six shelf clusters, and this provides a possibility for a feasible and fast solution to filling the warehouse.
By excluding the MLP of ANN, a comprehensive strategy is also capable of assigning subtasks to each forklift explicitly. Although the performance is slightly worse than the ANN combined one, it does not require the data pertaining to previous cycles. Figure 16 demonstrates the filling sequence for this approach. Similar to that of the ANN combined approach, the filling state of the shelf clusters in the warehouse is relatively balanced throughout the period. A few seconds are added in terms of the end time for this strategy without the fine adjustment of the MLP.
Figure 17 exhibits the filling sequence of the traffic-jam-removing strategy. The difference between this method and the greedy strategy is that the previous one simultaneously considers H in Equation (3). One stack in shelf cluster s 2 is filled in the first quarter of the period, and both the decision failure times and the end time pertaining to the traffic jam removal approach reduce compared to the greedy strategy. Thus, a simple additional function of H can slightly further characterize the difficulty of the trajectory planning. Meanwhile, because the filling sequences of these two strategies are similar, the figure pertaining to the greedy method is omitted. As shown in Figure 17, it basically evenly fills shelf clusters s 4 , s 5 , and s 6 in the first quarter of the period. The majority of stacks in s 1 and around a third of stacks in s 2 are filled in the last quarter. Considering four vehicles cooperatively delivering goods, violations between trajectories evidently arise, and the times of failure in trajectory decisions and the waiting time can increase markedly.
The filling sequence of the balance strategy is plotted in Figure 18. The filling of different shelf clusters is nearly perfectly balanced. Evidently, this strategy can better arrange the trajectories to fill the last few stacks of the warehouse compared to any other approach. Therefore, the number of stacks filled during the last quarter of the warehouse filling cycle is comparable to that during other periods. In particular, when the 5th, 14th, and 23rd stacks of shelf clusters s 3 and s 4 (cf. Figure 2) are being filled, the paths between the shelf clusters ( s 1 and s 2 ) in the left side of the warehouse (cf. Figure 1) and picking stations are cut. Meanwhile, the first delivery vehicle reaching these certain stack locations between shelf clusters s 3 and s 4 can simultaneously block the routes to certain shelf clusters. This phenomenon also appears in shelf clusters s 5 and s 6 when the identical stack indexes are considered. Those are the unique conflicts that arose to cause the failure decisions and the lengthening of the end time for the balance strategy.

7. Conclusions

This study presents a joint dispatching and cooperative trajectory planning framework. In this framework, the dispatching method applies a time dimension involved A* search, which is used to score different goal poses during a delivery subtask. ANN is simultaneously implemented to evaluate the difficulties for a forklift to arrive at a goods unloading pose from a picking station. In addition, the stacks to be filled pertaining to different shelf clusters are used as the third parameter to balance the shelf filling states, with the aim of avoiding deadlocks at the final stage of the mission.
As far as the trajectory planning approach within the framework is concerned, a model-based improved A* search algorithm is used to sequentially determine the trajectory of each vehicle without further optimization-based techniques. The node expansion manners are determined on the basis of the purpose of a maneuver, and the speed pertaining to the start and end of a maneuver is divided into stages.
Different dispatching strategies are benchmarked. The ANN combined strategy shows the best performance in warehouse filling efficiency, but the decision failure times of this method are comparable to those of the comprehensive strategy. Meanwhile, it is observed that the balance of the filling state pertaining to shelf clusters is as important as the evaluation of the goal-reaching difficulties. Furthermore, the trajectories of multiple vehicles in a short time are presented. It is shown that although the priority of a vehicle during one subtask is predefined, the vehicles are capable of cooperatively and continuously finding feasible trajectories.
In the future, our joint strategy will be tested in a downsized unmanned warehouse setup. Additionally, we aim to implement a zonal shutdown feature for enhanced safety in case pedestrians enter the area. We also plan to integrate a forklift failure detection mechanism to enable uninterrupted warehouse operations even in the presence of a few malfunctioning forklifts.

Author Contributions

Conceptualization, T.Z., M.L. and K.C.; methodology, H.L. and Y.F.; validation, T.Z., Y.F. and K.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Fundamental Research Funds for the Central Universities: 531118010692; National Natural Science Foundation of China: 62103139; Fundamental Research Funds for the Central Universities: 531118010509; Natural Science Foundation of Hunan Province, China: 2021JJ40114.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bozek, P.; Karavaev, Y.L.; Ardentov, A.A.; Yefremov, K.S. Neural network control of a wheeled mobile robot based on optimal trajectories. Int. J. Adv. Robot. Syst. 2020, 17, 2. [Google Scholar] [CrossRef]
  2. Blatnický, M.; Dižo, J.; Sága, M.; Gerlici, J.; Kuba, E. Design of a Mechanical Part of an Automated Platform for Oblique Manipulation. Appl. Sci. 2020, 10, 8467. [Google Scholar] [CrossRef]
  3. Yang, Q.; Lian, Y.; Xie, W. Hierarchical planning for multiple AGVs in warehouse based on global vision. Simul. Model. Pract. Theory 2020, 104, 102124. [Google Scholar] [CrossRef]
  4. Li, G.; Wang, X.; Yang, J.; Wang, B. A new method to design fuzzy controller for unmanned autonomous forklift. In Proceedings of the 2015 IEEE International Conference on Robotics and Biomimetics (ROBIO), Zhuhai, China, 6–9 December 2015; pp. 946–951. [Google Scholar] [CrossRef]
  5. Zhong, Z.; Wan, M.; Pei, Z.; Qin, X. Spatial and temporal optimization for smart warehouses with fast turnover. Comput. Oper. Res. 2021, 125, 105091. [Google Scholar] [CrossRef]
  6. Weidinger, F.; Boysen, N.; Briskorn, D. Storage assignment with rack moving mobile robots in KIVA warehouses. Transp. Sci. 2018, 52, 1297–1588. [Google Scholar] [CrossRef]
  7. Yu, W.; Wang, S.; Wang, R.; Tan, M. Generation of temporal-spatial Bezier curve for simultaneous arrival of multiple unmanned vehicles. Inf. Sci. 2017, 418–419, 34–45. [Google Scholar] [CrossRef]
  8. Ouyang, Y.; Li, B.; Zhang, Y.; Acarman, T.; Guo, Y.; Zhang, T. Fast and optimal trajectory planning for multiple vehicles in a nonconvex and cluttered environment: Benchmarks, methodology, and experiments. In Proceedings of the 2022 IEEE International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022; pp. 10746–10752. [Google Scholar]
  9. Chen, B.M. On the trends of autonomous unmanned systems research. Engineering 2022, 12, 20–23. [Google Scholar] [CrossRef]
  10. Miyamoto, T.; Inoue, K. Local and random searches for dispatch and conflict-free routing problem of capacitated AGV systems. Comput. Ind. Eng. 2016, 91, 1–9. [Google Scholar] [CrossRef]
  11. Stutzle, T.; Ruiz, R. Iterated Local Search. In Handbook of Heuristics; Marti, R., Pardalos, P., Resende, M., Eds.; Springer: Cham, Switzerland, 2018. [Google Scholar] [CrossRef]
  12. Zhang, X.; Sang, H.; Li, J.; Han, Y.; Duan, P. An effective multi-AGVs dispatching method applied to matrix manufacturing workshop. Comput. Ind. Eng. 2022, 163, 107791. [Google Scholar] [CrossRef]
  13. Lin, L.; Shinn, S.W.; Gen, M.; Hwang, H. Network model and effective evolutionary approach for AGV dispatching in manufacturing system. J. Intell. Manuf. 2006, 17, 465–477. [Google Scholar] [CrossRef]
  14. Bao, Y.; Jiao, G.; Huang, M. Cooperative optimization of pod repositioning and AGV task allocation in Robotic Mobile Fulfillment Systems. In Proceedings of the 2021 33rd Chinese Control and Decision Conference (CCDC), Kunming, China, 22–24 May 2021; pp. 2597–2601. [Google Scholar] [CrossRef]
  15. Lee, I.G.; Chung, S.H.; Yoon, S.W. Two-stage storage assignment to minimize travel time and congestion for warehouse order picking operations. Comput. Ind. Eng. 2020, 139, 106129. [Google Scholar] [CrossRef]
  16. Fragapane, G.; de Koster, R.; Sgarbossa, F.; Strandhagen, J.O. Planning and control of autonomous mobile robots for intralogistics: Literature review and research agenda. Eur. J. Oper. Res. 2021, 294, 405–426. [Google Scholar] [CrossRef]
  17. Hu, H.; Jia, X.; He, Q.; Fu, S.; Liu, K. Deep reinforcement learning based AGVs real-time scheduling with mixed rule for flexible shop floor in industry 4.0. Comput. Ind. Eng. 2020, 149, 106749. [Google Scholar] [CrossRef]
  18. Zhou, T.; Tang, D.; Zhu, H.; Zhang, Z. Multi-agent reinforcement learning for online scheduling in smart factories. Robot. Comput.-Integr. Manuf. 2021, 72, 102202. [Google Scholar] [CrossRef]
  19. Lian, Y.; Yang, Q.; Liu, Y.; Xie, W. A spatio-temporal constrained hierarchical scheduling strategy for multiple warehouse mobile robots under industrial cyber-physical system. Adv. Eng. Inform. 2022, 52, 101572. [Google Scholar] [CrossRef]
  20. Liu, Y.; Du, Y.; Dou, S.; Peng, L.; Su, X. Research summary of intelligent optimization algorithm for warehouse AGV path planning. In Lecture Notes in Operations Research; Shi, X., Bohács, G., Ma, Y., Gong, D., Shang, X., Eds.; Springer: Singapore, 2022. [Google Scholar] [CrossRef]
  21. Ma, Y.; Wang, H.; Xie, Y.; Guo, M. Path planning for multiple mobile robots under double-warehouse. Inf. Sci. 2014, 278, 357–379. [Google Scholar] [CrossRef]
  22. Xidias, E.K. A decision algorithm for motion planning of car-like robots in dynamic environments. Cybern. Syst. 2021, 52, 1909844. [Google Scholar] [CrossRef]
  23. Li, B.; Acarman, T.; Zhang, Y.; Ouyang, Y.; Yaman, C.; Kong, Q.; Zhong, X.; Peng, X. Optimization-based trajectory planning for autonomous parking with irregularly placed obstacles: A lightweight iterative framework. IEEE Trans. Intell. Transp. Syst. 2022, 23, 11970–11981. [Google Scholar] [CrossRef]
  24. Li, B.; Zhang, Y.; Feng, Y.; Zhang, Y.; Ge, Y.; Shao, Z. Balancing computation speed and quality: A decentralized motion planning method for cooperative lane changes of connected and automated vehicles. IEEE Trans. Intell. Veh. 2018, 3, 340–350. [Google Scholar] [CrossRef]
  25. Zhou, Y.; Huang, N. Airport AGV path optimization model based on ant colony algorithm to optimize Dijkstra algorithm in urban systems. Sustain. Comput. Inform. Syst. 2022, 35, 100716. [Google Scholar] [CrossRef]
  26. Zacharia, P.T.; Xidias, E.K. AGV routing and motion planning in a flexible manufacturing system using a fuzzy-based genetic algorithm. Int. J. Adv. Manuf. Technol. 2020, 109, 1801–1813. [Google Scholar] [CrossRef]
  27. Kneissl, M.; Madhusudhanan, A.K.; Molin, A.; Esen, H.; Hirche, S. A multi-vehicle control framework with application to automated valet parking. IEEE Trans. Intell. Transp. Syst. 2021, 22, 5697–5707. [Google Scholar] [CrossRef]
  28. Dresner, K.; Stone, P. Multiagent traffic management: A reservation-based intersection control mechanism. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, New York, NY, USA, 23 July 2004; pp. 530–537. [Google Scholar]
  29. Guney, M.A.; Raptis, I.A. Dynamic prioritized motion coordination of multi-AGV systems. Robot. Auton. Syst. 2021, 139, 103534. [Google Scholar] [CrossRef]
  30. Digani, V.; Caramaschi, F.; Sabattini, L.; Secchi, C.; Fantuzzi, C. Obstacle avoidance for industrial AGVs. In Proceedings of the 2014 IEEE 10th International Conference on Intelligent Computer Communication and Processing (ICCP), Cluj-Napoca, Romania, 4–6 September 2014; pp. 227–232. [Google Scholar] [CrossRef]
  31. Stern, R. Multi-Agent Path Finding—An Overview. In Artificial Intelligence. Lecture Notes in Computer Science; Osipov, G., Panov, A., Yakovlev, K., Eds.; Springer: Cham, Switzerland, 2019; Volume 11866. [Google Scholar] [CrossRef]
  32. Zhang, J.; Li, Z.; Li, L.; Li, Y.; Dong, H. A bi-level cooperative operation approach for AGV based automated valet parking. Transp. Res. Part C 2021, 128, 103140. [Google Scholar] [CrossRef]
  33. Meng, Y.; Li, L.; Wang, F.; Li, K.; Li, Z. Analysis of cooperative driving strategies for nonsignalized intersections. IEEE Trans. Veh. Technol. 2018, 67, 2900–2911. [Google Scholar] [CrossRef]
  34. Lee, C.K.M.; Lin, B.; Ng, K.K.H.; Lv, Y.; Tai, W.C. Smart robotic mobile fulfillment system with dynamic conflict-free strategies considering cyber-physical integration. Adv. Eng. Inform. 2019, 42, 100998. [Google Scholar] [CrossRef]
  35. Rochel, P.; Rios, H.; Mera, M.; Dzul, A. Trajectory tracking for uncertain Unicycle Mobile Robots: A Super-Twisting approach. Control Eng. Pract. 2022, 122, 105078. [Google Scholar] [CrossRef]
  36. Zhang, T. An estimation method of the fuel mass injected in large injections in Common-Rail diesel engines based on system identification using artificial neural network. Fuel 2022, 310, 122404. [Google Scholar] [CrossRef]
  37. Dolgov, D.; Thrun, S.; Montemerlo, M.; Diebel, J. Path planning for autonomous vehicles in unknown semi-structured environment. Int. J. Robot. Res. 2010, 29, 485–501. [Google Scholar] [CrossRef]
  38. Li, B.; Shao, Z. A unified motion planning method for parking an autonomous vehicle in the presence of irregularly placed obstacles. Knowl. Based Syst. 2015, 86, 11–20. [Google Scholar] [CrossRef]
Figure 1. Schematic of a warehouse layout.
Figure 1. Schematic of a warehouse layout.
Electronics 12 03820 g001
Figure 2. Filling sequence pertaining to each shelf cluster.
Figure 2. Filling sequence pertaining to each shelf cluster.
Electronics 12 03820 g002
Figure 3. Parametric notations related to vehicle size and kinematics.
Figure 3. Parametric notations related to vehicle size and kinematics.
Electronics 12 03820 g003
Figure 4. Expansion manner of the preliminary hybrid A* search.
Figure 4. Expansion manner of the preliminary hybrid A* search.
Electronics 12 03820 g004
Figure 5. Schematic of an MLP network.
Figure 5. Schematic of an MLP network.
Electronics 12 03820 g005
Figure 6. Flowchart of MLP training.
Figure 6. Flowchart of MLP training.
Electronics 12 03820 g006
Figure 7. Node expansion manners of improved hybrid A* algorithm: (a) Stopping and going straight; (b) lane changing and turning.
Figure 7. Node expansion manners of improved hybrid A* algorithm: (a) Stopping and going straight; (b) lane changing and turning.
Electronics 12 03820 g007
Figure 8. Path of turning.
Figure 8. Path of turning.
Electronics 12 03820 g008
Figure 9. Path of lane changing.
Figure 9. Path of lane changing.
Electronics 12 03820 g009
Figure 10. Velocity selections when one maneuver starts or ends.
Figure 10. Velocity selections when one maneuver starts or ends.
Electronics 12 03820 g010
Figure 11. Velocity variations through one maneuver.
Figure 11. Velocity variations through one maneuver.
Electronics 12 03820 g011
Figure 12. Planned trajectories between t = 687.25 s and t = 715.25 s.
Figure 12. Planned trajectories between t = 687.25 s and t = 715.25 s.
Electronics 12 03820 g012
Figure 13. Planned trajectories between t = 2327.25 s and t = 2354.50 s.
Figure 13. Planned trajectories between t = 2327.25 s and t = 2354.50 s.
Electronics 12 03820 g013
Figure 14. Planned trajectories between t = 284.75 s and t = 303.75 s.
Figure 14. Planned trajectories between t = 284.75 s and t = 303.75 s.
Electronics 12 03820 g014
Figure 15. Filling sequence of ANN combined strategy.
Figure 15. Filling sequence of ANN combined strategy.
Electronics 12 03820 g015
Figure 16. Filling sequence of comprehensive strategy.
Figure 16. Filling sequence of comprehensive strategy.
Electronics 12 03820 g016
Figure 17. Filling sequence of traffic jam removing strategy.
Figure 17. Filling sequence of traffic jam removing strategy.
Electronics 12 03820 g017
Figure 18. Filling sequence of balance strategy.
Figure 18. Filling sequence of balance strategy.
Electronics 12 03820 g018
Table 1. Node expansion selections in time domain.
Table 1. Node expansion selections in time domain.
AbbreviationDescription
T 1 v z e r o Going   straight v z e r o
T 2 v z e r o Going   straight v m i d   or   v m i d Going   straight v z e r o
T 3 v m i d Going   straight v m i d
T 4 v m i d Going   straight v h i g h   or   v h i g h Going   straight v m i d
T 5 v h i g h Going   straight v h i g h
T 6 v z e r o Turning v z e r o
T 7 v z e r o Turning v m i d   and   v m i d Turning v z e r o
T 8 v m i d Turning v m i d
T 9 v z e r o Lane   changing v z e r o
T 10 v z e r o Lane   changing v m i d   or   v m i d Lane   changing v z e r o
T 11 v m i d Lane   changing v m i d
T 12 v z e r o Stop v z e r o
Table 2. Model-based approach for saving time consumption data.
Table 2. Model-based approach for saving time consumption data.
Current Maneuver μ = 0 μ = 1
Stop- t z e r o , c t z e r o , p + T 12
t m i d , c t z e r o , p + T 12
t h i g h , c t z e r o , p + T 12
Going straight if   t h i g h , p = t m i d , p , then
t z e r o , c t m i d , p + T 2
t m i d , c t m i d , p + T 3
t h i g h , c t m i d , p + T 4
else
t z e r o , c t m i d , p + T 2
t m i d , c t h i g h , p + T 4
t h i g h , c t h i g h , p + T 5
t z e r o , c t z e r o , p + T 1
t m i d , c t z e r o , p + T 2
t h i g h , c t z e r o , p + T 2
Turning t z e r o , c t m i d , p + T 7
t m i d , c t m i d , p + T 8
t h i g h , c t m i d , p + T 8
t z e r o , c t z e r o , p + T 6
t m i d , c t z e r o , p + T 7
t h i g h , c t z e r o , p + T 7
Lane changing t z e r o , c t m i d , p + T 10
t m i d , c t m i d , p + T 11
t h i g h , c t m i d , p + T 11
t z e r o , c t z e r o , p + T 9
t m i d , c t z e r o , p + T 10
t h i g h , c t z e r o , p + T 10
Table 3. Parametric settings regarding model and approach.
Table 3. Parametric settings regarding model and approach.
ParameterDescriptionSetting
n Forklift front overhang length0.3 m
m Forklift rear overhang length1 m
l Forklift wheelbase1.5 m
2 b Forklift width1 m
l b x , u b x Horizontal boundaries of map 18 , 18 m
l b x , u b y Vertical boundaries of map 12 , 12 m
r e s o l x y Node resolution for search
algorithms
2 m
i t e r p r e Maximum iteration in the time dimension involved A* search500
i t e r d i s p a t c h Maximum iteration of
redispatching
10
i t e r s e a r c h Maximum iteration in the
improved A* search
5000
C 1 , C 2 , , C 11 Calibration parameters 6 , 1.5 , 80 , 40 , 6 , 0.5 , 4 , 2 , 0.5 , 0.5 , 3
T 1 , T 2 , , T 12 Modeled time lengths of
maneuvers
4 , 2 , 1.25 , 0.75 , 0.5 , 8 , 5 , 3 , 12 , 8 , 5 , 1 s
p t u r n Penalty for turning maneuver4
p l a n e Penalty for lane changing
maneuver
6
p i n v Penalty for speed inverse
maneuver
6
t i Time postponed when one
trajectory planning is failed
10 s
t c o v e r Time   period   to   evaluate   H in Equation (3)20 s
t e n t r y Time length after the virtual vehicle entering the rack
passage to evaluate t c o v e r
20 s
t c l a i m , t u n l o a d Time period for picking goods and unloading goods 5 , 5 s
Table 4. Benchmark of the different dispatching strategies.
Table 4. Benchmark of the different dispatching strategies.
Strategy NameFillingEmptying
Decision Failure TimesEnd
Time (s)
Decision Failure TimesEnd
Time (s)
ANN combined strategy382388.25272597.25
Comprehensive strategy382441.00212658.25
Greedy strategy1352768.75753419.00
Traffic jam removing
strategy
1202709.00513117.75
Balance strategy552557.75352660.50
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, T.; Li, H.; Fang, Y.; Luo, M.; Cao, K. Joint Dispatching and Cooperative Trajectory Planning for Multiple Autonomous Forklifts in a Warehouse: A Search-and-Learning-Based Approach. Electronics 2023, 12, 3820. https://doi.org/10.3390/electronics12183820

AMA Style

Zhang T, Li H, Fang Y, Luo M, Cao K. Joint Dispatching and Cooperative Trajectory Planning for Multiple Autonomous Forklifts in a Warehouse: A Search-and-Learning-Based Approach. Electronics. 2023; 12(18):3820. https://doi.org/10.3390/electronics12183820

Chicago/Turabian Style

Zhang, Tantan, Hu Li, Yong Fang, Man Luo, and Kai Cao. 2023. "Joint Dispatching and Cooperative Trajectory Planning for Multiple Autonomous Forklifts in a Warehouse: A Search-and-Learning-Based Approach" Electronics 12, no. 18: 3820. https://doi.org/10.3390/electronics12183820

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop