Abstract

This paper proposes a support vector machine (SVM)-based AGV scheduling strategy that enhances the scheduling efficiency of automated guided vehicles (AGVs) in intelligent factories. The developed scheme optimizes the task area division process to endow the AGVs with the ability to avoid obstacles in complex dynamic environments. Specifically, given the two AGV motion cases, i.e., towards a single target point and multiple target points, the optimal path was determined utilizing the exhaustive and the Q-learning methods, while path optimization was realized by utilizing different schemes. Based on the shortest path obtained, a nonlinear programming model with the shortest time as the objective was built, and the AGV’s turning path was proved to be optimal by the non-dominated sorting genetic algorithm (NSGA-II). Several simulation tests and calculation results validated the proposed method’s effectiveness, highlighting that the developed scheme is a rational solution to the obstacle congestion and deadlock problems. Moreover, the experimental results demonstrated the proposed method’s superiority in path planning accuracy and its ability to respond well in complex dynamic environments. Overall, this research provides a reference for developing and applying AGV cluster scheduling in real operational scenarios.

1. Introduction

Traditionally, warehouses have been mostly managed manually. With the rise of e-commerce [1], s (AGVs), shuttles, and Delta sorting robots [2] are playing an essential role in automated warehouse logistics systems. Therefore, to ensure proper automated warehouse logistics system management, achieving reasonable obstacle avoidance of AGVs in complex smart warehouses is necessary to reach optimal scheduling. Some scholars have tried solving the scheduling of AGVs using path planning algorithms, such as Dijkstra’s algorithm [3], A algorithm [4], and ant colony algorithm [5]. However, as the task dimensionality increases, the solution takes more time and becomes more complicated. Moreover, these algorithms have drawbacks, such as slow convergence and a tendency to fall into locally optimal solutions. Besides, the methods only focus on avoiding obstacles and do not consider the impact of local obstacle avoidance planning on subsequent operations. Consequently, the trajectory should be adjusted after obstacle avoidance to bring the AGVs back to the global path [6], reducing operational efficiency.

AGV assignment in scheduling problems has been studied by some researchers. To minimize the makespan and intercellular motions of components, Azadeh et al. [7] developed a nonlinear CFP which included intra-cell scheduling and material handling using AGVs. However, on the other hand, nonlinear CFP is only suited for small-scale AGV systems since it cannot adjust dynamically to the transportation environment. Chu et al. [8] used an adaptive memetic differential search method to tackle the problems, which included cross-training with a learning/forgetting impact that improves the flexibility of routing. However, the method cannot quickly determine the optimal solutions of multiple objective functions, limiting instantaneity and global optimality in large-scale transportation. To address the work assignment issue of AGVs, Radhia et al. [9] presented a hybrid method based on the Dijkstra algorithm, genetic algorithm, and heuristic algorithm, which can ensure conflict-free control of a large fleet on any layout, and that permits optimized routing for all AGVs’ schedules. For AGV collision and deadlock complications, Malopolski [10] devised a novel approach to determine one-way, two-way, or multilane flow fields, which can adapt AGV control techniques in real time to the mobility environment. However, this method cannot motivate AGVs to arrive at destinations as rapidly as possible. In the meantime, it does not completely overcome the restrictions of the disadvantages in AGV control. Through the framework of a time-windows graph, Kim and Jin [11] used Dijkstra's shortest-path method to design AGV's course. The vehicle agent optimizes the distribution of transportation for AGVs and improves efficiency. A multi-AGV A algorithm based on a collision-free dynamic route planning approach was described by Chunbao Wang et al. [12]. The method categorized probable conflicts in order to find the shortest route that is conflict-free. The method classified potential conflicts in order to find the shortest conflict-free route. Similarly, Tai et al. [13] introduced a priority route planning method and achieved coordinated management of multi-AGVs based on time frames, which contributes to the conflict-free routing and shorter completion time, which contributes to the conflict-free routing and shorter completion time.

Considering the characteristics of existing AGV systems, this paper develops an AGV scheduling strategy based on the nonlinear programming model and the non-dominated sorting genetic algorithm NSGA-II [14]. Furthermore, obstacle avoidance simulations were conducted to minimize the total AGV moving path by optimizing the model so that the sorting stations reached a reasonable balance. Our trials concluded that the AGV path was optimal for the minimum radius and the circle center located at the obstacle’s vertex (the circular obstacle was located at the circle’s center), thereby avoiding obstacle congestion and deadlock in the current AGV scheduling [3].

The main work of this paper are as follows:(1)Proposing an SVM-based AGV scheduling strategy that enhances the scheduling efficiency of AGVs in intelligent factories,(2)Optimizing the division process to endow the AGVs with the ability to avoid obstacles,(3)Determining the optimal path by utilizing the exhaustive and the Q-learning methods,(4)Proving the optimal path by the non-dominated sorting genetic algorithm (NSGA-II),(5)Providing a reference for developing and applying AGV cluster scheduling in real operational scenarios.

The structure of the remaining sections is as follows. Section 2 includes the problem statement. Section 3 describes the considered models and computational results of the study. Section 4 provides a discussion of the results and validation analyses. The concluding remarks and further research directions are provided in Section 5.

2. Problem Description and Hypotheses

2.1. Warehouse Information and Problem Description

The goal of scheduling is to assign handling tasks to s (AGVs) in different locations so that the total handling time is minimized without collision or deadlock. The unmanned warehouse discussed in this paper is a 32 × 22 rectangular area, simplified in Figure 1.

The node types are as follows.(1)Path node (gray): AGV can pass freely.(2)Storage node (green): Place pallets or ordinary shelves.(3)Reserved node (yellow): Reserved position.(4)Column node (black): Obstacle.(5)Work station node (blue): AGVs pack the goods at the work station and exit from the conveyor belt.(6)Replenishment node (purple): the placement point of replenished goods.(7)Empty pallet recovery node (red): empty pallet recovery place.

The map used in Figure 1 is simplified to a dotted map as follows (Figure 2).

It is assumed that the AGV can only move within the limits of this plane scene. The region representation is shown in Table 1.

2.2. Condition Hypotheses

To simplify the calculations, the following reasonable assumptions are given.(1)The AGV can turn accurately along a circular arc.(2)The initial speed of the AGV is 5 units per second.(3)The speed of the AGV will not be affected when it cuts from a straight line to an arc.(4)AGVs do not stop accidently.(5)Ignore the factors that affect the non-minimum turning radius and minimum safety distance of AGV travel.

2.3. Illustration of Symbols

The symbols are specified as follows (Table 2).

Idling: A continuous process in which an AGV stops moving but runs at its lowest possible speed.

Acceleration: A continuous process in which the acceleration of an AGV is greater than 0.1 ms−2.

Deceleration: A continuous process in which the acceleration of an AGV is less than −0.1 ms−2.

Constant speed: A continuous process in which the absolute value of the acceleration of an AGV is less than 0.1 ms−2, a nonidling speed.

Average speed: The arithmetic mean of the speed of an AGV over a period.

Average driving speed: The arithmetic mean of the speed of an AGV when it is driven, excluding idling.

Idling time ratio: The percentage of total idling time in the total running time of an AGV.

Average acceleration: The arithmetic mean of acceleration per unit time (second) of an accelerating AGV.

Average deceleration: The arithmetic mean of deceleration per unit time (second) of a decelerating AGV.

Acceleration time ratio: The percentage of accumulated time in acceleration in the total time of a period.

Deceleration time ratio: The percentage of accumulated time in deceleration in the total time of a period.

Speed standard deviation: The standard deviation of the speed of an AGV over a period, including idling.

Acceleration standard deviation: The standard deviation of the acceleration of an AGV that is accelerating over a period.

3. Modeling and Solution Finding

3.1. Minimization of Walking Path

Given the ignorance of possible collisions caused by AGVs during task implementation, we designed appropriate for an unmanned warehouse scenario. The possible shortest path from the starting point to the target point was determined, and the optimal path was found through the exhaustive and Q-learning methods [15]. Moreover, two AGV motion scenarios were considered, i.e., towards a single target point and multiple target points. In the former situation, the basic line-circle structure was first constructed as a solution for a single turn. Then, we developed a computational model for the different positional relationships between the straight-line and circular paths for the multiple-turn scenario. In the latter situation, straight-line turning was impossible for the AGV, which should turn in advance when passing the target point so that the intermediate target point was on the turning arc. Hence, we developed an optimization model to obtain the center of the turning arc at the intermediate target point, which was then considered an “obstacle” to transform the problem into a single-target point problem. The shortest path length was 2,812.52 units, and the traveling time was 585.6712 s. Thus, the objective function was the minimum value of the total handling AGVs’ path when each AGV was as busy as possible.where represents k tasks of outgoing/returning/recycling assigned to the AGV under the order.

The scheduling algorithm in the unmanned warehouse scenario was designed for the case where potential collisions caused by the AGV handling during task implementation were ignored and aimed to determine the shortest path from the starting point to the target point utilizing the exhaustive method and the Q-learning method [16]. This paper considered two AGV scenarios, i.e., single- and multi-target points.

3.1.1. Single-Target Point Model (Involving Only the Starting and the Target Points)

The turning trajectories of AGVs are arcs tangent to the straight-line paths. Consequently, the traveling routes can be viewed as a combination of multiple basic lines and circles.

3.1.2. Division of Prohibited Areas

As shown in Figure 3, given that there is a minimum distance limit between AGVs and obstacles in travel, we first draw the forbidden zone of the enveloping obstacles. The forbidden zone is still a circle for circular obstacles, while the corners of the forbidden zone are circular for obstacles with vertices.

3.1.3. Shortest Path in the Case of AGV Turning at the Vertex of the Prohibited Area

As shown in Figure 4, from point A to point C, an AGV must make a turn. However, making the turn near the edge of the penalty area will shorten the total path.

Without considering the turning radius and other factors, let point D be the top of the penalty area and point B be any point outside the penalty area. From point A to point B, the shortest total path is realized when the AGV makes a turn at point D.

Adding the two equations yields:

Simplifying gives:

In the process of A > B, the AGV chooses the shortest path when turning at D.

3.1.4. Shortest Path in the Case of the Minimum Turning Radius

For an AGV to move from point A to point B, it should bypass the prohibited area and turn near its vertex. The smaller the turn radius is, the shorter the path is. As depicted in Figure 5, the path from point A to point B is regarded as a stretchable rope and is assumed to naturally stretch (line segment AB) when the two points connect. The rope is stretched as the AGV needs to steer clear of the prohibited area. Additionally, the minimum turning radius of the AGV and the diameter of the prohibited area are 10. Thus, the rope can pass directly around the edge of the prohibited area.

According to the principle of minimum potential energy, the systems’ potential energy reaches its minimum value when the elastic body is in equilibrium. Here, the circle was considered elastic in the initial state illustrated in the above figure. Under the forces illustrated in the figure, the system gradually reaches equilibrium as the circle tends to shrink, ultimately obtaining the minimum potential energy of the elastic rope, which decreases as the circle radius reduces, i.e., the shortest path goes down. This finding proves that the path is the shortest in the case of the minimum turning radius.

3.1.5. Shortest Path in the Case of the Center of the Turning Arc Located at the Vertex

When the AGV turns near the vertex of the prohibited area, the path is the shortest if the center of the turning arc is at the vertex. As illustrated in Figure 6, both circles O and O′ with a radius of R and R′, respectively, bypass the farthest point D. Notably, the center of circle O falls on the vertical line. L1, S, L2, and L1', S′, L2' are the lengths of the tangent segments from points A and B to the circles O and O′ and the arcs contained, respectively.

Point D and circle center O fall on the vertical line (passing D) of the line segment AB, while the circle center O′ is located beyond this vertical line. Therefore, the two circles are only intersected rather than tangent. To pass point D, the maximum distance from the point on circle O′ to the straight-line AB must be greater than that from the point on circle O to the straight-line AB. Based on the conclusions proved above, the result below was generated:

When the AGV bypasses an obstacle with a vertex, the shortest path is the arc turning with the vertex as the center and the minimum turning radius so that the minimum turning radius is the same as the minimum safe distance between the AGV and the obstacle. In other words, the path is the shortest when the AGV makes a turn along the edge of the prohibited area.

3.1.6. Modeling

Based on the above conclusions, the shortest path is always constituted by several tangents and arcs, regardless of the number of obstacles between the starting and target points. As proved previously, the path is the shortest when the AGV passes through all obstacles by turning along the edge of the prohibited area, where the radius of the turning arc is that of the hazardous area. Therefore, in the model below, when passing through obstacles, the AGV turns at the obstacle’s vertex with the minimum turning radius r placed at the obstacle’s center.

Given that the AGV aims to move from starting point A (x1, y1) to target point B (x2, y2), while turning on the arc centered at vertex D (x3, y3) with radius r, and C and E are the tangency points, the point C and E coordinates and the length of should be calculated:

Setting the coordinates of points C and E as (xi, yi) and (xj, yj), respectively, provides the following formulae:

Moreover, the coordinates of point E are expressed as:and the coordinates of point C:

(17) and (18) can be connected to the coordinates of C and E.

The length of arc is

The path length for the AGV turning once on the way was calculated as:

Furthermore, the traveling time was:

3.1.7. Model Calculation

It was assumed that on the path from the starting point to the target point, there were straight lines with a length of and arcs with a length of . Thus, the total distance for the AGV moving from the starting point to the target point was expressed as:

Moreover, the traveling time was:where and is the turning radius.

Figure 7 illustrates the possible optimal paths, as proved in the above sections.

The two paths having a basic line-circle structure could be directly solved by model 1, obtaining 471.032 and 505.9835 units, respectively, through MATLAB. Therefore, the optimal total distance of the AGV passing from the upper left of obstacle 5 was S = 471.0372 units, the total traveling time was T = 96.0178 s, containing two straight-line segments and one arc line segment (at center (80, 210) and radius r = 10). The specific conditions of the path are listed in Table 3:

Through the calculations performed by MATLAB, the optimal total distance of the AGV was S = 3,812.52 units. The traveling time was T = 585.6712 s, involving 16 straight-line segments and 15 arcs. Further details are reported in Tables 4 and 5.

3.2. Task Equalization

This section presents the hardware conditions of the experiment in order to validate the performance of the proposed methodology. The algorithms was coded in MATLAB 2021 software using a computer with the following specifications: Intel (R) Core (TM) i9-10885H CPU @ 2.40 GHz.

To better balance the sorting stations’ load and prevent local AGV handling congestion, we proposed a SVM-based AGV scheduling strategy. According to the experimental results of dividing the equilibrium task area, the proposed SVM method outperformed the single-attribute AGV scheduling rules. In the suggested scheme, first, we set a sorting station for each pallet, thus realizing the pairing optimization. On this basis, the model of problem 1 was re-built to:where represents the picking station specified for the pallet.

Generating the training sample involves the following steps. As a supervised learning technique, SVM can output a maximum-boundary hyperplane to perfectly separate two types of training samples [17]. The training samples in the supervised learning comprise features and labels, which refer to the current system state and optimal scheduling rules in the case of AGV scheduling, respectively. This rule can be dynamically chosen based on the current system state.

The training sample set is not linearly separable, i.e., it cannot be well separated by a linear hyperplane. In this case, the kernel function will map the feature vector into a higher-dimension Hilbert space . Here, the Hilbert space was reproduced by a Gaussian kernel (or radial basis function, RBF), where the RBF parameters were defined by candidate scheduling rules selected via an SVM scheduler [18]. Thus, the scheduling rules were taken as the SVM labels. The AGV scheduling rules considering the order due date were also the candidate scheduling rules since the average work delay was taken as the performance indicator.

Partitioning rules for the candidate scheduling area:(1)Shortest travel time (STT): select the task closest to the AGV.(2)First come, first served (FCFS): select more urgent tasks.(3)Minimum remaining output queue space (MROQS): select tasks in the output queue with the remaining space.(4)Earliest due date (EDD): select the task with the closest due date.(5)Critical ratio (CR): select tasks with the lowest critical ratio [CR = (due date-this date)/(remaining operation time)].(6)Dynamic slack (DS): select tasks with the shortest slack time (remaining slack time = remaining operation time).(7)Nearest vehicle (NV): select the AGV closest to the task location.(8)Longest idle AGV (LIV): select the AGV with the longest idle time.(9)Lowest utilization AGV (LU): select the AGV with the lowest utilization ratio.

The division distribution of the ten regions was determined by the division rules as shown in Figure 8.

In machine learning, every system attribute potentially influencing the system’s performance should be considered an environmental state. Given the environment of this research, we chose the system attributes relative to the AGV, which empirically are nine, utilized as the training samples. Moreover, the average delay rate was recorded at the end of scheduling and was employed as a performance indicator. After performing N simulations with a fixed random seed, the performance of N scheduling rules was recorded, and the scheduling rule with the highest performance was used as the label of the training sample.

The error diagrams of the SVM scheduling after training in each division are provided in Table 6:

The error of the trained SVM scheduling division results is shown in Figure 9. The single-attribute scheduling rule shows a slow convergence and weak stability. Obviously, the SVM scheduler significantly outperformed most of the single-attribute scheduling rules.

Different combinations of single-attribute scheduling rules and SVM scheduling methods were tested and simulated. For each scheduling method, 30 tests were conducted using a typical random number (CRN) technique. Specifically, the comparison value was expressed as the average delay value of a single scheduling rule divided by that of the SVM scheduler, thereby comparing the performance of the single scheduling rule and the SVM scheduler. A statistical analysis of the results was carried out to determine whether the SVM scheduling rule was superior to the single-attribute scheduling rule [19]. Obviously, the SVM scheduler significantly outperformed most of the single-attribute scheduling rules.

4. Results and Evaluation

4.1. Simulation and Validation

According to the relationship between the turning radius and speed, path optimization was realized based on the model for calculating the shortest path. Then, a nonlinear programming model for the shortest time was constructed. Utilizing NSGA-II provided the shortest time of 94.22825 s, the turning radius of 12.9886 units, and the circle center coordinates of the turning were 82.1414 and 207.1387 [20]. The obstacles were classified into two categories: having vertices and having no vertices. Next, the path proved to be optimal when the AGV made a turn with the minimum radius and the circle center was located at the obstacle’s vertex (the circular obstacle was located at the circle center), thus reasonably solving the problems of obstacle congestion and deadlock.

The problem here was the shortest time spent by the AGV traveling from point O to point A and bypassing obstacle 5. According to the shortest path calculated, the time was also the shortest when the AGV passed from the upper left of obstacle 5. The total time spent by the AGV comprised the time on the straight line and arc line segments. The paths with the shortest time and the shortest distance were different, so the path with the shortest time should be calculated first. Since the closest distance between the AGV and the obstacle must not be less than 10 units, the traveling range of the AGV was determined by the obstacles to be avoided on its path and the range influencing its action.

The maximum turning speed of the AGV was expressed as:

The time spent by the AGV in passing the arc was:

Based on the above formulae, decreasing the turning radius slows the AGV, and the arc it followed became shorter. When the turning radius of the AGV increased, both the AGV’s turning speed and the arc’s length increased. Thus, we did not identify any direct linear relationship between the turning time and the turning radius . Hence, the turning radius change was limited. In practice, an excessively large may cause collisions, while the AGV may roll over for a very small (less than 10).

Then, a nonlinear programming model for the minimum time was built based on the variation range of , thereby obtaining at the minimum . Here, was set as the starting point, was the target point, and was the upper left vertex of obstacle 5. It was assumed that the AGV made a turn at point , with as the center and as the radius, passed arc , and then turned at point , to ultimately obtain the path with the shortest time spent. Points and were connected to generate line segment , perpendicular to segment . The length of , , tangent , and tangent is denoted as , and , respectively.

Let BD = d. Since both point B and point C are tangent points, the following results were obtained:

The radius of the circle in which B and C are located is

The objective of the shortest time was formulated based on the straight-line distance, speed of AGV, the arc length, and the AGV’s speed while passing the arc:

Given that the distance between the AGV and the obstacle must be more than 10 units, the radius r was also constrained, i.e., the arc-obstacle distance must be over 10 units:

The range of the two tangent points is

The constraints that the circle center had the shortest distance from the obstacle were expressed as follows:

The AGV obstacle avoidance simulated through MATLAB programming is illustrated in Figures 10 and 11:

The turning path of an AGV must include an arc tangent to the straight-line path. Points C and B are on a circle with a radius .

Based on the conditions above, the optimization model for the shortest time was built:

Utilizing Lingo and NSGA-II software, the shortest time of 94.22825 s and the turning radius of r = 12.9886 were obtained. The results are reported in Table 7 and the resulting diagram Figure 12:

4.2. Model Promotion

In this paper, the problem of deadlock was simplified. Nevertheless, further model optimization is required when the actual deadlock situation is more complicated. This work assumed that the AGV moved from the starting point to the target point , where the path comprised linear segments and arcs, set as and , respectively. Thus, the objective function was expressed as:

Computer software such as MATLAB or Lingo can be used to solve the optimal path between the start point and the target point.

5. Conclusion

According to the features of existing AGV cluster systems, an SVM-based AGV scheduling strategy was developed by determining the shortest possible path between the starting point and the target point. Then, the optimal path was determined by the exhaustive method and the Q-learning method and was optimized by several schemes, through which the optimal path in the relative optimization was obtained. A nonlinear programming model for the shortest time was built based on the relationship between the turning radius and speed based on the shortest path. The calculation with NSGA-II proved that the AGV path was optimal when the AGV turned with the minimum radius, and the circle center was located at the obstacle’s vertex (the circular obstacle was located at the circle center). Several simulation tests and calculation results validated the proposed method, which rationally solved the problem of obstacle congestion and deadlock. After optimization, the constructed model was solved with high accuracy via analytical geometry. However, the processing burden increased, resulting in low utilization efficiency. Hence, the proposed method is inefficient for complicated deadlock situations.

Future research will consider the minimum spanning tree, and the shortest path of the AGV combined the dynamic monitoring and real-time data of AGV cluster systems.

Data Availability

The data used to support the findings of the study can be obtained from the author upon request.

Conflicts of Interest

The author declares that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The author would like to express his gratitude to EditSprings (https://www.editsprings.cn) for the expert linguistic services provided and Zhi Dong Li for experimental assistance. This research was supported by the 2021 Innovation Training Program for University Students (nos. 202112757001, 202112757005, 202112757006, and 202112757016) and the Chongqing Police College Studies Research Project Program.