Abstract
The paper considers the single-server Markovian queues with delayed multiple vacations. Once the system becomes empty, the server will experience a changeover time to begin the vacation. By using the matrix-geometric solution method, we obtain the stationary probability distribution and the mean queue length under almost unobservable and fully unobservable cases. Based on the system status information and a linear reward-cost structure reflecting the desire of customers for service and their unwillingness for waiting, the social welfare function is determined. We also investigate optimal pricing strategies of the server under the ex-postpayment scheme and the ex-antepayment scheme, and we get the customer’s equilibrium strategy under optimal pricing strategy. Finally, we illustrate the effect of several parameters and information levels on socially optimal strategies and optimal social benefit by numerical examples.
1. Introduction
Over the past few decades, more and more people have become keen to study queueing systems from a game-theoretic perspective. This study of queueing models was initiated by Naor [1] and Edelson and Hildebrand [2] who studied the model with a simple linear reward-cost structure. They obtained customers’ equilibrium and socially optimal strategies for observable and unobservable cases. From then on, many authors have investigated the same problem for various queueing systems incorporating diverse characteristics. The monograph written by Hassin and Haviv [3] introduced the main results and solution methodology about these models with extensive bibliographical references.
Queueing models with vacations have been developed as useful performance analysis tools for computer systems, communication networks, and flexible manufacturing systems. As to strategic behavior in the vacation queues, Burnetas and Economou [4] first studied a Markovian single-server queueing system with setup times. They derived equilibrium strategies for the customers under different levels of information and analyzed social benefit under these strategies. Then, Economou and Kanta [5] considered equilibrium threshold strategies in the fully observable and almost observable Markovian single-server queue with breakdowns and repairs. Guo and Hassin [6] studied the strategic behavior and social optimization in Markovian queues with N policy. This work was extended by Guo and Hassin [7] to heterogeneous customers. Sun et al. [8, 9] analyzed customers’ equilibrium and socially optimal balking strategies in the observable and unobservable queues with three types of setup/closedown policies, respectively. Liu et al. [10] investigated equilibrium thresholds of customers in observable M/M/1 queueing systems with a single vacation. Zhang et al. [11] and Sun and Li [12] studied almost simultaneously equilibrium balking strategies in M/M/1 queues with multiple working vacations for different cases with respect to the system information. But Sun and Li [12] also observed socially optimal joining strategies and optimal social benefit. Subsequently, Sun et al. [13, 14] considered the customers’ optimal balking behavior in some single-server Markovian queues with two-stage working vacations and double adaptive working vacations, respectively. Wang and Zhang [15] discussed the equilibrium strategies in the Markovian queues with a single working vacation. Then, Tian et al. [16] studied equilibrium and optimal strategies in M/M/1 queues with multiple working vacations and vacation interruptions under three different levels of system information. Doo Ho Lee [17] observed customer’s equilibrium joining/balking behaviors in M/M/1 queues with a single working vacation and vacation interruptions. About comprehensive and excellent study on strategic queueing systems with vacations, readers are referred to chapter 10 in the book of Hassin [18].
As for the work studying pricing strategy and server profits in queueing systems, Sun et al. [19] studied price decisions of the two servers in equilibrium under competition in batch-arrival queues with complementary services. They introduced two pricing scheme: ex-postpayment (EPP) scheme and ex-antepayment (EAP) scheme. Under EPP scheme, the server imposes a price proportional to sojourn time after completing service, while under EAP scheme the server charges a fee before offering service. Ma and Liu [20] observed customers’ equilibrium behavior and optimal pricing strategies in a discrete-time queue under EPP and EAP schemes. Doo ho Lee [21] dealt with optimal pricing strategies in almost and fully unobservable queues with negative customers. Moreover, Sun et al. [9] analyzed social optimization and profit maximization and they got optimal price for motivating customers to adopt socially optimal strategies. Wang and Zhang [22] investigated the optimal price that maximized server profits in a retrial queue with delayed vacations.
In this paper, we consider almost and fully unobservable queueing models with delayed multiple vacations. As to the almost unobservable case, arriving customer can only observe the state of the server and cannot observe the number of present customers before making decision, while for the fully unobservable case, a customer cannot observe the states of the system. The aim of this paper is to study the socially optimal strategies and optimal pricing strategies in the context of an unobservable M/M/1 queue with delayed multiple vacations.
This paper is organized as follows. Descriptions of the model are given in Section 2. In Sections 3 and 4, we determine social benefit and optimal pricing strategies in the almost unobservable and fully unobservable cases, respectively. In Section 5, some numerical examples are presented to illustrate the effect of several parameters on socially optimal strategies and optimal social benefit (server profits) in two cases. Finally, conclusions are given in Section 6.
2. Model Description
Consider a single-server M/M/1 queue with delayed multiple vacations where customers arrive according to a Poisson process with a rate . In a busy period, service times are supposed to be exponentially distributed with a rate . Upon the completion of service, if there is no customer in the system, the server will stay in the system for a period of time, called the delayed period or the changeover time. The delayed period follows an exponential distribution with parameter . If a customer arrives during the delayed period, the server starts to serve the customer immediately. Otherwise, after the delayed period, the server will take a vacation immediately at the end of delayed period. If the system is still empty after a vacation, the server will take another vacation; otherwise he begins to server the customers. The vacation times are exponentially distributed with a rate .
We assume that the interarrival times, the service times, and the vacation times are mutually independent. In addition, the service discipline is first in first out (FIFO).
Denote by the number of customers in the system at time t and let Then, the process is a two-dimensional continuous-time Markov chain with the state space .
Arriving customers are assumed to be identical and they can decide whether to join or balk upon arrival. We assume that every customer receives a reward of units after completing service and suffers a waiting cost of units per time unit that the customer remains in the system. Then, a customer’s utility consists of a reward for receiving service minus a waiting cost based on a linear cost structure. Customers are risk neutral and maximize their expected net benefit. Finally, we assume that there are no retrials of balking customers nor reneging of waiting customers.
In the following sections, we consider social benefit and server profit under two different cases.
3. Almost Unobservable Queues
In this section, we focus on the almost unobservable case, where arriving customers can only observe the state of the server at their arrival instant and do not know the number of customers present.
3.1. Social Optimization
There are four pure strategies available for a customer, i.e., to join the queue or not to join the queue at the state . A pure or mixed strategy can be described by a fraction , which is the probability of joining when the server is at the state , and the effective arrival rate of the arriving customers is . The transition diagram is illustrated in Figure 1.

Using the lexicographical sequence for the states, the transition rate matrix can be written as the tri-diagonal block matrix: where
The structure of indicates that is a quasibirth and death process (see Neuts [23]). To analyze this QBD process, we need to solve the matrix quadratic equation for the minimal nonnegative solution and this solution is called the rate matrix and denoted by .
Because the coefficients of (4) are all upper triangular matrices, we can assume that has the same structure as Substituting and into (4), we get the following results: and where
Denote the stationary distribution as
Theorem 1. Assuming that and , the stationary probabilities are as follows: where
Proof. Using the matrix-geometric solution method, we have and satisfies the set of equations where . Solving (14) and letting , we get By (13) and (15), we obtain Finally, can be determined by the normalization condition.
From (10) and (11), we get the steady-state probability that the server is in state , denoted by , as follows: So the effective arrival rate is
The probability generating function of the queue length, denoted by , is as follows: By the stochastic decomposition property, we get the mean number of the customers in the system, denoted by ,
Hence, the mean sojourn time of a customer who decides to enter upon his arrival can be obtained by using Little’s law And the social benefit when all customers follow a mixed policy can now be easily computed as follows:
The goal of a social planner is to maximize overall social welfare. Solving the nonlinear programming (NLP) problem max (of course, , ), we can obtain the socially optimal strategy to maximize social welfare.
Let Then, we have So the social benefit can be expressed as follows:
The first partial derivative of the function in (26) and (27) can be expressed as Therefore, the first and two partial derivatives of are given as
Since the objective function expression is more complex, we only provide a specific and sufficient condition to ensure convex programming, and in this case we get the best joining probability.
Theorem 2. If and , the optimization problem of is a convex maximization problem and there exists a unique optimal joining strategy .
Proof. The optimization problem of has three constraints: and , and they are all real-valued linear functions. Therefore, the set of constraints is convex.
Let be the Hessian matrix of the objective function, and it has the form of According to the definition of the convex maximization in Boyd and Vandenberghe [24], if and , is negative definite and the objective function should be strictly concave in the feasible region, which leads to the optimization problem of being a convex maximization problem. One property of convex programming is that the local optimal solution is the global optimal solution.
According to the discussion above, we can use Lagrange multiplier method, Karush-Kuhn-Tucker conditions, or Newton method to find the optimal solutions. However, the explicit form of the optimal is too long and complicated; we only introduce the method and give numerical results later. Details about two above methods can be found in Boyd and Vandenberghe [24]. This completes the proof.
Figure 2 shows that social benefit is strictly concave and takes the maximum. We also obtain .

3.2. Profit Maximization
Now, we consider a monopolistic server that sets a profit-maximizing price under EPP and EAP schemes.
EPP scheme model. We first consider the EPP scheme model in which the server sets a fee proportional to the customer’s sojourn time. Let be the fee charged by the server and be the expected server profit per time unit. In this case, we have the following results regarding the customer’s utility.
Lemma 3. In the almost unobservable queue, the expected utility of an ordinary customer who decides to join the system is .
Proof. Let be the conditional expected sojourn time of an ordinary customer who decides to join at the state . Then, we have . And the conditional expected benefit of such a customer who decides to enter the system at the state is .
So for any customer who is willing to enter the system, his expected utility is At customer’s equilibrium, if , we also have and obtain . Hence, at customer’s equilibrium, we find that the toll is a function of the customer’s joining probabilities, denoted by and For the system with delayed multiple vacations, the monopoly’s problem is a nonlinear programming problem to maximize with and : Here, and in are replaced by and . Then the objectives of a monopolistic server and the society are consistent with each other, so we can induce the socially optimal joining probabilities by an appropriate price, which also maximizes profit of the server.
Moreover, as mentioned above, if the server chooses the optimal pricing strategy, the expected benefit of the customer equals
Based on the above analysis, we can give the following theorem.
Theorem 4. In the EPP scheme model, if and have the same sign, there exists a unique equilibrium strategy where customers join the queue with probabilities and and the optimal price , and the server’s maximal profit is .
Proof. Using the standard methodology of equilibrium analysis, we derive the following equilibria.
If , that is, : Because and have the same sign, so we have and . In this case, even if no other customer joins, the residual utility of a customer who joins is less than or equal to 0. Therefore, is an equilibrium strategy. Moreover, in this case, balking is a dominant strategy.
If , that is, , and : In this case, for every strategy of the other customers, the tagged customer has a positive expected net benefit if he decides to enter. Hence, the equilibrium strategy .
If : In this case, if all customers who find the system empty enter the system with probability , then the tagged customer suffers a negative expected benefit if he decides to enter. Hence, does not lead to an equilibrium. Similarly, if all customers use then the tagged customer receives a positive benefit from entering; thus also cannot be part of an equilibrium mixed strategy. Therefore, there exists a unique satisfying , , and . From (41), it can be seen that is the zero point of the function . In other words, they are also customers’ equilibrium strategy.
EAP scheme model. Under the EAP scheme model, the server imposes a flat entry fee for all those who decide to join. Let be the server’s expected profit per time unit. The expected benefit for a customer is equal to . We can obtain customers’ equilibrium when , and we obtain . At customer’s equilibrium, is a function of and and has the form of .
Similar to the EPP scheme, we establish the following NLP problem to maximize with respect to and : And we also have the following theorem.
Theorem 5. In the EAP scheme model, if and have the same sign, there exists a unique equilibrium strategy where customers join the queue with probabilities and and the optimal price and the server’s maximal profit is .
From the above analysis, we can see that the server can make the equilibrium strategy and the socially optimal strategy unified by setting the appropriate price.
Figure 3 shows that optimal prices and first increase when the arrival rate is very low and then decrease. Therefore, an increase in demand may result in a reduction in price.

4. Fully Unobservable Queues
In this section, arriving customers can observe neither the state of the server at their arrival instant nor the number of customers present.
4.1. Social Optimization
There are two pure strategies available for a customer, i.e., to join the queue or not to join the queue. A strategy can be described by a joining probability , and the joining rate is . The transition diagram is illustrated in Figure 4.

The stationary distribution of the system when all customers follow a given strategy can be obtained by taking in the almost unobservable queue. So we have where
From (43), we get the mean queue length, denoted by , as follows: By using Little’s law, the mean sojourn time of a customer who decides to enter upon his arrival is as follows: So the social benefit per time unit can now be easily computed as
Let denote the socially optimal joining probability , and let be the root of the first-order optimal condition . They have the following relation.
Theorem 6. If and , then ; if and or , then .
In the analysis below, we will provide a specific and sufficient condition to ensure that the function in (47) is unimodal and strictly convex, and under this condition we obtain the optimal joining probability.
Theorem 7. Suppose . If and , then
the social benefit is unimodal and strictly convex in , where if , then . Otherwise, .
Proof. Let . The first two derivatives of the mean sojourn time are given by and The second-order derivative of the social benefit is Thus, if , then , and the social benefit is unimodal and strictly convex in .
In Figure 5, we observe that first increases and then decreases with and have a maximum value at . Hence, we have .

4.2. Profit Maximization
Under the EPP scheme, the price proportional to the customer’s waiting time is charged by the server. Thus, the expected residual utility of a customer is . If , we have . Hence, at customer’s equilibrium, the price is expressed with the joining probability , The monopoly’s problem is a NLP problem to maximize with : This is also a problem of maximizing social benefit. The price that induces social optimization and profit maximization equals , and the server’s maximal profit is . What is more, under the optimal price, is also an equilibrium strategy.
In the EAP scheme model where the server imposes a flat fee for services, the expected benefit for a customer is equal to . When , we obtain customers’ equilibrium strategies and . At customers’ equilibrium, is a function of joining probability and has the form of . The monopoly’s problem is to maximize .
So the optimal price is , and the server’s maximal profit is . This price makes customers’ equilibrium strategy consistent with the socially optimal strategy.
Figure 6 shows the socially optimal and monopolistic price for the unobservable queue.

From the above analysis, we can observe that the EPP and EAP mechanisms do not affect the customer equilibrium joining rate and the server’s maximum profits.
5. Numerical Examples
In this section, we present some numerical analysis to investigate the sensitivity of socially optimal joining probabilities for customers and optimal social benefit (expected server profits) on system parameters.
We first explore the sensitivity of the socially optimal entrance probabilities. From Figures 7–10, we can make the following observations of socially optimal joining strategies for the almost and fully unobservable cases. The optimal probability in the fully unobservable queue is always between two optimal joining strategies in the almost unobservable case. However, the relative ordering of and cannot be determined. Therefore, when customers cannot observe the information about server’s state, they enter the queue with an intermediate strategy between those that the customers use when they are given the server’s state. As for the sensitivity about the system’s parameters, we observe that socially optimal strategies are first increasing with respect to service rate in Figure 7, and when continues to increase all probabilities are equal to 1. It is intuitive. That is, regardless of whether the customer has status information for the server, an increase in the service rate will reduce the expected waiting time of the customer. Figure 8 shows that optimal strategies decrease with arrival rate . This is because the system is more congested and an arriving customer is less willing to join the system when increases. In Figures 9 and 10, the behavior of optimal probabilities varies. While for the most part they are all increasing with , which is intuitive, there is a range of small values of in which is decreasing. When the vacation time becomes shorter, the customer will be more willing to enter in vacation, which will lead to system congestion in the busy period. When increases, the server can make the vacation faster. This may result in too many customers entering the system during the vacation period and the customers who arrive on busy period are less willing to enter.




Subsequently, we turn to the social benefit or server profit under optimal strategy for two information levels. In Figure 11, the social benefit or server profit first monotonously increases and then keeps stable when exceeds the certain value. The reason is that customers’ waiting time is shorter when is small, which makes the social welfare improve. And when the arrival rate becomes larger, the customer’s maximum arrival rate tends to be stable. Figures 12 and 13 show that the social benefit or server profit is always increasing with service rate and vacation rate , which is intuitive. This is because the utility of individual customer becomes larger and the social benefits are also improved. Figure 14 shows that the social benefit or server profit decreases with respect to . This is because shorter delayed time results in too many customers entering the vacation, which leads to system congestion and reduces social benefits. From these figures we find that . In order to get more social benefits, the social planner can appropriately disclose the status information of the server to the customers.




Finally, we are concerned with the delayed period which is the special feature of this paper. Taking the fully unobservable queue as an example, comparison work is carried out to show which mechanism, multiple vacations or delayed multiple vacations, is better in controlling the mean queue length and improving the social welfare. Figure 15 shows that mean queue length is increasing with respect to , and the social benefit first increases and then decreases with and has a maximum in Figure 16. On the other hand, Figures 15 and 16 show that the mean queue length for queues with vacations is always larger than that for queues with delayed vacations, and the social benefit for queues with delayed vacations is always greater than that for queues with vacations. Because in queues with vacations, when the system is empty, the server immediately begins to take a vacation and customers suffer a longer waiting time and a greater waiting cost, while in queues with delayed vacations. Some of the customers who arrived after the system was empty are served during the delayed time. All in all, delayed vacation somewhat relieves the system congestion problem and increases social benefits.


6. Conclusion
In this paper, we discuss the socially optimal joining probabilities and optimal pricing strategies of the server in the almost and fully unobservable M/M/1 queueing systems with delayed multiple vacations. We first obtain the steady-state performance measures and social benefit in two different cases. Meanwhile, we assume that the server can choose the optimal pricing strategies that maximize the monopoly profit under ex-postpayment (EPP) scheme and ex-antepayment (EAP) scheme. Moreover, the servers’ maximum profits in the two pricing schemes are also identical and have equal optimal social benefit. Finally, we compared socially optimal strategy and optimal social benefit for almost and fully unobservable cases numerically, and we observed that the disclosure of system information had certain benefits for social planner. Furthermore, the direct generalization is the study of the corresponding non-Markovian queues. One can also study the pricing strategies for other queues with sojourn time-dependent reward.
Data Availability
All data included in this study are available upon request by contact with the corresponding author.
Conflicts of Interest
The author declares that they have no conflicts of interest.
Acknowledgments
This research was supported by the National Natural Science Foundation of China (No. 11601469), the Hebei Province Natural Science Foundation of China (No. A2018203088), the Science Research Project of Education Department of Hebei Province (No. ZD2018042), and the Science and Technology Research and Development Project of Qinhuangdao (No. 201502A221).