Abstract
This paper investigates the problem of approximating the global minimum of a positive semidefinite Hankel matrix minimization problem with linear constraints. We provide a lower bound on the objective of minimizing the rank of the Hankel matrix in the problem based on conclusions from nonnegative polynomials, semi-infinite programming, and the dual theorem. We prove that the lower bound is almost half of the number of linear constraints of the optimization problem.
1. Introduction
The matrix rank minimization problem, or minimizing the rank of a matrix subject to linear constraints, recently has gained tremendous popularity due to wide applications in various fields including machine learning [1], compressed sensing [2, 3], system identification and control [4, 5], and computer vision [6]. This problem can be formulated as the following program:where is the optimization variable, is the rank of , is a linear map, and . A special case of this problem is the matrix completion problem [2], recovering an unknown low-rank matrix from a sampling of its entries. For this special case, the constraints are simply a subset of the entries of the matrix. The majority of existing work on algorithms for problem (1) has concentrated on this special case. Many papers addressed methods to solve the matrix completion problem, for instance, the singular value thresholding algorithm [7], the accelerated proximal gradient algorithm [8], the matrix factorization-based approach [9], and the facial reduction [10–12]. Various other methods are well documented in [13–15] and references therein.
In many engineering and statistical applications, for example, dynamical systems problems [16] and time-domain signal problems [17], the recovered matrix of problem (1), in addition to being low-rank, is required to have a Hankel structure. Driven by the wide applicability of the Hankel matrix minimization problem, there have been several algorithms available for solving it. Based on the nuclear norm model which minimizes the nuclear norm of the matrix instead of its rank, Fazel et al. [4] discussed several first-order algorithms for solving the structured rank minimization problem, including gradient projection methods, proximal point algorithms, and alternating direction methods of multipliers. Several evolutionary algorithms were proposed by Cai, including an iterative hard thresholding algorithm based on the nuclear norm model [18], an alternating direction method of multipliers based on Vandermonde factorization of a Hankel matrix model [19], and a projected Wirtinger gradient algorithm [20]. The major drawbacks of these methods are that the solutions obtained are not guaranteed to be a good approximate global solution for the problem since there have been no approximation bounds developed for these methods. Moreover, they are highly sensitive to some parameters which depend on a guess of the rank of the concerned matrix. Hence, in view of improving performance of algorithms for solving the problem, it is important to find an approximation for the rank of the optimal Hankel matrix. Along this direction, in this paper, we provide a lower bound on the minimum rank of a positive semidefinite Hankel matrix minimization problem with linear constraints, which is modeled as the following program:
Note that the positive semidefinite Hankel matrix completion problem is a special case of problem (2).
Most literature about estimation of the rank of a matrix concentrates on the rank of the optimal matrix of a positive semidefinite program [21–24] or a band matrix [25–28]. Though the minimum rank of problem (2) is guaranteed theoretically through these methods from the previous work, they are not efficient since the structure emerging from a positive semidefinite Hankel matrix is not completely utilized. To our knowledge, this work is one of the first efforts to address the lower bound of the objective value of the positive semidefinite Hankel matrix rank minimization problem. We first construct a semidefinite program to describe the feasible region of problem (2); then derive an equivalent semi-infinite program for the dual of the semidefinite program using a conclusion from nonnegative polynomials; and finally, by the optimality conditions for the semi-infinite problem, we prove that a lower bound for the minimum objective of problem (2) is almost half of the number of its constraints.
The remainder of the paper is organized as follows. Section 2 summarizes some notations and preliminaries that serve as fundamentals of analysis. In Section 3, a detailed stepwise analysis is shown to convert our problem into a semi-infinite program. Moreover, we establish the main results under some mild assumptions. Finally, we conclude this paper in Section 4.
2. Preliminaries
In this section, we introduce some notations and basic definitions which will be used in the remaining part of the paper and some lemmas that serves as fundamentals to recast a positive semidefinite system as an equivalent semi-infinite program.
Notations. Let be the -dimensional Euclidean space and be the set of real matrices. and denote the rank and trace of a matrix , respectively. A matrix is symmetric positive definite (resp., positive semidefinite) and is denoted by (resp., ). represents the -th component of vector , and represents the entry of matrix . is the number of elements of set . The symbol represents the transpose. represents the identity matrix. is the set consisting of the polynomials with coefficients in and degree at most . An -degree polynomial : is written as . Denote coefficients of polynomial by . means is a nonnegative polynomial. Considering a collection of matrices , where , we writefor the operations between a collection of matrices and a vector (resp., a matrix), where and . Let be a base of the Hankel matrix space, which is defined as follows:
Definition 1. An real Hankel matrix is a matrix of the form:where . It is obvious that an Hankel matrix could be completely determined by its last row and first column, a total of entries. Let . The Hankel matrix can be constructed by vector and can be expressed uniquely as based on (3) and (4). Thus, program (2) can be rewritten as follows:where .
Problem (6) is a special case of the matrix rank minimization problem (1) due to the special structure of the positive semidefinite Hankel matrix. However, the additional positive semidefinite Hankel structure requirement comes at a price, with theoretical and practical difficulties involved in the process of solving such a problem. Hence, we present some lemmas to provide tools to simplify the high technical requirement.
Lemma 1. (see [29]). Suppose that and . Then, is a sum of squares, that is,where .
We are now ready to state a lemma, which accords us with a mechanism to transform a semidefinite positive system to a nonnegative polynomial, which is the most useful part of our main result.
Lemma 2. Suppose and is the associated coefficient vector. Then, the following statements are equivalent:(a)(b)There exists a matrix such thatwhere is defined in (4)
Proof. Because and , according to Lemma 1, we havewhere .
Let and , then and , which means thatDefine ; then,Based on the equivalence of the coefficients of on the two sides of the above equation, we obtain and complete the proof in one direction.
For the other directions, since , ( is the rank of ), which together with , it implies that
The following lemma, which can be obtained from ([21], Theorem 2), provides us a useful property of the optimal matrix of program (6).
Lemma 3. Let be the cone of positive semidefinite matrices and be an affine subspace of the space of symmetric matrices such that the intersection is nonempty and bounded. If codim , where , then there is a matrix such that .
3. A Lower Bound on the Optimal Objective of a Positive Seimdefinite Hankel Rank Minimization Problem
In this section, we investigate a lower bound on the minimum rank of problem (5). Beginning with a lemma, we first explore an important property of the optimal value of problem (6).
Lemma 4. The objective value of problem (6) is small than .
Proof. An equivalent formulation for program (2) is stated as follows:where represents the floor function. It is clear that the affine subspace of the above program, denoted by , can be expressed by the following system:It is evident that codim , which meanswhere is as defined in Lemma 3. A direct application of the statement of Lemma 3 to estimating the objective value of program (2) results in . Recalling , it follows that , which implies . Hence, we prove the desired conclusion of the lemma due to the equivalence of programs (2) and (6).
Lemma 4 shows that optimal solutions of program (6) lie exactly on the boundary of the feasible region since the rank of that optimal solutions is smaller than . This says that there exists a nonzero vector such that program (6) and the following semidefinite program have the same solution:Hence, it suffices to estimate the objective value of program (16). It is easy to see that system (16) is a semidefinite program, which can be solved by many methods, for example, the interior point method [30] and the feasible direction method [31]. However, these methods just give numerical solutions for given and and cannot provide analytical solutions for any and . Hence, we explore a novelty method to solve this issue.
Recall that the dual of program (16) is given bywhere and are associated multipliers.
Hereafter, we state a lemma showing that the strict complementarity holds almost everywhere, whose most essential nature is given by ([32], Lemma 2).
Lemma 5. Suppose that and are primal and dual optimal solutions, respectively. Then, we have thatholds almost everywhere.
Proof. For program (16), the KKT conditions areLet and . Using the complementarity conditions , we have and eigendecompositions as follows:with and . Plunging eigendecompositions (20) into the KKT system, (19) yields the following system of equations:This is a system of equations in variables . If , the system is unsolvable almost. Hence, holds almost everywhere. We derive the conclusion of the lemma.
Let . According to Lemma 2, program (17) can be represented as the following semi-infinite program,The lemma below states that the feasible set of program (22) is bounded.
Next, we will establish the main result on a lower bound on the optimal value of the positive semidefinite Hankel matrix rank minimization problem.
Theorem 1. If is full rank and the Slater condition holds for program (16), then the objective value of the positive semidefinite Hankel matrix rank minimization problem (program (6)), denoted as , is almost not smaller than , that is, .
Proof. Based on Lemma 4 and the Slater condition, the solution set of program (22) is nonempty and finite. Let be an optimal solution of program (22). Then, is a local minimizer of the following program:where is an finite index set of the active constraints of problem (22) at since if is infinite, we can select its finite subset such that, for all in the subset holds. Thus, satisfies the following KKT system:where is the corresponding Lagrange multiplier vector.
Equation (24) is a system of equations in variables . If , the system is almost unsolvable. Hence, holds almost everywhere. Moreover, for , we have , which means since , where . Thus, since . This says that . Consequently, according to the conclusion of Lemma 5, we almost have
4. Conclusion
This paper focuses on approximating the global optimal value of a Hankel matrix rank minimization problem subjective to linear constraints. A lower bound for this optimization problem is provided based on some conclusions from nonnegative polynomials, dual theorem, and semi-infinite problems. We conclude that the lower bound is almost half of the number of linear constraints of the problem. This result is useful for simplifying the requirement of guessing the rank of the objective matrix in existing methods for solving such problem.
Data Availability
No data were used to support the study.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Authors’ Contributions
All authors contributed equally and significantly in this paper. All authors read and approved the final manuscript.
Acknowledgments
Y. Xu was supported in part by the National Natural Science Foundation of China (nos. 11501100, 11571178, 11671082, and 11871149). X. Yan was supported in part by the National Natural Science Foundation of China (no. 11901424) and NSF of Shanxi province No. 201801D121022.