Abstract

In order to study the investment portfolio problem, this paper propose a class of stochastic programming model with rough feasible region, where randomness and roughness coexist. Based on the covering rough set, the concept of the discrete degree covering is defined to divide the rough feasible region. Furthermore, the discrete degree covering stochastic rough programming model (DDC-SRP) is constructed depending on a synthesis effect function that considers discrete degree and the expectation and variance for random objective function. Properties of the DDC-SRP model are discussed. In addition, the convexity of the DDC-SRP model is obtained in some certain conditions. Considering the random rough simulation, a genetic algorithm is introduced. Finally, a numerical example is given to show the validity of the DDC-SRP model.

1. Introduction

Investment portfolio problem is very important in resource allocation with uncertainty. For dealing with such decision-making problems with uncertainty, many scholars have introduced some models including random or fuzzy variables to formulate the uncertainty. Mean-variance portfolio selection has been a central issue in finance since Markowitz’s pioneering work [1] on a single-period investment model. Fu et al. [2] derived explicit closed-form solutions, for the efficient frontier, optimal investment strategy and for the dynamic mean-variance portfolio selection problem under the constraint of a higher borrowing rate. Yu et al. [3] considered the effects of some frequently used utility functions in portfolio selection by comparing the optimal investment outcomes corresponding to these utility functions. Zhang et al. [4] proposed a new portfolio selection model with the maximum utility based on the interval-valued possibilistic mean and possibilistic variance, which is a two-parameter quadratic programming problem. Zhang [5] discussed the portfolio selection problem for bounded assets based on upper and lower possibilistic means and variances. The mean-standard deviation model for portfolio selection can be transformed to a linear programming under possibility distributions, so this methodology can be used to solve large-scale portfolio selection problems. Guo et al. [6] considered a fuzzy multiperiod portfolio selection problem with V-shaped transaction cost. Within the framework of credibility theory, a mean-variance model is formulated with the objective of maximizing the terminal return under the total risk constraint over the whole investment. Giner [7] proposed a new and simple variance decomposition based on splitting the domain space, and through the law of total variance and the first and second moments of each truncated distribution, the requested decomposition formulas are deduced. In [8], the improved artificial bee colony (ABC) algorithm with an external population is applied to the optimization of the blockchain investment portfolio.

These above studies considered the investment portfolio problem with randomness and fuzziness. However in real life, when making portfolio selection, we choose one stock as the key investment object (core stock) and others as noncore stocks to form a portfolio. Since each combination is a partition of an alternative stock set, the alternative stock set can be regarded as a rough set. The traditional classification by regression analysis can only deal with randomness to a certain extent, but cannot deal with the roughness caused by portfolio classification of decision-makers’ subjective consciousness. So here we apply rough set theory to deal with portfolio. When we select a stock as an investment objective, we must consider many related factors, such as the national policy, the enterprise benefit, the prospects for the development of the enterprise, and so on. Because the investment risk is bigger when investors select only one stock, they often select a stock portfolio, which means that they will select a class of stocks. In order to disperse risk in investment, investors tend to choose the obvious difference enterprises or the enterprises whose futures are not related and use their stocks as an investment portfolio. For example, we choose a stock that is more affected by the market and several stocks that are less affected by the market to form a portfolio. So if one cost of stock falls, it will not affect the other stocks. Here, a core stock and other stocks associated with it are used to form a portfolio. Taking each stock as the core stock can form a portfolio. These portfolios just cover the set composed of all optional stocks, that is, all portfolios are equivalent to the division of this optional stock set. Roughness is the uncertainty caused by the division of sets, so rough set theory is used to analyze stock investment problems. When selected a stock as the investment core, the investment portfolio with the core has roughness due to the difference result of classification. Therefore, the feasible region of the considered decision problem is rough set. In this article, a class of stochastic programming model with rough feasible region is put forward in investment portfolio.

is the set of investment stocks that can be selected. In accordance with the alternatives, expected return attribute that is numerical attribute, stocks can be classified. Stocks can also be classified in accordance with the different enterprise futures that is categorical attribute. On the investment object classification, we must consider both the numerical attribute and the categorical attribute. The stocks that are in the same class will be with similar relation . is the comprehensive return value with the core investment object in each portfolio. is a set in which these elements have similar relation with . These are two investment objects in the investment portfolio. Investors’ goal is to choose the best investment portfolio in order to get the maximum expected profit, and this similar relation has the roughness based on covering.

Because this roughness based on covering, rough set, and covering rough set theory must be introduced. Rough set theory [9], a tool for data analyzing, was first proposed by Pawlak in 1982. As a new mathematical theory to model incomplete knowledge, it can deal with problems with inexact data or imprecise information for complicated systems effectively. Now many researchers are studying on rough set with randomness. Zhu [10] studied a type of generalized rough sets based on covering and the relationship between this type of covering-based rough sets and the generalized rough sets based on binary relation. In [11], several uncertainty measures of neighborhood granules are proposed, which are neighborhood accuracy, information quantity, neighborhood entropy and information granularity in the neighborhood systems. Zhou et al. [12] defined a new neighborhood rough set relation with adapted neighbors named the gap relation and propose a new online streaming feature selection method based on this relation, named OFS-A3M. With the “maximal-dependency, maximal-relevance and maximal-significance” evaluation criteria, OFS-A3M can select features with high correlation, high dependency, and low redundancy. Many researchers have used probabilistic and fuzzy set methods to study rough sets and try to formulate axiomatic systems for classical rough sets from different viewpoints [1318].

Based on rough set for the knowledge partition, rough programming is an equivalent classification method to determine the approximate domain of a given problem. Youness [19] proposed the concept of a rough programming and defined the surely optimal solution and possibly optimal solution. By equivalent classification method of rough set, the global rough optimal solution and local rough optimal solution are defined, in order to solve the problem of rough programming. Liu [20] introduced a concept of rough variable and further gave the concept of random rough variable, rough expected value model, and rough chance-constrained model. They provide a useful basis to solve the uncertainty of randomness and roughness. Xu et al. [21, 22] put forward a class of multiobjective linear programming models with random rough coefficients and gave some characters and solution methods. In [23], the concept of ‘‘rough interval’’ is introduced in the modeling framework to represent dual-uncertain parameters. Osman et al. [24] proposed that rough programming will be divided into three types in according to the roughness of the feasible domain and the objective function. They discussed the characteristics of feasible domain, optimal value, and optimal solution of the rough programming. Tao et al. [25] proposed a class of rough multiple objective programming and used it to solid transportation problem. Many researchers have studied rough programming from different viewpoints [26, 27].

The above literature puts forward some basic concepts for rough programming, but the practical application is narrow, and most of them should be discussed theoretically. This study studies the roughness of talent in the process of investment decision making, constructs an effect based rough programming model, and discusses its solution process.

Based on the above analysis, this paper made some aspects of the work: (1) by the investment portfolio problem, a class of rough programming model with discrete degree covering is proposed; (2) based on the random information and discrete degree covering roughness information compound quantitative, a class of rough programming model based on the synthesis effect and discrete degree covering is established, which is named discrete degree covering stochastic rough (DDC-SRP) model; (3) the convexity of the DDC-SRP model is analyzed; and (4) the DDC-SRP model is used into investment portfolio problem and the features of the solution are analyzed.

2. Preliminaries

In this section, we will introduce covering rough set model based on neighborhood, and the concepts of covering and neighbor can be showed in [10, 2834]. Here, U is a universe.

Definition 1. Given arbitrary and , the neighborhood in feature space is defined aswhere is a distance function. For , it usually satisfies: (1) , if and only if ; (2) ; (3) .

Definition 2. Let and be numerical attributes and categorical attributes, respectively. The neighborhood granule of sample induced by and is defined as (1) ; (2) ; (3) , where means ‘‘and’’ operator.

Definition 3. Given a set of objects U and a neighborhood relation N over U, we call a neighborhood approximation space. For any , two subsets of objects, called N-lower approximation and N-upper approximation of X in , are defined asObviously, we can see that the elements of are in X and the elements of are not in X. But the elements of are neither in X nor in . So in approximation space , X can be described by . We shall employ the following denotations: is N-positive region of X; is N-negative region of X; is N-borderline region of X; and is N-approximation precision of X. Here, , denotes the number of the elements in A. In order to describe clearly, we note that if and only if ; if and only if .

3. Discrete Degree Covering Stochastic Rough (DDC-SRP) Model

3.1. Stochastic Rough Programming Model Based on Covering Approximation Relation

In optimization problems, the core is to make a decision that can make objective function optimal in a given feasible region. The model is as follows:

Here, is a set on ; is a numerical or non-numerical function on . That is said is the feasible region; is the objective function.

In according to the difference of feasible region and objective function, model (1) can be divided into crisp programming and uncertain programming. In this paper, we will discuss a class of uncertain programming whose objective function contains random variables and feasible region is a rough set based on similar relation. This model is called rough programming model based on similar relation.

There are too many decision problems with the above characteristics such as equipment renewal, resource management, complex system optimization, and so on. In different problems, the effect of is different, but the decision processes involve the elements that have relation with in . That is the N-upper approximation of X in . Therefore, the model of rough programming model based on similar relation is as follows:

Here, is a set on ; is a numerical or non-numerical random function on . The effect on can be described by the expectation and variance of , which is called direct effect; is a similar relation on . means . If we only consider the expectation of random variable, we can get the expectation value model:

The effect on can be also described by which is called indirect effect. Therefore, when we make a decision by rough programming, direct effect and indirect effect must be considered together. Indirect effect can be represented by a function . Then direct effect and indirect effect can be considered as a function , which named synthesis effect function. Synthesis effect function should satisfy the following conditions.

Condition 1. For any given , is monotone nondecreasing in each .

Condition 2. For any given , is monotone nondecreasing in each .

Condition 3. For any given , is monotone nondecreasing in each .

Condition 4. is monotone increasing in each .
Let , , , model (2) can be converted into the following model (4):Model (4) is named rough synthesis effect model. The following two kinds of functions are synthesis effect function. , , .
We see that rough programming can be seen as composite of a stochastic programming and a similar relation on . With different synthesis effect operators, the rough programming model can be converted into different crisp programming models.

3.2. Description of DDC-SRP Model

In Section 3.1, in according to several typical decision-making problems, general form of the rough programming model is given and rough synthesis effect model is proposed. In different problems, represents different means. So how to build the indirect effect metric mode is a key part in solving rough programming problem. In this section, we use dispersion as indirect effect and propose the DDC-SRP model.

Definition 4. If means the value of attribute , thenMeans discrete degree of .
Using as indirect effect of , model (4) can be converted into the following model (5), which named the DDC-SRP model:Here, is a random vector.
Specially, if , model (5) can be converted into model (6)Here, means the lowest level of discrete degree of .
Let , model (5) can be converted into model (7)As above analysis, we can see that using indirect effect and synthesis effect function a rough programming model can be converted into a crisp programming model. that reflects the objective function can be seen as a supplement of solution and can be seen as a correction of solution . For example, , is the index that can describe the characteristics of random variable. Here , . If variance is smaller the effect that is represented by is better. Parameters reflect the effect degree from to . If is bigger, the effect degree from to is bigger. Specially, means that is not considered.
is the index that can describe the characteristics of discrete degree of . Here . If discrete degree is bigger, the solution is better. Parameters reflect the effect degree from to the solution. If or is bigger, the effect degree from to the solution is bigger. Specially, means that is not considered.
Using above analysis, we can get the following steps of building the DDC-SRP model.Step 1: With similar relation , we can get , which is an indirect effect of the solution.Step 2: Using synthesis effect function , we can synthesize expectation and variance of random variable and discrete degree. Then we can build the DDC-SRP model.

3.3. Features of the DDC-SRP Model

Theorem 1. If we select a class of special synthesis effect functions, the DDC-SRP model (9) can be converted into the expectation model (7) of rough programming.

Proof. We use a class of synthesis effect functions . If we let , the DDC-SRP model (9) can be converted into the expectation model (7) of rough programming.

Definition 5. A real function is pseudo-concave, if for all and ; is a set of convex sets, is a function on , is pseudo-concave if for all and .

Theorem 2. If random variable function is joint convex with respect to , and probability measure Pr is pseudo-concave, then for any given confidence level , , chance constrained set is a convex set.

Proof. Let , then and . SoBecause probability measure Pr is pseudo-concave, thenTherefore, . So .
It shows that chance constrained set is a convex set.

Definition 6. Let be a convex set, be a given random variable on probability space , and be a random variable function on .(1)If for any given and , then is stochastic convex with respect to on ;(2)If for any given and , then is stochastic concave with respect to on .

Definition 7. Let be a convex set. If for any given and , then is convex with respect to on

Theorem 3. Let be a convex set, be a given random variable on probability space . If(1) is joint convex with respect to on ,(2)for any given , is convex with respect to on ,(3) is stochastic concave with respect to on , and(4) is convex with respect to .

Then the DDC-SRP model (9) is a convex programming.

Proof. From Condition 2, we can see that for any given , .
SoIt shows that is convex with respect to on .
Because is monotone nondecreasing in each , we can getFrom Condition 3, we know thatSo is monotone nondecreasing in each , soFrom Condition 1, we can see that is joint convex with respect to on . SoTherefore, the objective function of model (5) is a convex function. So model (5) is a convex programming.
If the DDC-SRP model is a convex programming, we can use 0.618 method, Newton method, restricted function method, etc. to solve it. But when we deal with some complex problems, the DDC-SRP model is a nonconvex programming, and the above methods cannot be used. Genetic algorithm is an effective method to solve such programming problems. In part 4, we will give the solution procedure by genetic algorithm, and an example about investment portfolio problem will be given in part 4.

4. Solution Procedure

In what follows, we will give the solution procedure as chart 1 to solve stochastic rough programming problem. Here we will establish the DDC-SRP model by synthesis strategy of objective and discrete degree of .Step 1 Input the initial programming model. Based on the investment portfolio problems, a class of stochastic programming model with rough feasible region is established. Step 2 Select the proper a discrete degree and a synthesizing effect function . The discrete degree that we select must follow the Definition 4 and the synthesizing effect function that we select must meet Conditions 14 in Section 3. We can also select a class of synthesizing effect functions such as . With different coefficients , we can get synthesizing effect functions. Step 3 Convert the rough programming model to the DDC-SRP model. Through proper synthesizing effect functions, we can convert the stochastic programming model with rough feasible region to the DDC-SRP model. The DDC-SRP model is a programming model without uncertainty. Step 4 Solve the DDC-SRP model by genetic algorithm. In following, we introduce the genetic algorithm based on real coding and random rough simulation. Step 4.1 Input the parameters population size Npop-size, crossover probability Pc, mutation probability Pm. Step 4.2 Initialize Npop-size chromosomes and check feasibility of chromosomes. Step 4.3 Update the chromosomes by crossover and mutation operations and check the feasibility of offspring. Step 4.4 Select the chromosomes by spinning the roulette wheel. Step 4.5 Make the crossover operation and the mutation operation. Step 4.6 Repeat the Step 2 to the Step 5, until completes the cycle index. Step 4.7 Get the best chromosome as the optimal solution and obtain the optimum value. Step 5 Output the solution.

A flowchart for the solution procedure is provided, as shown in Figure 1.

5. Number Example

A investor want to choose a portfolio among 20 given stocks. The total fund is 100, 000 dollars. The performance and relevance of the 20 stocks in the following Table 1. that is the profit of jth stock (j) is a random variable. Here, obey normal distribution whose expectation is 18 and variance is 4. obey normal distribution whose expectation is 12 and variance is 4. obey normal distribution whose expectation is 4 and variance is 2.

Remark 1. The data in Table 1 come from the data of China’s stock market in the past 10 years, which are obtained through the database of China stock network. Here, in the past 10 years, it is “good” if the stock market value fluctuation increases, and it is “poor” if the stock market value fluctuation decreases. N2 represents the relevance of the relationship between the rise and fall of the stock and the rise of the market, which also comes from the data of China’s stock market in the past 10 years. The worse the relevance, the greater the N2 value of the two stocks. Here, N2 is calculated according to the change relevance statistics of the rise and fall of the two stocks.
is a set of all stocks that can be chosen. From Table 1, we can see that performance () is a categorical attribute and relevance () is a numerical attribute. Using categorical attribute , can be divided into two parts. Then we can get the following rough programming model.Here, . is a covering under . is a random vector represent the profit of stock in .Each core investment object must come from the class of “good” that is . The discrete degree of the portfolio must be as big as possible in order to disperse risk. Here let the relevance distances between every secondary investment object of and core investment object are bigger than 2 (). Then we can get the following coverings.Furthermore, in the same investment portfolio, the relevance between every two secondary investment object can be closer than the relevance between every secondary investment object and core investment object. In a same , the relevance distances between every two secondary investment object can be smaller than the relevance distances between every secondary investment object of and core investment object. So the relevance distances between every two secondary investment object are bigger than 1 (). Then we can get the following coverings.Next we will get the optimal portfolio from the .
Using synthesizing effect function , model (8) can be converted into the DDC-SRP model with discrete condition:Here is the weight of the stock ().
Using genetic algorithm (its parameters setting are binary code mode; mutation probability is 0.001; crossover probability is 1; population size is 80; evolutionary generations is 1000), we can get Table 2.
Using genetic algorithm (its parameters setting are binary code mode; mutation probability is 0.01; crossover probability is 0.8; population size is 80; evolutionary generations is 1000), we can get Table 3.
Using genetic algorithm (its parameters setting are binary code mode; mutation probability is 0.1; crossover probability is 0.6; population size is 80; evolutionary generations is 1000), we can get Table 4.
By Tables 24, we can see that every portfolio contains three stocks (one core investment object and two secondary investment objects). Because the weights of core investment objects are 2 and weights of secondary investment objects are 1, the amount of investment of core investment object is 50,000 dollars, and amount of investment of each two secondary investment object is 25,000 dollar.
Obviously, in , a and b are the parameters describing the uncertainty consciousness. When the weights of core investment objects are 2 and weights of secondary investment objects are 1, Table 2 shows the every solutions in different decision-making consciousness. Using expectation value model, the portfolio is . It means that stock (7) is core investment object, and stocks (9) and (16) are secondary investment objects. In the DDC-SRP model, when is smaller, the investor will pay more attention to the expectation of income in each stock. So the result is same to the expectation value model. When is bigger, the investor will pay more attention to the income risk in each stock that can be represented by variance. When are bigger, the investors will pay more attention to the risk of investment portfolio that can be represented by discrete degree.
With different mutation probability and crossover probability, the solution is different. When the mutation probability is smaller and the crossover probability is bigger, the kinds of solutions are smaller.
As above analysis, if we let , we can get the following coverings.These coverings does not contain and . Through calculating the results for the DDC-SRP model (13) with discrete condition is same as the DDC-SRP model with discrete degree .
When we select the DDC-SRP model with discrete degree , using genetic algorithm, we can get the Tables 57.
In Tables 57, we can see that using the expectation value model, the portfolio is . In the DDC-SRP model, with different parameters a and b, the optimal solutions are or which contain different decision-making consciousness. In the DDC-SRP model, when is smaller, the investor will pay more attention to the expectation of income in each stock. So the result is same to the expectation value model. Other situations show the investors will pay more attention to the income risk and the risk of investment portfolio, which is the same as Table 2. With different mutation probability and crossover probability, the solution is different. When the mutation probability is smaller and the crossover probability is bigger, the kinds of solutions are smaller.
When we select the DDC-SRP model with discrete degree , the feasible region is only one covering . Therefore, the optimal solutions both in the expectation value model and the DDC-SRP model is with different mutation probability and crossover probability. When we select the DDC-SRP model with discrete degree , the feasible region is with different mutation probability and crossover probability.
Using the DDC-SRP model, investors can comprehensively consider expectation of income, income risk, and risk of investment portfolio. The above give optimal solutions of the DDC-SRP model with synthesizing effect function and discrete degree , which is the division way of feasible region. When we use different synthesizing effect functions and different division way of feasible region, we can get more results.

6. Conclusion

In this article, a class of the stochastic programming model with rough feasible region is proposed to study the investment portfolio problem. The DDC-SRP model is developed that contains the expectation value model and improves the stochastic programming method. We analyze the convexity of the proposed model. With real-coded and random rough simulation genetic algorithm, we verify the variable of the solutions. Then the advantages and disadvantages of different solutions in different decision-making consciousnesses are shown. This method is more concise and effective than traditional programming solution models in investment portfolio problem [35].

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the Youth Top Talent Project of Research Project of Humanities and Social Sciences in Colleges and Universities of Hebei Province (BJ2021088), the Soft Science Research Project of Innovation Capacity Promotion Program of Hebei Province (21557629D), and the Research Project on the Development of Social Sciences in Hebei Province in 2021 (20210201325).