Abstract
Digital finance provides an ideal entrepreneurial environment for returning migrant workers (RMWs). From the perspective of entrepreneurs, many scholars have quantified the factors affecting entrepreneurship, as well as the entrepreneurial environment, theorized the importance, motives, and internal/external impactors of RMW entrepreneurship, and put forward quite a lot of countermeasures. This paper innovatively evaluates how digital finance influences the efficacy of RMW entrepreneurship. Firstly, the authors established an influencing factor analysis model and an RMW entrepreneurship model and explained principles for the structural equation modeling of the influence of digital finance on RMW entrepreneurship efficacy. Next, the traditional partial least squares (PLS) regression was optimized, the optimal initial iteration values (IIVs) were obtained, and the algorithm convergence was achieved. Finally, a multilayer structural equation model (SEM) was constructed to evaluate the influence of digital finance on RMW entrepreneurship efficacy. The proposed algorithm and model were proved valid and feasible through experiments.
1. Introduction
China has entered the new era of mass entrepreneurship and innovation. Effective entrepreneurial activities are the foundation of national development and an important aspect of building a well-off society. Returning migrant workers (RMWs), i.e., the migrant workers returning from cities to their hometowns form a large group of potential entrepreneurs and have a rather large need to start their own businesses.
RMW entrepreneurship provides an effective solution to the economic problems of RMWs and creates more job opportunities for rural surplus labor [1–4]. By promoting rural economy, RMW entrepreneurship contributes to the construction of new socialist countryside, which is advocated by the Chinese government [5–7].
Free from the shackles of physical financial outlets, digital finance, with a low marginal cost, offers an ideal entrepreneurial environment for RMWs. To fuel the success and enthusiasm of RMW entrepreneurship, it is particularly important to sort out the effects of digital finance on RMW entrepreneurship.
Since the birth of the strategy of rural vitalization, China and the Chinese society have attached great importance to RMW entrepreneurship [8–10]. Ferreira et al. [11] carried a field survey on the relationship between rural revitalization and the local employment of rural surplus labor, constructed a binary logistic regression model, and carried out an SPSS-based analysis on the efficacy of RMW entrepreneurship under different changing factors, including age, gender, family background, local policy support, and regional economic level.
It is of certain practical significance to study the obstacles to RMW entrepreneurship [12–14]. Santoro et al. [15] divided these obstacles into network obstacles and policy obstacles, tested these factors through Kaiser–Meyer–Olkin (KMO) test and Bartlett’s test of sphericity, and extracted 21 principal components. Ferreras-Méndez et al. [16] summarized the obstacles to RMW entrepreneurship through interpretive structural modeling (ISM) and decision-making trail and evaluation laboratory (DEMATEL) and divided five classes of core obstacles into different layers. Castellano et al. [17] designed a backpropagation neural network (BPNN) to predict the number of RMW entrepreneurs and forecasted that number in 2015–2020 with self-designed test set and training set.
The advancement of social informatization and proliferation of Internet finance have spurred the development of rural economy. But there is not yet a mature information platform or network system. In the field of rural revitalization, many researchers are interested in how modern digital finance information platform supports the flexible, adaptive, and continuous development of largescale RMW entrepreneurship [18–20]. Khalid [21] deeply explored the influence of rural digital finance informatization on RMW entrepreneurship and described the ideas and countermeasures for building a reasonable service system of rural digital finance informatization.
Combined with national conditions, most studies focus on analyzing the factors affecting entrepreneurship from the perspective of entrepreneurs and qualifying the entrepreneurial environment. There are many theoretical analyses on the importance, motivation, and internal/external impactors of RMW entrepreneurship, leading to numerous countermeasures and suggestions [22–26].
Digital finance is a simple, low-cost, highly collaborative technology that facilitates interaction and communication. Considering the boom of digital finance in China, this paper introduces the technology to evaluate the efficacy of RMW entrepreneurship. The main contents are as follows: (1) setting up an influencing factor analysis model and an RMW entrepreneurship model and explaining the principles for the structural equation modeling of the influence of digital finance on RMW entrepreneurship efficacy; (2) optimizing the traditional partial least squares (PLS) regression to overcome the problem with model parameter estimation, solving the optimal initial iteration values (IIVs) under the constraints of reasonable least squares sense and unit vector length, and presenting the idea of algorithm convergence; (3) building a multilayer structural equation model (SEM) to evaluate the influence of digital finance on RMW entrepreneurship efficacy. The proposed algorithm and model were proved valid and feasible through experiments.
2. Principles of Structural Equation Modeling
By the causes of behaviors, this paper divides the factors affecting RMW entrepreneurship in the context of digital finance into external factors and internal factors. According to the influencing factor analysis model in Figure 1, the internal dimension includes education level, personal quality, interpersonal network, and other personal factors of RMWs, while the external dimension covers three aspects: policy supports of digital finance, macro environment of market economy, and social service environment.

Next, an RMW entrepreneurship model was constructed (Figure 2). The model separates the RMW entrepreneurship process into multiple phases: generating entrepreneurship motives, identifying entrepreneurial opportunities, making entrepreneurial decisions, acquiring entrepreneurial resources, and delivering entrepreneurial results. However, some digital finance factors affecting the implementation of RMW entrepreneurship cannot be measured directly, but be characterized indirectly with measurable indices. These factors include regional entrepreneurial atmosphere, digital finance network embedding, and cognitive embedding. Specifically, the creation of entrepreneurial atmosphere, digital finance network embedding, and digital finance model can be characterized by the degree of digital finance operation that of digital marketing and that of digital finance services; the cognitive embedding can be characterized by the willingness, motivation, and decision of entrepreneurship. If the variation in RMW entrepreneurship efficacy only brings changes to the scale of entrepreneurial assets, which is directly measurable, then the measured scale of entrepreneurial assets can be used as a yardstick of the variation in RMW entrepreneurship efficacy. As a result, the SEM with latent variables becomes increasingly popular in the fields of corporate performance and entrepreneurial behavior.

This paper intends to evaluate the influence of digital finance on the efficacy of RMW entrepreneurship. According to the incubation model of RMW entrepreneurship (Figure 3), the RMW entrepreneurship covers multiple stages, such as generation of entrepreneurial motives, identification of entrepreneurial opportunities, making of entrepreneurial decisions, acquisition of entrepreneurial resources, and delivery of initial entrepreneurial results. The model is a multilayer statistical analysis tool containing multiple latent and observable variables. It is extremely difficult to estimate the model parameters. Relying on the overall variation, PLS regression provides a suitable tool to handle complex SEMs, with no strict requirements on the distribution of observations or the number of samples.

To disclose the influence of digital finance on RMW entrepreneurship efficacy, this paper constructs a causality-based SEM to estimate the degree of the said influence. Let and be the n endogenous latent variables and l exogenous latent variables of the SEM, respectively; ZHn×n be the factor loading matrix of β; Θn×l be the factor loading matrix of δ; and be the residual. Then, the relationship between the latent variables of the model can be described by
The measured variables of the SEM can indirectly characterize the latent variables. Suppose the SEM contains N measured variables, which measure M samples for M times. Then, the measured data can be compiled into an M×N matrix. Let aij and bij be the measured variables related to δτ and βi, respectively; Φτj be the aggregation coefficient from measured variables to exogenous latent variables; and σδτ be the corresponding error term. Then, the relationship between the exogenous latent variables and measured variables can be described by
Let ζij be the aggregation coefficient from measured variables to endogenous latent variables and σβi be the corresponding error term. Then, the relationship between the endogenous latent variables and measured variables can be described by
Suppose , , , and . Then, formula (2) can be rewritten in a general form:
Formula (3) can also be rewritten in a general form:
By reverse thinking, the variation in a measured variable of the SEM is driven by the changes of the corresponding latent variable. Let θij and σδτj be the loading coefficient and error term from latent variables to endogenous measured variables, respectively. Then, we have
Let μij and σβij be the loading coefficient and error term from latent variables to exogenous measured variables, respectively. Then, we have
Suppose . Then, formulas (6) and (7) can be rewritten in a general form:
Combining formulas (1), (4), and (5), a formative SEM with forward measurements can be established:
Combining formulas (1), (8), and (9), a reflective SEM with reverse measurements can be obtained as
Traditionally, SEM parameters are solved through PLS regression. The specific iterative process of this approach can be expressed as
However, the PLS regression algorithm often fails to converge to a suitable solution, owing to the randomness of IIVs. The IIVs must be selected suitably to ensure algorithm convergence.
3. IIV Optimization and Algorithm Convergence
The traditional PLS regression proceeds with the IIVs randomly set by formulas (2) and (3). Here, the IIVs are optimized by formulas (6) and (7) or formulas (8) and (9) for the indeterminate SEM, under the constraints of reasonable least squares sense and unit vector length. For the measured variable , any component aτi is involved in M measurements aτj=(aij1, … , aijM). By right multiplication of , formula (8) can be converted into
Suppose latent variable δτ is a unit vector satisfying . Then, we have
Formula (14) can be expanded into
In the left matrix, every element is a product of vectors; in the right matrix, every element is a product of numbers. The corresponding diagonal elements of the two matrices are equal. Hence, we have
By formula (16), the loading coefficient θτ can be estimated as . The estimation of θτ is biased, because the variance of the error term is ignored. Thus, the parameter estimation needs to be optimized to eliminate the bias. Suppose δτ and σaτ are independent of each other, and CM(σaτ) = 0. Then, there exists an equation CM(δτσaτ) = CM(δτ)CM(σaτ) = 0. Let be the covariance matrix ∑σaτ of error vector σaτ. Then, formula (8) can be converted into
Since the diagonal elements of the symmetric matrix ∑σaτ are the variances in different dimensions, we have
Comparing with the diagonal elements in matrix 14, we have
To solve formula (19), the first step is to solve . Suppose , and exists. Let be a diagonal element of . Drawing on factor analysis, can be estimated by
Combining formulas (19) and (20),
Similarly, can be solved. Finally, it is necessary to estimate latent variable δτ. Suppose . Then, the W-th component of δτ can be expressed as
By left multiplication of , formula (22) can be converted intowhere characterizes the influence of σaτ on aτ. Then, formula (23) can be improved by the least squares principle:
Then, δτW can be estimated by
The estimated value of Φτj can be obtained by substituting the estimated δτ into formula (2). By the same method, the other parameters can be computed, namely, and . In other words, the optimal IIVs for the PLS regression under the constraint of unit vector length satisfy
The following is a discussion of the convergence of the PLS regression with optimal IIVs. Formula (1) can be converted into
In the evaluation model for the influence of digital finance on RMW entrepreneurship efficacy, matrices ZH and ZHτ = I−ZH are both triangular matrices, where the sum of diagonal elements equals 0. Besides, |ZHτ| = 1, and exists. Then, we havewhere ; . If the PLS solutions to latent variables δ and β are obtained under the constraint, it is possible to find the PLS solutions to the factor loading matrices ZH and Θ. If the PLS solution of formula (28) is directly iterated without being unitized, then . In this case, it is necessary to verify the convergence of the PLS regression. Expanding the i-th component of the PLS solution,
Let U(i) be the number of δj related to βi. Then, the least squares sense of formula (29) can be described by
Combining formula (30) with formulas (2) and (3),
Formula (31) is equivalent to the mutual projections of two unconstrained subspaces. Thus, the solution to the formula approximates the minimum of zero. Therefore, the improved PLS regression can converge iteratively through the control of any parameter error.
4. SEM for Influence of Digital Finance on RMW Entrepreneurship Efficacy
In traditional SEM, latent variables are directly connected with measured variables via a single layer of paths. Considering the complexity of RMW entrepreneurship process, this paper constructs a multilayer statistical model containing multiple latent and observable variables. Figure 4 presents the evaluation model for the influence of digital finance on RMW entrepreneurship efficacy.

Our evaluation model contains 29 observable variables. The eight latent variables, namely, digital operation δ1, digital marketing δ2, digital finance services δ3, entrepreneurial motives β1, entrepreneurial willingness β2, entrepreneurial foundation β3, entrepreneurial decisions β4, and entrepreneurial performance β5, correspond to 6, 3, 2, 0, 5, 6, 4, and 3 measured variables, respectively.
Let aτj be the measured variable corresponding to an exogenous latent variable of the model. Then, the measurement equation between exogenous latent variables and measured variables can be established as
Let θτj be the loading coefficient between the τ-th exogenous latent variable and the j-th measured variable and σaτj be the error corresponding to the j-th measured variable of the τ-th exogenous latent variable. Then, formula (23) can be expanded into
Let θij be the measured variable corresponding to an endogenous latent variable of the model. Then, the measurement equation between endogenous latent variables and measured variables can be established as
Let μij be the loading coefficient between the i-th endogenous latent variable and the j-th measured variable and σbτj be the error corresponding to the j-th measured variable of the i-th endogenous latent variable. Then, formula (34) can be expanded into
Figure 5 shows the paths of the evaluation model. Let ω12 and ω13 be the paths from δ2 and δ3 to β1, respectively, and . Then, a structural equation can be established for a high-level variable β1 not connected to any measured variable:

Based on the least squares sense, the optimal IIVs can be solved for θτj, μij, βi, δi, and ητj. Substituting the IIVs of βi and δi into the basic SEM, it is possible to solve ω, γij, and μij. Further, the βi estimates could be derived for low-level and high-level endogenous latent variables. Let ζij and be aggregation coefficients; K(i) be the number of measured variables corresponding to the i-th low-level endogenous latent variable; and V(i) be the number of exogenous latent variables corresponding to the i-th high-level endogenous latent variable. Then, low-level endogenous latent variables could be linked up with measured variables based on the βi estimates. Then, the structural equation based on the least squares sense can be given by
Let bij be the measured variable corresponding to an endogenous variable of the model. Each endogenous latent variable is linked with each measured variable via loading coefficient and error. Then, the relationship between the two values can be measured by
Then, the aggregation coefficients can be solved by formulas (37) and (38).
5. Experiments and Results Analysis
The RMW entrepreneurship samples under the effect of digital finance were collected through a questionnaire survey. After normalization, the collected data were separately processed by the traditional PLS regression and our improved PLS regression, aiming to disclose the relationship between the latent variables in our evaluation model. Tables 1 and 2 present the estimates of the latent variables and their coefficient relationship. Tables 3 and 4 provide the calculation results of endogenous and exogenous variables under optimal IIVs and random IIVs, respectively. The insignificant difference between the results of the two algorithms suggests the effectiveness of our algorithm. The improved PLS regression, which introduces the optimal IIVs to the traditional PLS regression, could obtain the path coefficients between all variables and calculate the influence coefficient of all measured variables for RMW entrepreneurship under the effect of digital finance. The digital finance impacts on RMW entrepreneurship efficacy could be ranked clearly by our method. The results of our method are more accurate than the traditional PLS regression.
Table 5 shows the cumulative variances of inputs and outputs in RMW entrepreneurship. It can be inferred that five variances can explain 82.53% of all information. Comparing the paths between latent variables, five core variables, including fixed asset investment, equipment rent, research fund, human resources investment, and digital finance platform operation cost, greatly affect the state of latent variables. These core variables promote every phase of RMW entrepreneurship, such as identifying entrepreneurial motives and laying entrepreneurial foundation.
Table 6 shows the cumulative variances of inputs and outputs in policy supports of digital finance. It can be inferred that five variances can explain 84.94% of all information. Comparing the paths between latent variables, five core variables, namely, fiscal support, implementation of policy supports, protection of intellectual property rights, openness of policy supports, and coverage of policy supports, greatly affect the state of latent variables. These core variables promote the entrepreneurial environment.
Table 7 shows the cumulative variances of inputs and outputs in third-party digital finance service providers. It can be inferred that six variances can explain 87.52% of all information. Comparing the paths between latent variables, six core variables, namely, precision financial services, independent collection and docking of financial data, online-offline integrated service model, construction of mobile payment scenes, construction of rural big data platform, and risk control of financial digitization, greatly affect the state of latent variables. These core variables promote the entrepreneurial environment, entrepreneurial motives, and entrepreneurial performance.
Tables 8 and 9 show the cumulative variances of RMW entrepreneurial community and digital finance environment, respectively. It can be inferred that 95.07% of the information of RMW entrepreneurial community could be explained by three variables, and 85.62% of the information of digital finance environment could be explained by two variables. Comparing the paths between latent variables, five core variables, namely, incubation model of entrepreneurial community, degree of coordination between upstream and downstream enterprises, completeness of entrepreneurial services, banking service, and insurance service, have large influences on the state of other latent variables.
Tables 10 and 11 show the cumulative variances of macro environment of market economy and social service environment, respectively. It can be inferred that 95.27% of the key information of macro environment of market economy could be explained by two variables. Four variables exert large influences of the state of subsequent latent variables, namely, industrial structure, per-capital consumption, contribution to regional economy, and contribution to regional employment. The changes of the four variables significantly influence the entrepreneurial environment, entrepreneurial motives, and entrepreneurial performance.
6. Conclusions
This paper mainly evaluates the influence of digital finance on the efficacy of RMW entrepreneurship. The first step is to build an influencing factor analysis model and an RMW entrepreneurship model and detail the principles for the structural equation modeling of the said influence. Next, the traditional PLS regression was optimized, the best IIVs were obtained, and the algorithm convergence was guaranteed. After that, a multilayer SEM was established to evaluate the said influence. Then, our improved algorithm was found to be more accurate than the traditional PLS regression through comparative experiments, which estimate the latent variables and their coefficient relationships and derive the endogenous and exogenous latent variables under optimal and random IIVs. In addition, the cumulative variances of multiple factors (i.e., the inputs and outputs in RMW entrepreneurship, the inputs and outputs in policy supports of digital finance, the inputs and outputs in third-party digital finance service providers, RMW entrepreneurial community, digital finance environment, macro environment of market economy, and social service environment) were summarized to identify the core measured variables that greatly affect the latent variables.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that they have no conflicts of interest.