Abstract
Exploring the impact of the urban built environment on online car-hailing, an increasingly popular mode of urban transportation, is crucial for developing corresponding transportation strategies and addressing sustainable urban planning. This paper captures that the form and orientation of the neighborhood network have a certain degree of influence on travel behavior. Therefore, the road network orientation entropy indicator is introduced based on the originally built environment elements, and the orientation-order (OO) indicator calculation model is constructed to normalize it. This study further tests and verifies its impact on the travel behavior of online car-hailing, thereby improving the factor characterization of neighborhood design latitude in the urban built environment. From the perspective of spatial correlation and spatial heterogeneity, the optimal models are selected by model comparative analysis, namely, the spatial Durbin model (SDM) and the mixed geographically weighted regression model (MGWR). Based on the optimal model estimation results, the influence mechanism of the urban built environment on the spatial-temporal distribution of online car-hailing travel in three cases (considering road network density (RD), considering RD and OO, considering OO) is compared and analyzed. The results show that the model fitting considering the OO is the best in the above three cases, and the OO has a significant impact on DiDi travel compared with the RD in this study, which verifies the rationality and necessity of selecting the OO in this study. In addition, catering service, corporate business, and OO have significant positive spillover effects, while the spillover effects of sports and leisure service and land-use mix are negative. The indicators of the bus station, residential district, catering service, shopping service, corporate business, land-use mix, life service, and OO have significant spatial heterogeneity.
1. Introduction
In recent years, with the continuous expansion of the urban scale, it is vital to excavate the influence of the urban built environment on traffic travel. Built environment refers to the human environment constructed and configured to provide the needs of human activities, including the spatial environment formed by the interaction of multiple factors such as land-use form, transportation infrastructure, and urban design [1]. At present, “five Ds” indicators are mostly used to characterize the built environment, including density, diversity, design, destination accessibility, and distance to transit [2]. Based on the development background of big data technology, online car-hailing emerges as a new business model with its convenient and flexible booking service. The development of online car-hailing can integrate idle vehicle resources and improve the utilization rate of vehicles, which eases the difficulty of people taking taxis to a certain extent and conforms to the development concept of sharing economy. However, the rapid development of the emerging transportation mode will also bring corresponding difficult challenges to urban traffic planning and management. Such challenges mainly include how to guide and manage the development of DiDi travel, how to integrate it into a variety of transportation systems (e.g., cars, buses, taxis, subways, and nonmotorized traffic), and how to reasonably integrate the built environment policies (e.g., regional development planning, comprehensive land development, and street network construction) with transportation policies (e.g., relevant policies such as bus, taxi, and online car-hailing operation management) [3], which are also issues that researchers need to focus on at present.
Related studies have shown that the built environment is significantly correlated with the travel behavior of online car-hailing [4–9]. However, different researchers often select different indicators to measure the built environment. For example, in previous studies, the neighborhood design dimension in “five Ds” is often characterized by factors such as road network density or intersection density. When the above density is small and the impact on travel behavior is not significant, researchers believe that the impact of neighborhood design on travel behavior is not significant [3]. However, the form and orientation of the neighborhood network are crucial to defining the spatial logic and order of the city, and this feature of neighborhood design is not taken into account in the above analysis. Therefore, it is necessary to find another indicator to reflect the orientation and order of the neighborhood to further improve the representation of the dimension of the neighborhood design. In addition, the influence of environmental variables may vary with the urban spatial forms, and neglecting the spatial influence may lead to inaccurate results. Thus, when modeling and analyzing the impact of the built environment on travel behavior, it is necessary to further consider the influence of spatial dependence (spatial autocorrelation) and spatial structure inhomogeneity (spatial heterogeneity) so as to more accurately grasp the impact mechanism of urban form factors on online car-hailing travel [10–12].
Therefore, this study focuses on the selection of urban built environment measurement indicators and the selection and analysis of modeling methods, specifically including three main tasks. Firstly, based on the principle of the street network model and the quantification and visualization of street network orientation entropy by Mohajeri and Gudmundsson [13], we use OpenStreetMap software to construct the calculation model of orientation-order indicator to improve the selection of independent variables of the urban built environment, that is, taking orientation entropy as one of the characterization factors of independent variables of the built environment. Secondly, this study constructs models from two different perspectives of spatial correlation and spatial heterogeneity using the DiDi travel data of Chengdu released by DiDi and POI data collected by Gaode Map, selects the models with a higher fitting degree, and then determines the factors that can better reflect the characteristics of neighborhood design. Finally, the modeling results are analyzed to explore the spatial and temporal distribution of DiDi travel, which provides a theoretical basis for reducing the empty driving rate of DiDi trips, optimizing the operation of the urban road network, and forecasting the spatial demand of DiDi travel.
The remainder of this paper is organized as follows. Section 2 reviews relevant literature studies. Section 3 describes the data and variables used in this study. The construction of spatial correlation and heterogeneity models is described in detail in Section 4. In Section 5, the optimal models of spatial correlation and spatial heterogeneity are selected, the estimation results of the optimal models are presented in detail, and the influences of built environment indicators (especially the orientation order) on DiDi travel are discussed. The final section summarizes the main findings and proposes future research directions.
2. Literature Review
2.1. Studies on the Relationship between the Built Environment and Travel Behavior of Online Car-Hailing
For online car-hailing, previous studies have focused on the characteristics and preferences of passengers [14, 15], the matching degree of online car-hailing services with passengers [16], scheduling strategies [17], and so on. In recent years, scholars have begun to follow the influence of the built environment on the travel behavior of online car-hailing, which is mainly reflected in two study areas. Most of the studies were based on the data of online car-hailing in the United States. For example, Alemi et al. [4] used the California Millennials Dataset to prove that land-use diversity and destination accessibility have significant impacts on the demand for Uber. Yu and Peng [5, 6] divided RideAustin data into two types: weekday travel data and weekend travel data. It is found that population density, sidewalk density, land-use mix, and traffic accessibility are positively correlated with ride demand. Sabouri et al. [7] illustrated that the demand for Uber in New York was positively correlated with the land-use mix, population density, and transit travel density. Another few studies were based on the data of DiDi travel in the local area of the Second Ring Road of Chengdu, China. For instance, Li et al. [3] demonstrated that the distance to CBD and the mixing degree of land use were positively correlated with DiDi ride-hailing and argued that recreation and entertainment POI and residential district POI were the main influencing factors for nighttime ride-hailing. With the promotion of sharing travel, the carpooling services approach to traffic has become popular, and ridesplitting orders can be further extracted from DiDi orders. Related studies have confirmed that the distance to the CBD and the land-use diversity affect the ridesplitting rate [8, 9].
Given the studies on the impact of the built environment on online car-hailing travel behavior, we find that, in previous studies, the measurement factors (e.g., road network density or intersection density) of neighborhood design dimension have three types of effects on online car-hailing travel. First, road network density or intersection density has a significant positive impact on travel behavior. For example, Yu and Peng [5, 6] found that higher road density could attract more online car-hailing travel, mainly “because of higher exposure to available ridesourcing vehicles on the road and less response.” Second, contrary to their findings, Sabouri et al. [7] indicated that the intersection density was negatively correlated with Uber travel demand, which might be attributed to the relative advantages of other transportation modes, such as private cars, buses, pedestrians, or bicycles in areas with denser street networks. Third, Li et al. [3] found that the road network density had no appreciable impact on the boarding ridership of online car-hailing. The possible reason is that the selection of appropriate variables should be based on local conditions rather than simply transplanting those.
In summary, there are two main problems in current studies on online car-hailing. First of all, the influence of all dimensions (namely, “five Ds”) of the built environment is not fully controlled when selecting variables in the above study, which may lead to deviation in the degree of influence of the built environment on online car-hailing travel. Secondly, when the influence of road network density on online car-hailing travel is not obvious, it is necessary to further find and verify new indicators to characterize neighborhood design so as to more accurately determine whether it has an impact on travel behavior. Therefore, to accurately measure the impact of the built environment on online car-hailing travel behavior, it is necessary to further improve the selection of built environment measurement factors.
2.2. Studies on Orientation Entropy of Street Network
Spatial networks, such as streets, paths, and traffic routes, organize human travel in complex urban systems, which shape travel behavior, location decisions, and urban fabric. In particular, the form and orientation of street networks help to define the spatial logic and order of the city [18, 19]. Previous studies usually used geographic information systems (GIS) to directly measure geometric grid, neighborhood, and street attributes, such as street connectivity, density, and intersection types [20]. Accordingly, researchers have recently focused more on the patterns, performance, complexity, and configuration of street networks. A stream of research has explored the nature of entropy and order in urban street networks, attempting to quantify the spatial order and disorder patterns of the urban circulatory system. It showed that entropy measures can well reveal the order and disorder of urban streets, in which the order refers to a higher degree of the grid and has better connectivity, while the disorder is vice versa [13, 21]. Furthermore, scholars often use entropy to study the order of street networks, quantify street orientation, and evaluate the degree of order by visualizing how the street network follows the geometric logic of a single grid. Barthelemy et al. [22] revealed that entropy has a profound theoretical connection with the complexity of urban blocks in the urban blocks spatial network, which will have a certain impact on the travel behavior of urban traffic. Further, the spatial order of the urban order system can be quantified by exploring the essence of urban street network entropy [23]. Based on the study of circuity, order, and entropy, Boeing [24] investigated cities around the world using OpenStreetMap data, explored the patterns and relationships of the street networks, and obtained the network structure of different cities along the process of human interaction and transportation. In addition, it is suggested that future work can consider the importance of taking multiple indicators in concert to study their impact on travel behavior.
Through the above studies, it can be seen that street network orientation entropy has an impact on travel behavior. Therefore, the spatial logic and geometric order generated by the orientation of the street network should be considered when analyzing the impact of the built environment on online car-hailing behavior; that is, the orientation entropy is taken as one of the factors to measure the neighborhood design in “five Ds,” and then the degree of its influence on travel behavior is measured.
2.3. Studies on Spatial Correlation and Heterogeneity
To explore the relationship between the built environment and travel behavior, the traditional methods are mainly ordinary least squares (OLS) [25] and the discrete choice model in the four-stage method (e.g., logit models [4] and probit models [26]), but the above models can only be used for global estimation without considering the spatial properties [27]. However, travel behavior is highly sensitive to the urban built environment and will change with spatial change. Ignoring spatial instability may lead to inconsistent parameters or inaccurate test results. To further study the spatial correlation and spatial heterogeneity of the built environment on travel behavior, scholars have used the spatial econometric model to explore the following. On the one hand, considering the spatial dependence of travel demand, scholars have used the spatial correlation model [28, 29] (e.g., spatial autoregressive (SAR) model; spatial Durbin model (SDM); spatial error model (SEM)) to capture the direct and spatial spillover effects of the built environment on travel demand. Chen et al. compared and analyzed the impact of the urban built environment on taxi travel demand based on OLS, SAR, SDM, and SEM models and obtained the optimal fitting result of the SDM model; that is, this method can not only capture the spatial interaction between dependent variables and independent variables but also consider the spatial correlation of explanatory variables, namely, spatial spillover effect [29]. On the other hand, considering the impact of spatial heterogeneity on travel, relevant researchers have adopted the spatial heterogeneity correlation models (e.g., geographically weighted regression (GWR); multiscale geographically weighted regression (MGWR)), which effectively revealed the spatial variation of the influencing coefficients of various influencing factors in the study region [30, 31]. For instance, Li et al. [3] used the GWR model to explore different impacts on spatial dimensions, and Yu and Peng [5] further constructed a geographically weighted regression Poisson distribution model (GWPR) to analyze the spatial heterogeneity in different periods (morning peak, subpeak, and evening peak).
Through the above summaries, it can be seen that current studies only consider one of the above two aspects, while this study uses the same data to contain both the spatial dependence and spatial heterogeneity of travel behavior and more accurately determine the impact degree of the built environment on online car-hailing travel.
3. Data Preparation
3.1. Study Area and Data Source
This study selected the DiDi trajectory data and order data of the local area of the Second Ring Road in Chengdu, China (longitude: 30.65–30.72; latitude: 104.04–104.12), from November 1 to November 30, 2016. The trajectory dataset comes from the DiDi Gaia Data Opening Plan, including driver ID, order ID, timestamps, longitude, and latitude; the order data contains the order ID, start and end timestamps, and longitude and latitude of pick-up and drop-off locations. The study area of this paper is consistent with the data coverage. To study the impact of the built environment on online car-hailing travel on a fine scale, the study area was categorized into 289 grids with a 0.5 km 0.5 km grid as the analysis unit, and then the GPS data of pick-up points were subject to coordinate transformation and map matching. The grid division of the study area is shown in Figure 1(a).

(a)

(b)
The geographical location information of the study area is as follows. The study area mainly includes Jinniu District, Qingyang District, Jinjiang District, and Chenghua District, as indicated in Figure 1(b). Jinniu District, located in the northwest of Chengdu, is the central district of Chengdu and has the north railway station of Chengdu, which is the largest comprehensive transportation hub in the southwest of China. Qingyang District is located in the west of Chengdu, with famous scenic spots such as Qingyang Palace, Kuanzhai Alleys, and business district. The urban land is mainly used for commercial and residential purposes, with a relatively dense road network and developed public transportation facilities and the Chengdu west railway station located in this area. Jinjiang District, located in the southwest of Chengdu, is a historic and prosperous business district with a high degree of land-use mix because it has the East Street known as “the first street of Shu” and the Chunxi Road known as “the Centennial Golden Street.” Chenghua district is located in the eastern part of Chengdu’s central city, most of which are outside the Second Ring Road. Compared with the other three districts, this district has lower road network density and less number of public transport facilities.
3.2. Dependent Variable Analysis
To explore the built environment factors affecting online car-hailing travel, this study selected the traffic volume of each grid as the dependent variable. Firstly, the travel volume of DiDi orders in one month was statistically analyzed, which showed a changing trend with a cycle of seven days, as shown in Figure 2(a). Further, the travel orders of one month were analyzed in different periods, and the average travel volume of each period in a week was calculated, as shown in Figure 2(b). It can be seen that the distribution of hourly travel volume on weekdays (Monday to Friday) is relatively close, including four peak travel periods, namely, 8:00 to 9:00, 13:00 to 14:00, 16:00 to 17:00, and 20:00 to 21:00. The travel volume from 0:00 to 5:00 on rest days (Saturday and Sunday) was more than that on weekdays, and the travel peak was delayed until 13:00, which may be due to more free time on weekends and people’s travel time being relatively free.

(a)

(b)

(c)

(d)
To dissect the travel behavior of DiDi from the fine-grained level, this paper mainly dissects the travel behavior of DiDi on weekdays, so the average daily travel volume data on weekdays were selected to analyze the temporal and spatial distribution of passenger flow, as illustrated in Figures 2(c) and 2(d). The hot spots of passenger flow travel were mainly concentrated in the vicinity of Chunxi Road and Wanda Plaza in Jinjiang District and Qingyang District within the study area, and the passenger flow was small in the evening, increased sharply in the morning, and then reached the peak at noon. It could be inferred that the impact of the urban built environment on online car-hailing travel may change with time and space. A large travel volume during the peak period has the greatest impact on the overall travel of the road network compared to the off-peak period, and the current policy measures are mostly carried out based on the research results during the peak period. Therefore, to further study the above space-time evolution rules, the monthly average DiDi travel volume during the peak period (13:00 to 14:00) was selected in this paper.
3.3. Independent Variable Analysis
Based on the analysis of travel characteristics of dependent variables, this study found that the peak time of DiDi’s travel volume is different from that of other travel modes (e.g., cars and taxis) from 13:00 to 14:00. At this time, the overall travel condition of road traffic was relatively smooth compared with the peak travel in the morning and evening of the city, which may lead to the phenomenon that road network density or intersection density has no significant impact on online car-hailing travel. At this time, it is necessary to improve the attention of neighborhood orientation and order and test its impact on the travel behavior of online car-hailing. Therefore, based on only selecting road network density or intersection density to characterize neighborhood design, this study drew on the principle of orientation entropy, normalized it to obtain orientation-order indicator, and tested and verified its influence on travel behavior so as to measure the characterization factors of neighborhood design more accurately. In this study, nineteen elements were selected to represent the “five Ds” indicators of the built environment and were used as independent variables in subsequent modeling research, as detailed in Table 1.
3.3.1. Density and Destination Accessibility Indicator Selection
Density mainly includes population density and thirteen POIs data. POIs data incorporate residential district, catering service, shopping service corporate business, sports and leisure service, science zone, public facility, finance and insurance services, education and culture, life service, medical and health, government and administration, and accommodation service. Bus station POI and distance to CBD were used to represent the distance to transit and the destination accessibility, respectively.
3.3.2. Diversity Indicator Selection
The land-use mix was taken as the measurement of land-use diversity, and the calculation formula was obtained by applying the principle of the Herfindahl Hirschman Index, as shown in formula (1).where is the total POIs numbers in grid i, is the total POIs numbers of class j in grid i, and m is the category of POIs.
3.3.3. Neighborhood Design Indicator Selection
The selection of block design indicators in this section mainly includes the following two parts:(1)Orientation entropy calculation: this study relied on Shannon’s entropy principle [24], the street orientation was divided into 36 categories (boxes), the true north was set to 0°, and 36 intervals were created at an interval of 10°. If the street orientation falls in the interval, the orientation is adjudged to be in that interval, which could more intuitively determine the degree of blocks disordered; that is, the smaller the orientation entropy, the higher the degree of gridding and the better the connectivity and street orientation order, and vice versa. The calculation of orientation entropy was based on python’s OSMnx toolkit, from which the spatial data, including city boundaries and streets, can be downloaded from OSM and normalized into the dimensionless natural unit. The calculation formula of orientation entropy is shown in formula (2), and its schematic diagram is displayed in Figure 3. where represents the street orientation entropy, n is the total number of boxes, k refers to boxes k, and refers to the proportion of streets with orientation k.(2)Orientation-order indicator calculation: the orientation-order indicator was obtained by normalizing the orientation entropy, which means linearizing the location of the city from complete disorder (uniform distribution) to perfect order (grid distribution), making its value between 0 and 1 to achieve the effect of normalization. The calculation formula is as follows: where is the maximum entropy; that is, the street bearing is perfectly and uniformly distributed in 36 intervals of the street, and its value is 3.584; represents the ideal grid entropy, and the blocks are arranged in four equal proportions (e.g., east-south-west-north), with a value of 1.386.

If the orientation order , it indicates the uniform distribution of streets in all orientations; if , it means that the streets appear as a perfectly ordered grid. That is, the larger the orientation-order indicator, the higher the degree of grid and connectivity of the block and the better the direction order of the street, and vice versa. Relevant studies have shown that considering the four indicators, namely, orientation order, average circuity, average node degree, and median street segment length, can present the characteristics of the street network. In addition, the orientation-order indicator is significantly correlated with the other three spatial order indicators of the above four indicators (the orientation-order is moderately negatively connected with the average circuity and median street length and moderately positively correspond with the average node degree) [10, 23, 24]. Therefore, this study selected the orientation-order indicator as one of the characterization factors of neighborhood design.
Through the above analysis, the heatmaps of the respective variables selected in this paper are shown in Figure 4.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)

(i)

(j)

(k)

(l)

(m)

(n)

(o)

(p)

(q)

(r)

(s)
4. Methodology
4.1. Spatial Correlation Model Construction
To explore the influence of spatial correlation, the spatial nonfixed effects caused by the changes of independent variables due to the different study regions were considered, and the spatial correlation models were established to clarify the spatial direct effect and spillover effect of the built environment on demand for DiDi travel.
4.1.1. Model Construction
This part aims to explore the impact of the urban built environment on the spatial correlation of online car-hailing. The general form of the spatial econometric model adopted is as follows [29, 32]:where represents the DiDi travel volume within traffic zone i; is the built environment variable in traffic zone i; is the spatial weight matrix; , , and are the spatial lag term of the explained variable, the explanatory variable, and the error property, respectively; n is the number of traffic zones; and are the spatial correlation coefficients of explained variables and explanatory variables, respectively; is the independent variable regression coefficient; is the autocorrelation coefficient of spatial error; is a random error term; is a random perturbation term.
Based on the general form of the spatial econometric model, the spatial econometric models selected in this study are shown in Table 2.
4.1.2. Direct Effect and Spillover Effect
The spatial econometric modeling analysis can evaluate the complex spatial dependence between traffic zones. The influence of explanatory variables on the DiDi travel volume in the traffic zone itself and adjacent traffic zones not only includes the direct impact (direct effect) of explanatory variables in certain traffic subareas on the DiDi travel volume in the traffic zone but also changes the travel volume in the adjacent traffic zone (spillover effect) to a certain extent [29]. SDM model introduces weight variables (including explanatory variables and explained variables of other traffic zones) on the basis of independent variables to realize the measure of the above direct effect and spillover effect [28]. Further, the SDM model is described in the form of a vector:where is the lag coefficient of spatial variable, is the identity matrix, , and . Matrix transformation is performed on (5), as shown in the following:
Through the diagonal analysis of (6), the average direct effect , the average total effect , and the average spillover effect are as follows:
4.2. Spatial Heterogeneity Model Construction
The classical GWR model is often used to explain spatial heterogeneity, but there are two defects in selecting this model in this study. On the one hand, it is assumed that all parameters in the model are kept at the same scale by default, but this study should select different scales of bandwidth for different built environment variables, which is contrary to the setting principle of the GWR model. On the other hand, the GWR model overemphasizes that the local influence of variables is greater than the whole, and the influence degree of all variables varies with space. The MGWR model refines the bandwidth by allowing variables to have different spatial smoothing terms, which solves the limitation that all variables in GWR have the same bandwidth. At the same time, the specific bandwidth of each variable can also be used as a measure for each spatial process scale, which makes the spatial process model generated by multiple bandwidths more practical [33].
Based on the MGWR model, this study examined the spatial scale differences of online car-hailing travel factors under the influence of orientation-order indicator and road network density, respectively. The formula is as follows:where represents the regression coefficient of the local variable, is the bandwidth used by the regression coefficient of the variable j, represents the spatial coordinates of the sample point, is the observed value of the variable j at the sample point i, and is the random disturbance term. This study selected the bandwidth according to the Akaike information criterion (AIC) [34], which is widely used to measure the quality of the statistical model.
5. Results and Discussion
To verify the accuracy of the two factors of orientation order and road network density to characterize the neighborhood design under the data statistical condition of this study, the modeling and comparative analysis were carried out in three cases: in Case 1 (RD), road network density was used to characterize neighborhood design as one of the independent variables; in Case 2 (RD and OO), both road network density and orientation-order indicator were put into the model as independent variables; in Case 3 (OO), the orientation-order indicator was selected.
Before modeling the spatial econometric model, the distribution characteristics of spatial data should be investigated first. Moran’s I was applied to test whether there is a correlation between the same independent variables in different traffic zones, and then Moran’s I values were obtained as shown in Table 3. From the analysis in Table 3, according to value, the global Moran index values of all variables are significant (except population density); that is, there is a significant spatial correlation between adjacent traffic zones.
5.1. Optimal Model Selection
Through models comparison, this section selects the best fitting model from spatial correlation and heterogeneity.
5.1.1. Spatial Correlation Optimal Model Selection
The influence analysis of spatial correlation needs to select an appropriate spatial econometric model. Previous studies mostly relied on the Lagrange multiplier test and other methods. In this study, regression analysis is carried out for each model, and the optimal model is selected according to the judgment criterion that the greater the judgment coefficient (adjusted R2) and log-likelihood are, the smaller the AIC is. The model results of the peak period (13:00 to 14:00) are shown in Table 4.
Table 4 shows that the fitting results of SDM models are significantly better than other models, and the descending order of fit goodness order of the SDM model in three cases is shown as Case 3, Case 2, and Case 1. Specifically speaking, in Case 1, the adjusted R2 of the SDM model is 0.725, which is 0.087 better than that in Case 1 (0.712), and log-likelihood and AIC (−3265.376 and 6867.753) increased and decreased to some extent, respectively, compared with Case 1 (−3312.376 and 6972.307). In Case 3, the adjusted R2 of the SDM model is 0.774, an improvement of 0.062 compared with Case 1, and log-likelihood and AIC (−3265.376 and 6867.753) also increased and decreased to a certain extent separately compared with Case 1. From the above analysis results, the orientation-order indicator does have a certain impact on the online car-hailing travel behavior, and the fitting degree of the single consideration of the orientation-order indicator (Case3) is higher than that of the model considering both the orientation-order indicator and road network density (Case2). It indicates that road network density is not the key influencing factor of online car-hailing travel behavior in this study, and more attention should be paid to the orientation-order indicator of neighborhood and road network connectivity. In other words, the higher the road grid is, the better its connectivity will be, thus attracting more DiDi travel volume.
5.1.2. Spatial Heterogeneity Optimal Model Selection
In this part, GWR and MGWR models are, respectively, used for comparative analysis of spatial heterogeneity, and the optimal bandwidth (judged according to AICc value, and adaptive bandwidth is selected) and spatial kernel function (Gaussian kernel function with universality is selected) are used for modeling analysis [35]. Table 5 implies the comparison results of the GWR and MGWR model of the boarding behavior of DiDi vehicles in the peak period.
From the analysis of Table 5, the fitting results of the MGWR models are better than the GWR model. And the descending order of fit goodness of the SDM model in three cases is shown as Case 3, Case 2, and Case 1. In detail, the adjusted R2 of the MGWR model in Case 2 is 0.797, which is 0.028 improved compared with Case 1, and the AICc and RSS (431.561 and 53.976) are both reduced to a certain extent compared with Case 1 (432.487 and 54.385). In Case 3, the adjusted R2 of the MGWR model is 0.826, which is 0.057 higher than that in Case 1, and the AICc and RSS (429.567 and 51.245) are reduced to a certain extent compared with Case 1. Based on the above analysis results, we can get the conclusion consistent with the optimal spatial correlation model; that is, the fitting result of the model (Case 3), which only considers the orientation-order indicator between the road network density and the orientation-order indicator, is the best. From the perspective of spatial heterogeneity, this study also verifies that the orientation-order indicator has a certain influence on online car-hailing travel behavior and further indicates that it is reasonable to consider the orientation-order indicator as one of the representative factors of neighborhood design.
5.2. Optimal Model Estimation
According to the selection of optimal model in Section 5.1, the SDM model and MGWR model of Case 3 (selecting the orientation-order indicator as one of the independent variables) have the highest fitting degree. Therefore, this part estimates and analyzes the spatial correlation and spatial heterogeneity based on the optimal model so as to quantitatively analyze the impact of each variable on the built environment, especially the impact of the orientation-order indicator.
5.2.1. Spatial Correlation Analysis-SDM Model Estimation
The model parameter estimation results of the peak period obtained based on the SDM model are shown in Table 6. It can be seen that the bus station POI, residential district POI, catering service POI, shopping service POI, corporate business, land-use mix, and orientation order have significant positive correlations with the generation of DiDi travel volume. That is, the greater the development intensity of the traffic zone, the greater the intensity of people’s travel activities and the higher the demand for DiDi travel. The larger the orientation-order indicator is, the higher the grid orientation of the city streets is, and the streets will be extended orderly in all orientations, which is generally reflected in the city center with the relatively perfect road network, so it will attract more traffic volume of online car-hailing.
Specifically, the catering service POI, corporate business POI, and land-use mix in the surrounding traffic community have significant positive effects on the DiDi travel volume of this traffic zone. However, sports and leisure service POI and orientation order of the surrounding traffic community have noteworthy negative influences on the traffic volume of this traffic zone. The spatial lag variable coefficient ρ is positive and significant, and the traffic zone has a significant spillover effect on the adjacent traffic zones. Therefore, it is necessary to further test and calibrate the size of the spillover effect so as to reveal its spatial correlation impact on DiDi travel behavior. The analysis of direct effect, spillover effect, and total effect is shown in Table 7.
From the direct effect, it can be seen that the direct effects of the bus station, residential district, catering service, shopping service, corporate business, life service, and orientation order are all positive and significant; that is, the increase of these independent variables in a certain traffic zone will improve the volume of DiDi travel in the traffic zone.
Through the analysis of the spillover effect, the spillover effect of catering service, corporate business, and orientation order is positive. Specifically, the spillover effect of the first two variables is positive. A possible explanation is that the number of people having meals and going to and from companies in a certain traffic zone increases in the noon peak, which would drive the increase of DiDi travel demand in its surrounding traffic zones; the orientation order illustrates that the greater the value of the orientation order in this traffic zone, the greater the degree of uniform distribution of streets in all orientations and the higher the attraction value of the traffic zone to DiDi travel volume. Due to the market effect, the travel demand of the surrounding traffic zones would boost to a certain extent. However, the spillover effects of sports leisure service and land-use mix are negative. To be more specific, the increase of sports leisure service in a certain traffic area would not improve the DiDi travel demand of surrounding traffic zones, which may demonstrate that the sports leisure and entertainment facilities in the area are relatively sound, thus attracting a large number of DiDi trips, and then the demand of the adjacent communities could decrease correspondingly; the greater the land-use mix in a traffic zone, the stronger the intensity of travel activities in the region, which would undoubtedly reduce the travel demand of surrounding traffic zones.
From the total effect, it could be inferred that bus station, catering service, shopping service, corporate business, education and culture, and orientation order have noteworthy total effects on DiDi travel demand. Through the above analysis, the rationality of using the orientation-order indicator to characterize neighborhood design is demonstrated from the perspective of spatial correlation.
5.2.2. Spatial Heterogeneity Analysis-MGWR Model Estimation
The spatial distribution of significant independent variable coefficients during the peak period (13:00 to 14:00) calculated by the MGWR model, namely, the influencing mechanism of spatial heterogeneity, is visually analyzed, as indicated in Figure 5. The analysis of significant variables is as follows: as shown in Figure 5(a), the spatial distribution of bus station POI coefficient displays a wavy decreasing trend from northeast to southwest. This illustrates that the impact of bus station POI on DiDi travel is conspicuous in the suburbs, especially in the northeast outside the Second Ring Road; that is, DiDi travel is always generated near the bus station in this region. Therefore, it is speculated that the flexibility and convenience of DiDi vehicles may make it a supplement to bus travel, that is, the formation of “DiDi travel + Bus” travel mode. However, this travel mode is not obvious in the Second Ring Road, which may be due to the more convenient public transportation system in the city center. This finding is contrary to the conclusion of Tu et al. [36], which indicates that taxi travel does not supplement bus travel and may be related to the low income of some bus passengers. Although taxis and online car-hailing have different travel behaviors, their service modes are to provide door-to-door service for the public. The above two research studies are contradictory, perhaps because of the different study areas (one is in the United States; another is in China). Therefore, the accuracy of this inference should be verified additionally in future studies.

(a)

(b)

(c)

(d)

(e)

(f)

(g)

(h)
As shown in Figure 5(b), the residential district POI has a powerful positive impact on DiDi travel, and the impact is significant in the north and southwest corners of the region. The residential district POI is relatively dense in the southwest direction (Figure 4(b)), which is consistent with the impact intensity; however, the residential district POI is relatively sparse in the northern area (Figure 4(b)). The reason may be that the land-use types outside the Second Ring Road are relatively single and less abundant than those in the urban area, mainly for residential and undeveloped land use, and the DiDi travel mainly provides convenient services for residents in residential areas, so it shows a strong influence here. Although this finding is consistent with the conclusion of Zhuo et al. [37], that is, the contribution of residential density to taxi travel is positive, it suggests that residential density does not have spatial heterogeneity, which may be due to the situations of the different effects in different zones hidden in the study.
What is confusing is that, in Figure 5(c), the coefficient spatial distribution of catering service POI displays a wavy decreasing trend from northeast to southwest. It may be because the northeast of the study area is far away from the city center, and the public transportation is not very developed. Thus, most passengers choose DiDi vehicles to travel for meals here, while the city center is the opposite. The spatial distribution trend of coefficient of life service POI (as shown in Figure 5(f)) is similar to that of catering service POI.
The area where DiDi travel is most affected by shopping service POI is not the southwest region but the northern and north-central region (see Figure 5(d)). The southwest region (near Chunxi Road) does generate a large DiDi travel volume, but the region’s highly diverse land-use mix may be more than just attracted by shops. The shops in the northern part of the region are mainly concentrated around the residential district, which may be the reason why the shops have a greater influence in this area.
As can be seen from Figure 5(e), the coefficient of corporate business POI decreases from the northwest corner to the southeast corner, while the POI distribution is more in the southwest city center (Figure 4(e)). Especially in the southwest region, the density distribution of corporate business POI is relatively large, but the spatial impact intensity is small, which may be owing to the public transportation in the city center being more developed, and people often choose public transportation in the process of commuting. In addition, corporate business POI is densely distributed in a few areas of the northwest corner, which is also easy to explain. It indicates that there are some companies in the northwest corner because the bus system is relatively underdeveloped and people often use DiDi to achieve commuting.
It can be seen from Figure 5(g) that the land-use mix has a strong positive impact on online car-hailing travel, especially in the west; that is, areas with high land-use mix have more online car-hailing travel. However, this finding is different from studies by some researchers who believe that land-use mix is an effective strategy to reduce car travel by integrating adequate living facilities (such as office space, residential areas, shops, and other facilities) [38]. This study area near Chunxi Road is a famous business district in Chengdu, with a high level of land-use combination and numerous buildings such as shopping, entertainment, office, residence, and hospital. According to the research of Kockelman [39], in areas with a high land-use mix, the vehicle mileage of nonworking travel is less. However, the volume of DiDi travel is higher in this area. One possible explanation is that regions with high diversity are more attractive than other regions; that is, due to lower travel costs, it improves accessibility to multiple destinations and increases nonworking travel [40].
As shown in Figure 5(h), the orientation order has a strong positive influence on DiDi travel, and its change trend gradually decreases from the northeast corner to the southwest corner. Among them, the value in the northeast direction of the study area is large, indicating that the street orientation in this area is in good order, with a higher node degree, fewer dead-ends, and smaller circuity. At this time, the streets are evenly distributed in each orientation [15], which is in line with the structural form of the radial road network in Chengdu, and the street structure would increase the demand for DiDi travel accordingly. Moreover, the value is small in the southwest region. As the center of the radial road network, the crisscross road network leads to poor orientation order, and people tend to rely on the public transport system to travel. It can be seen that this indicator can well highlight the impact of neighborhood design on travel behavior, which is also consistent with the spatial correlation research results mentioned previously.
6. Conclusion
In this work, the road network orientation entropy indicator was introduced to analyze the spatiotemporal impact of the urban built environment on DiDi travel from two perspectives of spatial correlation and spatial heterogeneity. The main conclusions are as follows.
In this study, the orientation-order indicator was proposed as one of the characterization factors of the neighborhood design dimension in the “five Ds.” Through modeling and analysis in three cases (considering road network density, considering road network density and orientation-order indicator, and considering orientation-order indicator). It is concluded that the orientation-order indicator does have a certain influence on online car-hailing travel behavior. And the fitting degree of the model considering the single orientation-order indicator is higher than that considering both the orientation-order indicator and road network density. This indicates that road network density is not the main influencing factor of DiDi travel behavior in the case of this study, and more attention should be paid to the orientation order of neighborhoods and road network connectivity. That is, the greater the orientation-order indicator, the higher the grided degree of the road and the better its connectivity, thus attracting more DiDi trips. The possible reasons for the above results are that the highly grided road network will make it easier for car-hailing companies to operate efficiently, making their services more convenient and attractive or making it easier for customers and drivers to meet up and get on or off. And it may also be that the higher the connectivity of the road network, the lower the travel cost, which will attract more vehicle trips, and because of the convenience of online car-hailing, it is easier for travelers to choose. Moreover, through spatial correlation and heterogeneity analysis, the significant impact on DiDi travel behavior was further quantified. At the same time, the above conclusions also verify that it is reasonable and necessary to consider the orientation-order indicator as to the constituent elements of the urban built environment.
From the perspectives of spatial correlation and heterogeneity analysis, this study constructed spatial econometric models, respectively, and screened out the best fitting models SDM and MGWR. Based on the estimation results of the SDM model, the direct effect, spillover effect, and total effect are quantified. The results show that it is necessary to include the spatial spillover effect in the study of the impact of the built environment on online car-hailing demand based on the traffic zoning scale. The built environment of the traffic zone not only significantly affects the demand for online car-hailing wherein itself but also has significant impacts on the demand for ride-hailing in other traffic zones. Specifically, catering service, corporate business, and orientation-order indicators have significant positive impacts, while sports and leisure service and land-use mix have vital negative impacts. Based on the estimation results of the MGWR model, bus station, residential district, catering service, shopping service, corporate business, land-use mix, and orientation-order indicators are the main influencing factors of DiDi travel from the perspective of spatial variation coefficient analysis, among which residential district and orientation order have remarkable positive impacts.
This study analyzed the influence of various factors on DiDi travel, which can effectively help DiDi company to operate more effectively, upgrade the dynamic ride-matching algorithm, then improve the order receiving efficiency of drivers, and reduce the waiting time or detour time. DiDi travel may be a complementary mode for public transport, especially in the eastern region outside the Second Ring Road, and the development of this mode echoes the development of the interconnected multimodal transport system. Population density, sports and leisure service, scenic zone, public facility, medical and health service, government and accommodation, and other POIs have no significant impact on online car-hailing travel, and these variables have been proved to be important in other pieces of literature, indicating that perhaps when selecting appropriate variables, measures should be taken according to local conditions.
This study shows that the larger the orientation-order indicator has a significant impact on online car-hailing travel, implying that orientation order leads to many online car-hailing trips per capita or a high share of online car-hailing mode. At the same time, when the orientation order is high, it is possible to have a significant impact on the overall vehicle travel demand of the city, rather than just on online car-hailing. Therefore, travel data of other travel modes can be collected for verification in the future. In addition, the choice of travel behavior factors can be expanded in future research, including travel time, travel distance, and land-use mix characteristics of travel destinations, to reveal the close relationship between the built environment and DiDi travel.
Data Availability
The data used to support the findings of this study are available from the Gaia Open Dataset by DiDi. They can be obtained through the following website: https://outreach.didichuxing.com/research/opendata/.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Acknowledgments
This work was funded by the National Key Research and Development Program of China (Grant no. 2019YFB1600500) and the Projects Supported by Special Funds for Urban Construction (Grant no. SZJJ2019-22). The authors acknowledge the Gaia Open Dataset by DiDi for the support in data collection.