Abstract
Proper description of the return distribution is crucial for investment practitioners. The underestimation of the tail risk may lead to severe consequences, even for assets with moderate fluctuations. However, many empirical studies found that the distribution tails of many financial assets drop off more slowly than the Gaussian distributions. Therefore, we intend to model and calibrate the heavy tails observed in financial fluctuations in this study. By maximizing the Varma entropy with value-at-risk and expected shortfall constraints, we obtain the probability distribution of stock return and observe that the tail of stock return distribution is a power law. Since the variance of the real stock portfolio may be a random variable, using the mean-VaR-ES constraints to maximize the Varma entropy effectively avoids the problem of assuming that the variance is a constant value under the traditional mean-variance constraint. Therefore, the deduced theoretical model would be more consistent with the real market. Using high-frequency data from China’s stock markets, we calibrate our theoretical model and give the concrete form of probability density distribution p(x) for different time intervals. The calibration results show that the tail of the stock return distribution is a power law with most of the power-law orders between −2 and −7. We prove the robustness of our results by calibrating the Varma entropy for S&P 500 of the USA stock market and different stock market indices in China’s A-share market. Our research’s findings not only offer a theoretical perspective for researchers but also give investing professionals a theoretical foundation on which to base their decisions.
1. Introduction
One of the most important conditions for every new financial investment is risk assessment. The underestimation of the tail risk may lead to severe consequences, even for assets with moderate fluctuations. Therefore, modeling the probability density distribution function accurately is crucial for financial investors.
The well-known Markowitz model assumes that the return on financial assets follows a Gaussian distribution. Nevertheless, it is observed that many stock indices do not follow the random walk [1] and the distribution tails of many financial assets drop off more slowly than the Gaussian distributions [2].
Various models have been proposed to improve the probability density distribution function of financial assets. Some scholars have proposed that empirical financial asset volatility should be a random quantity, not a constant one like in the Black–Scholes model [3]. For instance, ARCH models [4] use earlier time’s real error term to describe the current error term’s variance, and GARCH models [5] assume the variance of the error term to be an autoregressive moving average. These models are frequently used in the financial market to describe the time-varying and clustering volatility of financial assets.
Particularly, financial assets’ power-law tails have been widely researched [6]. Mandelbrot [7] was the first to notice the power-law characteristics of financial asset price distributions. He modeled the logarithmic price increments with the Lévy flight, a stable distribution with a heavy-tailed distribution. A stable distribution is a sort of distribution that occurs when n independent and identically distributed random variables are added together. Its characteristic is that its functional form remains unchanged for different values of n [8]. This process, according to Mandelbrot, might explain why the aggregated return distribution did not converge to the normal distribution predicted by the central limit theorem (CLT). Since then, a series of studies have been done analyzing theoretically or empirically the power-law tails of financial assets. Gopikrishnan et al. [9] used a sample of 40 million data in three major US stock markets to analyze the probability distribution of stock price changes. They discovered that the cumulative distribution had asymptotic power-law behavior with an exponent alpha of about 3, which is beyond the Lévy regime of 0 < alpha < 2. The authors developed the so-called “inverse-cubic law” as a result of the stock markets’ unexpected conduct. Plerou et al. [10] further studied the stock price fluctuations of individual companies for US stock markets systematically using stock price returns of time scales Δt from 5 minutes up to four years. Their observation showed that for time periods between 5 minutes and 16 days, a power-law decay may accurately depict the distribution’s tails. The exponent of power law is between 2.5 and 4. Gopikrishnan et al. [11] conducted a parallel research using stock price indices. They observed that the distributions for the S&P 500 index for Δt shorter than 4 days exhibit a power-law asymptotic behavior, with the exponent around 3, well outside the stable Lévy regime 0 < alpha < 2. NIKKEI index and Hang-Seng index also showed similar characteristics for time scales shorter than 4 days. Wątorek et al. [12] also identified the power-law behaviors of the tails using recent data on cryptocurrencies, exchange rates, and contracts for differences (CFDs). By contrasting their findings with those from past research, they concentrate on the fitted function parameters and how they evolve through time. Their findings showed that the so-called “inverse-cubic power law” was still a suitable global reference on time scales of up to a few minutes. In fact, comparable statistical traits were discovered by other researchers using information from different stock markets [13–19].
Therefore, for a stock market with a heavy tail in return distributions, it is necessary to use a new uncertainty measure that does not depend on a particular distribution. Entropy can be applied to measure uncertainty in probability theory. As a measure of diversity, the concept of entropy, especially the principles of maximum entropy, has been widely used in asset pricing and portfolio selection. The idea of entropy was initially used for portfolio selection by Philippatos and Wilson [20]. To optimize projected portfolio return and reduce portfolio entropy, they created the notions of individual entropy, joint entropy, and conditional entropy. The derived probability density distribution model matched the single-index and full covariance models. Since then, this field has achieved a lot of advancements. A mean-hybrid entropy model, for instance, was given by Xu et al. [21] to address the portfolio selection issue with asset risk resulting from both randomness and fuzziness. A mean-variance-skewness-entropy portfolio selection model (MVSEM) was created by Usta and Kantar [22] and outperformed conventional portfolio selection models in several out-of-sample experiments. To create a well-diversified asset portfolio, Jana et al. [23] incorporated an additional entropy objective function to the multiobjective portfolio-based model and the probabilistic mean value and variance of continuous distribution. A possibilistic mean-semivariance-entropy model including return, risk, transaction cost, and portfolio diversity level was developed by Zhang et al. [24] for multiperiod portfolio selection. Zhou et al. [25] introduced a portfolio selection model based on information entropy-incremental entropy-skewness (EESM), with information entropy measuring portfolio risk. They evaluated the performance of the EESM and found that the EESM performs well relative to traditional portfolio selection models. Huang [26] created two kinds of credibility-based fuzzy mean-entropy models: a straightforward approach for determining the average entropy front and a fuzzy average entropy model. By an information theoretical inference mechanism, Rödder et al. [27] established a novel theory for determining portfolio weights under maximum entropy and minimum relative entropy. Anton and Afloarei Nucu [28] investigated the credit default swap (CDS) and stock markets using the idea of entropy. Their empirical findings revealed a shift of risk entropy from the private to public sectors during the whole period and respectively. In summary, some studies used the entropy model to determine the portfolio weights, to identify the mean-entropic frontier, or to analyze portfolio selection with transaction costs. Some of them proposed different forms of entropy, such as the more generalized incremental entropy, hybrid entropy, or information entropy like the Renyi entropy [29, 30], please see [31] for a full review of the use of entropy in portfolio selection.
In this paper, we use Varma entropy constrained by value-at-risk and expected shortfall to build the probability density distribution of the financial assets return. We especially want to model and calibrate the heavy tails observed in the financial fluctuations. Since the tail region of stock price distribution follows the power law [32, 33], which is significantly different from the normal distribution, it does not make sense to discuss variance in these circumstances. In order to avoid the use of variance as a constraint when optimizing portfolio entropy, following [34, 35], we apply the value-at-risk and expected shortfall constraints instead of the variance constraint. In this way, the probability density distribution of return is more theoretically reasonable. It is found that the tail of stock return distribution is a power law by maximizing the Varma entropy with VaR and ES constraints. Then, we use high-frequency data of stock market indices from China to USA to calibrate the Varma entropy. With the real-world data, we find that the tail region of the stock market index return distribution is a power law with most of the power-law exponents between −2 and −7. With the increase of the time interval of return lag, the power-law exponents are mostly tending to decrease. This is consistent with the principle of maximum entropy as entropy tends to increase when the system approaches its equilibrium value. With the calculated value-at-risk and the expected shortfall values, we fix the four Lagrangian multipliers in the theoretically deduced probability density distribution and thus give the numerical form of our theoretical model for different stock market indices with different time intervals.
The rest of this paper is arranged as follows: In Section 2, using mean-VaR-ES constraints to maximize the Varma entropy, we deduce theoretically the probability density distribution of stock return. In Section 3, we use high-frequency data from China’s stock market index to calibrate our theoretical model. In Section 4, we do the robustness tests using data from different stock markets. Finally, Section 5 draws conclusions.
2. Varma Entropy and the Power Law
In the real world, value-at-risk (VaR), developed by J.P. Morgan in the late 1980s [36], is a statistic in risk management that quantifies the risk of an asset portfolio into a numerical value. It describes the possible maximum loss over a certain period of time within a fixed confidence level. The value-at-risk (VaR) level K describes the greatest possible losses over a specific time frame with a level of confidence.
The concept of expected shortfall (ES) was first introduced by Rappoport and later developed by Artzner et al. [37, 38]. Expected shortfall is defined as the conditional expectation of loss when the loss exceeds the VaR threshold K, and it gauges how much one can lose on average in states beyond the selected level of VaR K. ES is derived by averaging all returns in the distribution that are less than the value-at-risk (VaR) level K during a certain time period with a degree of confidence:where represents the probability density distribution of the return and K represents the preselected VaR level.
The two-parameter Varma entropy [39] with continuous probability density distribution is defined as follows:with the normalization constraint:
Given the mean value of the portfolio , the global mean constraint takes the following form:
We set K as the preselected VaR level and define the tail probability as
The expected shortfall may be stated as follows:
The expected shortfall and tail probability may be formally rewritten as follows:
In order to maximize the Varma entropy of equation (3), we construct the Lagrangian of the portfolio as follows:
The Lagrangian will be maximized when its functional variation with regard to the unidentified probability density distribution is zero:
Solving the above formula and using the mean-VaR-ES constraints, we may derive the probability density distribution as follows:
In accordance with the mean-VaR-ES requirements, we may rewrite the probability density distribution as follows:where , , , and , and
The unity requirement (4) of the probability density function may be expressed as follows using the probability density distribution under the above mean-VaR-ES constraints:
If Varma entropy’s two parameters satisfy , we can integrate out equation (14) as follows:
Using the probability density distribution with the mean-VaR-ES restrictions (13), we can rewrite the mean constraint (5) as follows:
If Varma entropy’s two parameters satisfy , we can have
We may write the tail probability constraint (8) by using the probability density distribution under the mean-VaR-ES requirements (13) as follows:
If Varma entropy’s two parameters satisfy , we have
By making use of equation (13), the expected shortfall constraint (9) can be expressed as follows:
If Varma entropy’s two parameters satisfy , we have
In order to build up all of the algebraic equations (15), (17), (19), and (21) simultaneously, we restricted the specified parameters of the Varma entropy as follows:
Setting:
The limiting condition (22) gives
Therefore, with mean-VaR-ES constraints may be altered to read as follows:which shows that the probability density distribution with mean-VaR-ES constraints is a power law.
Utilizing the above equation (25), the distribution parameters’ constraint (15) becomes
Utilizing the equation (25), the distribution parameters’ constraint (17) becomes
Utilizing the equation (25), the distribution parameters’ constraint (19) can be written as follows:
Utilizing the equation (25), the distribution parameters’ constraint (21) can be written as follows:
It can be seen that if the Varma entropy’s parameters and mean-VaR-ES constraints are given, then we can use the equations (26)–(29) to jointly determine , , , and in the probability density distribution in equation (25).
3. Calibrating Varma Entropy
In the theoretical part, we have theoretically deduced that the tail region of the return distribution is a power-law distribution by maximizing Varma information entropy with mean-VaR-ES constraints. In this section, we will use real-world data to calibrate our theoretical model.
3.1. Data
For the purpose of risk management, studying the stock market index has a higher significance than studying individual stocks. The unsystematic risk of individual stocks can be hedged away by appropriate diversified investment. However, the systemic risk caused by the fluctuation of the stock market index cannot be reduced or eliminated through decentralized investment, since the systematic risk has an overall impact on the whole market. Therefore, in this paper, we choose the stock market index to study the characteristics of the return distribution of the stock market.
Given that China has become the second largest economy in the world and is the largest emerging economy [40], we use the stock market index from China to calibrate the model. In order to reflect the overall situation of China’s stock market and in line with previous research, the Shanghai and Shenzhen 300 index (CSI 300 index) is utilized. The CSI 300 index is a capitalization-weighted stock market index reflecting the stock markets of Shanghai and Shenzhen, which was jointly released by China Securities Index Co., Ltd. and Shanghai and Shenzhen stock exchanges in April 2005. The stock scope of the CSI 300 index is comprehensively determined according to the market value and tradable market value of the stock, which has nothing to do with the industry of the company. The CSI 300 index covers most of the circulating stocks of Shanghai and Shenzhen stock markets and comprehensively covers large blue chip companies and small- and medium-sized companies. It is the stock market index used by most scholars to study China’s stock market.
3.2. Market Return
The data we use are intraday high-frequency data. We have downloaded the high-frequency stock index with a data frequency of 1 minute using the Wind database and calculated the logarithmic rate of return for the stock market index using the following equation:where represents the stock market index at time and represents the stock market index at time . In this paper, we study the stock return of short-time interval. We choose the time interval Δt = 10, 20, 30, 40, 50, 60, 70, and 80 minutes, respectively, to calculate the stock price return of the corresponding time interval. 10 minutes are close to the minimum time interval for completing a transaction. Our research is mainly based on risk management rather than high-frequency trading, so it is of little significance to study smaller time intervals.
In Table 1, we list the descriptive statistics of our sample, including the sample size, the mean value, the standard deviation, and the minimum and maximum of the data. The sample period starts from January 2018 to the end of 2018. It can be seen that the stock market may indeed fluctuate greatly within a short period of time. For Δt = 10 minutes, the largest decline of the CSI 300 index reached 1.56%. In 80 minutes, the largest decline of the CSI 300 index reached 3.61%. It can be seen that the research on the stock return of short-time interval and the effective management of the risk of intraday fluctuation in the stock market are of great significance in the field of risk management.
In Table 2, we use skewness, kurtosis, Jarque–Bera test, Kolmogorov–Smirnov test, and Anderson–Darling test, respectively, to test whether the return distribution obeys the normal distribution. It can be seen from Table 2 that the skewness of the sample data is above 0, and there is positive bias; the sample data have leptokurtosis; Jarque–Bera test, Kolmogorov–Smirnov test, and Anderson–Darling test all significantly reject the original assumption that the sample data obey the normal distribution. That is, the return distribution of short-time interval does not obey the normal distribution.
3.3. Power Law and Calibrating Varma Entropy
Now, we analyze the tail region of the return distribution using data from the CSI 300 index. We first use the log-log diagram to exhibit the power-law relationship between the rate of return Z and the probability distribution P(Z). When there is a power-law relationship between the rate of return Z and the probability distribution P(Z), log Z and log P(Z) should have a linear relationship. As shown in Figure 1, the log-log plot of the CSI 300 index directly reflects the linear relationship between and . That is, the tail region of the short-time interval return distribution obeys the power-law distribution.
Following [12], we also transformed the return into the normalized form according towhere is the time interval, is the mean of and is the standard deviation of . The log-log plot of the probability distribution of normalized return with different time intervals for CSI300 is shown in Figure 2.
We fit the tail region of the return distribution of CSI 300 index data using MATLAB. The power-law exponent and the R2 are exhibited in Table 3. As can be seen in Table 3, the power-law exponents are between −7 and −2 with very high R2. This indicates that the power-law distribution can describe the tail region of the short-time interval return distribution well, which is consistent with the conclusion drawn from the log-log graph. With increasing the return lags (from 10 to 80 min), the distribution tail power-law exponents are mostly expected to decrease.
We calculate the value-at-risk and the expected shortfall of CSI300 within 5% confidence level. Solving equations (26)–(29) jointly, we fix the four Lagrangian multipliers , , , and in the probability density distribution under the mean-VaR-ES constraints (25). The numerical results of the four Lagrangian multipliers are shown in Table 3.
Having obtained the Lagrangian multipliers, we can write the numerical form of probability density distribution for different stock market indexes with different time intervals.
For ∆t = 10 min, the probability density distribution of CSI300 is
For ∆t = 20 min, the probability density distribution of CSI300 is
For ∆t = 30 min, the probability density distribution of CSI300 is
For ∆t = 40 min, the probability density distribution of CSI300 is
For ∆t = 50 min, the probability density distribution of CSI300 is
For ∆t = 60 min, the probability density distribution of CSI300 is
For ∆t = 70 min, the probability density distribution of CSI300 is
For ∆t = 80 min, the probability density distribution of CSI300 is
As can be seen in Table 2 (second column), with the increase of time lags (from 10 to 80 minutes), the kurtosis values of CSI 300 index data decrease as they should. This corresponds to the decrease of tail thickness as the time lags increase. From Table 3 column two, we can see that the power-law exponents tend to decrease with the increase of time lags (from 10 to 80 min). We note that the rate of return for the stock market index is always smaller than 1 () and in this areas for . For clearer exhibition of the dependence relationship of the tail thickness on the tail exponent, please see Figure 3. Thus, with the increase of the time lags (from 10 to 80 min), the distribution becomes less leptokurtic, and the tail exponent should decrease. This is consistent with our empirical results using the CSI 300 index data as shown in Tables 2 and 3 where the tail exponent decreases from 4.55 to 2.38. Ultimately, for the sufficiently large time lags (decades ago, it was many weeks; nowadays, due to much faster information transmission in the contemporary world, it is likely to take much less time), it approaches the normal distribution as consistent with the central limit theorem (CLT).
This effect can also be interpreted in terms of the Varma entropy. Varma entropy is proper to :
As is set equal to , we have
Therefore, Varma entropy is proper to :where is the distribution tail power-law exponent of the probability density distribution as in equation (25).
The limiting condition gives ; thus, when becomes smaller, becomes smaller. As , smaller exponent () leads to higher value for . In other words, when power-law exponent decreases with the increase of return lag ∆t, Varma entropy becomes larger. When Varma entropy reaches its maximum value, the system reaches its equilibrium as a stable distribution. This is consistent with the principle of maximum entropy as entropy tends to increase when the system approaches its equilibrium value.
4. Robustness Tests
In this section, we apply our model to the S&P 500 of the USA stock market and different stock price indices in China’s A-share market for robustness tests.
4.1. Robustness Tests of S&P 500 Index
In order to test whether the conclusions we draw are still robust for a mature financial market, we use the high-frequency data of S&P 500 in 2018 to calibrate our theoretical model. The results are exhibited in Table 4.
As can be seen in Table 4, the tail region of the return distribution is a power-law distribution with a very high R2, which indicates that the power-law distribution can describe the tail region of the short-time interval return distribution well. Most of the power-law orders are between −2 and −7. When the time interval increases, the power-law exponents are most likely to decrease. With the calculated value-at-risk and expected shortfall, we numerically solve the four Lagrangian multipliers , , , and and exhibit them in Table 4. In this way, we have used Varma entropy with VaR and ES constraints to find the numerical form of probability density distribution for high-frequency data of S&P 500.
For ∆t = 10 min, the probability density distribution of the S&P 500 index is
For ∆t = 20 min, the probability density distribution of the S&P 500 index is
For ∆t = 30 min, the probability density distribution of the S&P 500 index is
For ∆t = 40 min, the probability density distribution of the S&P 500 index is
For ∆t = 50 min, the probability density distribution of the S&P 500 index is
For ∆t = 60 min, the probability density distribution of the S&P 500 index is
For ∆t = 70 min, the probability density distribution of the S&P 500 index is
For ∆t = 80 min, the probability density distribution of the S&P 500 index is
4.2. Robustness Tests of the Shanghai Composite Index
We also calibrate our theoretical model using different stock price indices in China’s A-share market, namely, the Shanghai Composite index. The results are exhibited in Table 5.
As can be seen in Table 5, the tail region of the return distribution is a power-law distribution with very high R2, which indicates that the power-law distribution can describe the tail region of the short-time interval return distribution well. Most of the power-law orders are between −2 and −7. When the time interval increases, the power-law exponents are most likely to decrease. We also exhibit the numerical results of the four Lagrangian multipliers , , , and . With the solved parameters, we can write the numerical form of probability density distribution for high-frequency data of the Shanghai Composite index.
Thus, for ∆t = 10 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 20 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 30 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 40 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 50 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 60 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 70 min, the probability density distribution of the Shanghai Composite index is
For ∆t = 80 min, the probability density distribution of the Shanghai Composite index is
Overall, by empirical analysis, it is proved that the tail region of the return distribution is a power-law distribution. By maximizing mean-VaR-ES constraints, we can give the concrete numerical form of the probability density distribution of high-frequency stock market indexes, which are more theoretically reasonable and consistent with real-world characteristics. This conclusion is robust for data of different stock price indices in China’s A-share market and for mature capital in developed countries represented by the S&P 500 of the United States.
5. Conclusions
It is crucial for financial investors to assess properly the risk of any prospective investment. Therefore, modeling accurately the probability density distribution function of financial assets is a vital issue in the field of risk management.
Many scholars have proved empirically that the distribution of return in stock markets does not obey Gaussian distribution [33] and many stock indices do not follow the random walk [1]. The distributions of stock price returns drop off more slowly than the normal distributions [32]. For a stock market with a heavy tail in return distributions, it is necessary to use a new uncertainty measure that does not depend on a particular distribution. Therefore, we would use entropy, which can be applied to measure uncertainty in probability theory [31], to model the non-Gaussian distribution.
By maximizing the Varma entropy with mean-VaR-ES constraints, we obtained the probability distribution of stock return in the tail region and deduced theoretically that the tail of stock return distribution is a power law. Since the volatility of the real stock portfolio may be stochastic, using the mean-VaR-ES constraints to maximize the Varma entropy effectively avoids the problem of assuming that the variance is a constant value under the traditional mean-variance constraint. In this way, the deduced probability density distribution of return is more theoretically reasonable and more consistent with the real market.
Using high-frequency data from China’s stock markets, we calibrate our theoretical model and give the concrete form of probability density distribution p(x) for different time intervals. We find that the tail region of the stock market index return distribution is a power law with most of the power-law orders between −2 and −7. With the increase of the time interval of return lag, the power-law exponents mostly tend to decrease. This is consistent with the central limit theorem as with the increase of the time lags, the distribution becomes less leptokurtic, and for the sufficiently large time lags, it approaches the normal distribution. With the increase of the time interval of return lag, the power-law exponents mostly tend to decrease. This is consistent with the principle of maximum entropy as entropy tends to increase when the system approaches its equilibrium value. With the calculated mean value of the return, the value-at-risk, and the expected shortfall, we calculate the four parameters , , , and in the probability density distribution under the mean-VaR-ES constraints and thus give the numerical form of probability density distribution for different stock market indices with different time intervals. We use different stock price indices in China’s A-share market and stock market index from the mature capital market represented by the S&P 500 of the United States to calibrate the Varma entropy. The results show that our conclusions are robust for different stock markets.
Two groups may benefit from our results, namely, theoretical researchers and investment practitioners. For theoretical researchers, our model gives a complementary insight into what leads to the heavy tails observed in financial fluctuations. Besides, we also give a more theoretical explanation to the phenomenon that with the increase of the time interval of return lag, the power-law exponents mostly tend to decrease. Our research results can also benefit investment practitioners with a decision-making basis. However, further theoretical and empirical analyses are still needed. Further studies can be carried out by the following directions. First, studies may be carried out to further explore the deep meaning of the power-law parameter q and find more economic explanation for the mechanism leading to the power-law distribution of stock price fluctuations on small time scales. Second, researchers can also extend the idea with the maximum entropy principle to derive distribution for other financial assets and calibrate the model using real-world data.
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
Disclosure
The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Authors’ Contributions
C.L. performed the formal analysis of the investigation, the methodology, and the software. C.L. and C.C. wrote the first draft of the paper and reviewed and edited the paper. All authors have read and agreed to the published version of the manuscript.
Acknowledgments
This research was supported by the China Postdoctoral Science Foundation (Grant no. 2021M700398) and the Fundamental Research Funds for the Central Universities (Grant no. FRF-TP-22-059A1).