Site icon Bamboos Consulting

Portfolio analysis statistical considerations

Portfolio analysis is essentially a statistical technique. However, because the ‘‘true’’ population parameters for the input data (expected returns, variances, and covariances) are unobservable, sample statistics must be estimated. Thus, the efficient portfolios generated by portfolio analysis are no better than the statistical input data on which they are based.

Three methods are used for generating the statistical inputs: 

A key assumption of the single-index market model is that the covariance between the error terms of any two securities is zero, thus the covariance of returns between any two securities arises primarily through their relationship with a common market factor. However, if some relevant omitted variables exist, the covariance of returns between any two securities arises through their common relationships with more than a single common factor: for instance the growth rate in the index of the nation’s industrial production. That is, security returns could be assumed to be related to both a market factor and an industrial growth factor:

The two-index model can be constructed so that the two independent variables, r and g , are orthogonal by removing the market effect from the growth rate in the index of industrial production. In such case, their covariance is zero. Furthermore, such indexes are assumed to be uncorrelated with the error term. The orthogonalization procedure for removing each factor’s effect on the other factor can be extended to any number of indexes, and essentially consists of the substitution of each index with the residuals of its regression against the other indexes because, by assumption, such residuals are uncorrelated with the other indexes.

As with individual securities, the variance of returns from a portfolio can be decomposed

The unsystematic risk will approach zero as the number of individual securities in the portfolio, n, increases. Thus, the total risk of a well-diversified portfolio converges to its systematic risk for large n. 

Efficient portfolios can be derived from any return-generating processes by minimizing a Lagrangian objective function:

where α’s elements are the intercept terms of the single-index model; β’s elements are beta coefficients for the securities:

and the matrix:

is a variance-covariance matrix of the variances for the error terms and the market. However, the covariance matrix must be a positive definite covariance matrix for the first-order conditions to be necessary and sufficient for a global optimum. Below I briefly explain why:

The non-stationarity of mean and risk

In a dynamic economic environment, firms’ investment and financing decisions will affect the systematic risk, expected return, and standard deviation of returns. For example, Boness, Chen, and Jatusipitak (1974) find parameters shift after a capital structure change and Christie (1982) demonstrates that the standard deviation of a stock’s return is an increasing function of both financial and operating leverage. Beaver’s (1968) realized returns and Patell and Wolfson’s (1981) ex-ante assessments support an increase in the variance of stock returns around the announcement of quarterly earnings. Macro information shocks may also shift the level of interest rates and market risk premiums. Therefore, it is not surprising for the expected return and the risk dimensions of individual stocks and portfolios to change through time. Nonstationarity of mean and risk could cause the skewness, highly peaked, and longer-tailed (leptokurtic) characteristics in the empirical unconditional distribution of asset returns.

Several statistical distribution models can capture the skewness and leptokurtic features of asset returns:

In such circumstances when the mean and variance parameters are not sufficient to describe the investment environment, other portfolio selection criteria are used: the geometric mean return (GMR) criterion, the safety-first criterion, value at risk (VaR), copulas, semivariance, stochastic dominance, and the mean-variance-skewness criterion.

Returns Distribution Fitting

Substantially, estimating the best stable Paretian distribution is rather daunting. A common method fits a number of distributions to sample data, compare the goodness of fit with a chi-squared value, and test for significant difference between observed and fitted distribution with a Kolmogorov-Smirnov test. The chi-squared value bins data into 50 bins based on percentiles so that each bin contains approximately an equal number of values. For each fitted distribution, the expected count of values in each bin is predicted from the distribution. The chi-squared value is the sum of the relative squared error for each bin, such that:

chi-squared = sum ((observed – predicted)2) / predicted)

The cumulative sum of observed and predicted frequency across the bin range used are employed for the observed and predicted parameters above. The lower the chi-squared value the better the fit. The Kolmogorov-Smirnov test assumes that data has been standardized (demeaned and then divided by the standard deviation). A value greater than 0.05 means that the fitted distribution is not significantly different from the observed distribution of the data.

However, when working with large datasets of real-world data often no theoretical distribution fits the data perfectly or, even worst, the theoretical distribution does not match the process generating the data; indeed, statistical distributions are theoretical models of real-world data that closely match the underlying generating process. In practice, the theoretical model and the test must both agree.

Accordingly to the code above, the Weibull distribution formula is selected to model the log-returns of the S&P500 over the last 18 years. The theoretical model is used to measure the mean-time of failure of a piece of equipment in the production process. Thus, we can accept the model to predict how many failures will occur in the next quarter, six months, or year for the purpose of forecasting tail risk measures as VaR.

References

E. Elton, M. Gruber, S. Brown, W. Goetzmann. (2007). Modern Portfolio Theory and Investment Analysis, 11th edition.

Exit mobile version