There are two main types of data sources of income distributions in China: household survey data and grouped data. Household survey data are typically available for isolated years and individual provinces. In comparison, aggregate or grouped data are typically available more frequently and usually have national coverage. In principle, grouped data allow investigation of the change of inequality over longer, continuous periods of time, and the identification of patterns of inequality across broader regions. Nevertheless, a major limitation of grouped data is that only mean (average) income and income shares of quintile or decile groups of the population are reported. Directly using grouped data reported in this format is equivalent to assuming that all individuals in a quintile or decile group have the same income. This potentially distorts the estimate of inequality within each region. The aim of this paper is to apply an improved econometric method designed to use grouped data to study income inequality in China. A generalized beta distribution is employed to model income inequality in China at various levels and periods of time. The generalized beta distribution is more general and flexible than the lognormal distribution that has been used in past research, and also relaxes the assumption of a uniform distribution of income within quintile and decile groups of populations. The paper studies the nature and extent of inequality in rural and urban China over the period 1978 to 2002. Income inequality in the whole of China is then modeled using a mixture of province‐specific distributions. The estimated results are used to study the trends in national inequality, and to discuss the empirical findings in the light of economic reforms, regional policies, and globalization of the Chinese economy.
The estimation of income distributions is important for assessing income inequality and poverty and for making comparisons of inequality and poverty over time, countries and regions, as well as before and after changes in taxation and transfer policies. Distributions have been estimated both parametrically and nonparametrically. Parametric estimation is convenient because it facilitates subsequent inferences about inequality and poverty measures and lends itself to further analysis such as the combining of regional distributions into a national distribution. Nonparametric estimation makes inferences more difficult, but it does not place what are sometimes unreasonable restrictions on the nature of the distribution. By estimating a mixture of gamma distributions, in this paper we attempt to benefit from the advantages of parametric estimation without suffering the disadvantage of inflexibility. Using a sample of Canadian income data, we use Bayesian inference to estimate gamma mixtures with two and three components. We describe how to obtain a predictive density and distribution function for income and illustrate the flexibility of the mixture. Posterior densities for Lorenz curve ordinates and the Gini coefficient are obtained.
The Lorenz curve relates the cumulative proportion of income to the cumulative proportion of population. When a particular functional form of the Lorenz curve is specified it is typically estimated by linear or nonlinear least squares, estimation techniques that have good properties when the error terms are independently and normally distributed. Observations on cumulative proportions are clearly neither independent nor normally distributed. This paper proposes and applies a new methodology that recognizes the cumulative proportional nature of the Lorenz curve data by assuming that the income proportions are distributed as a Dirichlet distribution. Five Lorenz-curve specifications are used to demonstrate the technique. Maximum likelihood estimates under the Dirichlet distribution assumption provide better-fitting Lorenz curves than nonlinear least squares and another estimation technique that has appeared in the literature.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.