Lorenz curve
Template:Short description Script error: No such module "Unsubst".
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth. It was developed by Max O. Lorenz in 1905 for representing inequality of the wealth distribution.
The curve is a graph showing the proportion of overall income or wealth assumed by the bottom Template:Mvar% of the people, although this is not rigorously true for a finite population (see below). It is often used to represent income distribution, where it shows for the bottom Template:Mvar% of households, what percentage (Template:Mvar%) of the total income they have. The percentage of households is plotted on the Template:Mvar-axis, the percentage of income on the Template:Mvar-axis. It can also be used to show distribution of assets. In such use, many economists consider it to be a measure of social inequality.
The concept is useful in describing inequality among the size of individuals in ecology[1] and in studies of biodiversity, where the cumulative proportion of species is plotted against the cumulative proportion of individuals.[2] It is also useful in business modeling: e.g., in consumer finance, to measure the actual percentage Template:Mvar% of delinquencies attributable to the Template:Mvar% of people with worst risk scores. Lorenz curves were also applied to epidemiology and public health, e.g., to measure pandemic inequality as the distribution of national cumulative incidence (y%) generated by the population residing in areas (x%) ranked with respect to their local epidemic attack rate.[3]
Explanation
Template:Lorenz curve global income 2011.svg Points on the Lorenz curve represent statements such as, "the bottom 20% of all households have 10% of the total income."
A perfectly equal income distribution would be one in which every person has the same income. In this case, the bottom Template:Mvar% of society would always have Template:Mvar% of the income. This can be depicted by the straight line y = xScript error: No such module "Check for unknown parameters".; called the "line of perfect equality."
By contrast, a perfectly unequal distribution would be one in which one person has all the income and everyone else has none. In that case, the curve would be at y = 0%Script error: No such module "Check for unknown parameters". for all x < 100%Script error: No such module "Check for unknown parameters"., and y = 100%Script error: No such module "Check for unknown parameters". when x = 100%Script error: No such module "Check for unknown parameters".. This curve is called the "line of perfect inequality."
The Gini coefficient is the ratio of the area between the line of perfect equality and the observed Lorenz curve to the area between the line of perfect equality and the line of perfect inequality. The higher the coefficient, the more unequal the distribution is. In the diagram on the right, this is given by the ratio A / (A+B)Script error: No such module "Check for unknown parameters"., where Template:Mvar and Template:Mvar are the areas of regions as marked in the diagram.
Definition and calculation
The Lorenz curve is a probability plot (a P–P plot) comparing the distribution of a variable against a hypothetical uniform distribution of that variable. It can usually be represented by a function L(F)Script error: No such module "Check for unknown parameters"., where Template:Mvar, the cumulative portion of the population, is represented by the horizontal axis, and Template:Mvar, the cumulative portion of the total wealth or income, is represented by the vertical axis.
The curve Template:Mvar need not be a smoothly increasing function of Template:Mvar, For wealth distributions there may be oligarchies or people with negative wealth for instance.[4]
For a discrete distribution of Y given by values y1Script error: No such module "Check for unknown parameters"., ..., ynScript error: No such module "Check for unknown parameters". in non-decreasing order (yi ≤ yi+1)Script error: No such module "Check for unknown parameters". and their probabilities the Lorenz curve is the continuous piecewise linear function connecting the points (Fi, Li)Script error: No such module "Check for unknown parameters"., i = 0 to nScript error: No such module "Check for unknown parameters"., where F0 = 0Script error: No such module "Check for unknown parameters"., L0 = 0Script error: No such module "Check for unknown parameters"., and for i = 1 to nScript error: No such module "Check for unknown parameters".:
When all yiScript error: No such module "Check for unknown parameters". are equally probable with probabilities 1 / nScript error: No such module "Check for unknown parameters". this simplifies to
For a continuous distribution with the probability density function Template:Mvar and the cumulative distribution function Template:Mvar, the Lorenz curve Template:Mvar is given by: where denotes the average. The Lorenz curve L(F)Script error: No such module "Check for unknown parameters". may then be plotted as a function parametric in Template:Mvar: L(x)Script error: No such module "Check for unknown parameters". vs. F(x)Script error: No such module "Check for unknown parameters".. In other contexts, the quantity computed here is known as the length biased (or size biased) distribution; it also has an important role in renewal theory.
Alternatively, for a cumulative distribution function F(x)Script error: No such module "Check for unknown parameters". with inverse x(F)Script error: No such module "Check for unknown parameters"., the Lorenz curve L(F)Script error: No such module "Check for unknown parameters". is directly given by:
The inverse x(F)Script error: No such module "Check for unknown parameters". may not exist because the cumulative distribution function has intervals of constant values. However, the previous formula can still apply by generalizing the definition of x(F)Script error: No such module "Check for unknown parameters".: where infScript error: No such module "Check for unknown parameters". is the infimum.
For an example of a Lorenz curve, see Pareto distribution.
Properties
A Lorenz curve always starts at (0,0) and ends at (1,1).
The Lorenz curve is not defined if the mean of the probability distribution is zero or infinite.
The Lorenz curve for a probability distribution is a continuous function. However, Lorenz curves representing discontinuous functions can be constructed as the limit of Lorenz curves of probability distributions, the line of perfect inequality being an example.
The information in a Lorenz curve may be summarized by the Gini coefficient and the Lorenz asymmetry coefficient.[1]
The Lorenz curve cannot rise above the line of perfect equality.
A Lorenz curve that never falls beneath a second Lorenz curve and at least once runs above it, has Lorenz dominance over the second one.[5]
If the variable being measured cannot take negative values, the Lorenz curve:
- cannot sink below the line of perfect inequality,
- is increasing.
Note however that a Lorenz curve for net worth would start out by going negative due to the fact that some people have a negative net worth because of debt.
The Lorenz curve is invariant under positive scaling. If XScript error: No such module "Check for unknown parameters". is a random variable, for any positive number Template:Mvar the random variable Template:Mvar has the same Lorenz curve as XScript error: No such module "Check for unknown parameters"..
The Lorenz curve is flipped twice, once about F = 0.5Script error: No such module "Check for unknown parameters". and once about L = 0.5Script error: No such module "Check for unknown parameters"., by negation. If XScript error: No such module "Check for unknown parameters". is a random variable with Lorenz curve LX(F)Script error: No such module "Check for unknown parameters"., then −XScript error: No such module "Check for unknown parameters". has the Lorenz curve:
- L−X = 1 − LX(1 − F)Script error: No such module "Check for unknown parameters".
The Lorenz curve is changed by translations so that the equality gap F − L(F)Script error: No such module "Check for unknown parameters". changes in proportion to the ratio of the original and translated means. If XScript error: No such module "Check for unknown parameters". is a random variable with a Lorenz curve LX(F)Script error: No such module "Check for unknown parameters". and mean μXScript error: No such module "Check for unknown parameters"., then for any constant c ≠ −μXScript error: No such module "Check for unknown parameters"., X + cScript error: No such module "Check for unknown parameters". has a Lorenz curve defined by:
For a cumulative distribution function F(x)Script error: No such module "Check for unknown parameters". with mean Template:Mvar and (generalized) inverse x(F)Script error: No such module "Check for unknown parameters"., then for any Template:Mvar with 0 < F < 1Script error: No such module "Check for unknown parameters". :
- If the Lorenz curve is differentiable:
- If the Lorenz curve is twice differentiable, then the probability density function f(x)Script error: No such module "Check for unknown parameters". exists at that point and:
- If L(F)Script error: No such module "Check for unknown parameters". is continuously differentiable, then the tangent of L(F)Script error: No such module "Check for unknown parameters". is parallel to the line of perfect equality at the point F(μ)Script error: No such module "Check for unknown parameters".. This is also the point at which the equality gap F − L(F)Script error: No such module "Check for unknown parameters"., the vertical distance between the Lorenz curve and the line of perfect equality, is greatest. The size of the gap is equal to half of the relative mean absolute deviation:
Examples
Both L(F) = FTemplate:IsupScript error: No such module "Check for unknown parameters". and L(F) = 1 − (1 − F)Template:IsupScript error: No such module "Check for unknown parameters"., for P ≥ 1Script error: No such module "Check for unknown parameters"., are well-known functional forms for the Lorenz curve.[6]
See also
- Distribution (economics)
- Distribution of wealth
- Welfare economics
- Income inequality metrics
- Gini coefficient
- Hoover index (a.k.a. Robin Hood index)
- ROC analysis
- Social welfare (political science)
- Economic inequality
- Zipf's law
- Pareto distribution
- Mean deviation
- The Elephant Curve
References
<templatestyles src="Reflist/styles.css" />
Script error: No such module "Check for unknown parameters".
Further reading
- Script error: No such module "Citation/CS1".
- Script error: No such module "Citation/CS1".
- Script error: No such module "citation/CS1".
- Script error: No such module "citation/CS1".
External links
- WIID Template:Webarchive: World Income Inequality Database, a source of information on inequality, collected by WIDER (World Institute for Development Economics Research, part of United Nations University)
- glcurve: Stata module to plot Lorenz curve (type "findit glcurve" or "ssc install glcurve" in Stata prompt to install)
- Free add-on to STATA to compute inequality and poverty measures
- Free Online Software (Calculator) computes the Gini Coefficient, plots the Lorenz curve, and computes many other measures of concentration for any dataset
- Free Calculator: Online and downloadable scripts (Python and Lua) for Atkinson, Gini, and Hoover inequalities
- Users of the R data analysis software can install the "ineq" package which allows for computation of a variety of inequality indices including Gini, Atkinson, Theil.
- A MATLAB Inequality Package Template:Webarchive, including code for computing Gini, Atkinson, Theil indexes and for plotting the Lorenz Curve. Many examples are available.
- A complete handout about the Lorenz curve including various applications, including an Excel spreadsheet graphing Lorenz curves and calculating Gini coefficients as well as coefficients of variation.
- LORENZ 3.0 is a Mathematica notebook which draw sample Lorenz curves and calculates Gini coefficients and Lorenz asymmetry coefficients from data in an Excel sheet.