Quantile_regression

Quantile regression

Statistics concept

Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional mean of the response variable across values of the predictor variables, quantile regression estimates the conditional median (or other quantiles) of the response variable. Quantile regression is an extension of linear regression used when the conditions of linear regression are not met.

Background: quantiles

Quantile regression expresses the conditional quantiles of a dependent variable as a linear function of the explanatory variables. Crucial to the practicality of quantile regression is that the quantiles can be expressed as the solution of a minimization problem, as we will show in this section before discussing conditional quantiles in the next section.

Quantile of a random variable

Let $Y$ be a real-valued random variable with cumulative distribution function $F_{Y}(y)=P(Y\leq y)$ . The $\tau$ th quantile of Y is given by

q_{Y}(\tau )=F_{Y}^{-1}(\tau )=\inf \left\{y:F_{Y}(y)\geq \tau \right\}

where $\tau \in (0,1).$

Define the loss function as $\rho _{\tau }(m)=m(\tau -\mathbb {I} _{(m<0)})$ , where $\mathbb {I}$ is an indicator function. A specific quantile can be found by minimizing the expected loss of $Y-u$ with respect to $u$ :^[1](pp. 5–6):

q_{Y}(\tau )={\underset {u}{\mbox{arg min}}}E(\rho _{\tau }(Y-u))={\underset {u}{\mbox{arg min}}}{\biggl \{}(\tau -1)\int _{-\infty }^{u}(y-u)dF_{Y}(y)+\tau \int _{u}^{\infty }(y-u)dF_{Y}(y){\biggr \}}.

This can be shown by computing the derivative of the expected loss with respect to $u$ via an application of the Leibniz integral rule, setting it to 0, and letting $q_{\tau }$ be the solution of

0=(1-\tau )\int _{-\infty }^{q_{\tau }}dF_{Y}(y)-\tau \int _{q_{\tau }}^{\infty }dF_{Y}(y).

This equation reduces to

0=F_{Y}(q_{\tau })-\tau ,

and then to

F_{Y}(q_{\tau })=\tau .

If the solution $q_{\tau }$ is not unique, then we have to take the smallest such solution to obtain the $\tau$ th quantile of the random variable Y.

Example

Let $Y$ be a discrete random variable that takes values $y_{i}=i$ with $i=1,2,\dots ,9$ with equal probabilities. The task is to find the median of Y, and hence the value $\tau =0.5$ is chosen. Then the expected loss of $Y-u$ is

L(u)=E(\rho _{\tau }(Y-u))={\frac {(\tau -1)}{9}}\sum _{y_{i}<u}

(y_{i}-u)

+{\frac {\tau }{9}}\sum _{y_{i}\geq u}

(y_{i}-u)

={\frac {0.5}{9}}{\Bigl (}

-

\sum _{y_{i}<u}

(y_{i}-u)

+\sum _{y_{i}\geq u}

(y_{i}-u)

{\Bigr )}.

Since ${0.5/9}$ is a constant, it can be taken out of the expected loss function (this is only true if $\tau =0.5$ ). Then, at u=3,

L(3)\propto \sum _{i=1}^{2}

-(i-3)

+\sum _{i=3}^{9}

(i-3)

=[(2+1)+(0+1+2+...+6)]=24.

Suppose that u is increased by 1 unit. Then the expected loss will be changed by $(3)-(6)=-3$ on changing u to 4. If, u=5, the expected loss is

L(5)\propto \sum _{i=1}^{4}i+\sum _{i=0}^{4}i=20,

and any change in u will increase the expected loss. Thus u=5 is the median. The Table below shows the expected loss (divided by ${0.5/9}$ ) for different values of u.

u	1	2	3	4	5	6	7	8	9
Expected loss	36	29	24	21	20	21	24	29	36

Intuition

Consider $\tau =0.5$ and let q be an initial guess for $q_{\tau }$ . The expected loss evaluated at q is

L(q)=-0.5\int _{-\infty }^{q}(y-q)dF_{Y}(y)+0.5\int _{q}^{\infty }(y-q)dF_{Y}(y).

In order to minimize the expected loss, we move the value of q a little bit to see whether the expected loss will rise or fall. Suppose we increase q by 1 unit. Then the change of expected loss would be

\int _{-\infty }^{q}1dF_{Y}(y)-\int _{q}^{\infty }1dF_{Y}(y).

The first term of the equation is $F_{Y}(q)$ and second term of the equation is $1-F_{Y}(q)$ . Therefore, the change of expected loss function is negative if and only if $F_{Y}(q)<0.5$ , that is if and only if q is smaller than the median. Similarly, if we reduce q by 1 unit, the change of expected loss function is negative if and only if q is larger than the median.

In order to minimize the expected loss function, we would increase (decrease) L(q) if q is smaller (larger) than the median, until q reaches the median. The idea behind the minimization is to count the number of points (weighted with the density) that are larger or smaller than q and then move q to a point where q is larger than $100\tau$ % of the points.

Sample quantile

The $\tau$ sample quantile can be obtained by using an importance sampling estimate and solving the following minimization problem

{\hat {q}}_{\tau }={\underset {q\in \mathbb {R} }{\mbox{arg min}}}\sum _{i=1}^{n}\rho _{\tau }(y_{i}-q),

={\underset {q\in \mathbb {R} }{\mbox{arg min}}}\left[(1-\tau )\sum _{y_{i}<q}(y_{i}-q)+\tau \sum _{y_{i}\geq q}(y_{i}-q)\right]

,

where the function $\rho _{\tau }$ is the tilted absolute value function. The intuition is the same as for the population quantile.

Conditional quantile and quantile regression

The $\tau$ th conditional quantile of $Y$ given $X$ is the $\tau$ th quantile of the Conditional probability distribution of $Y$ given $X$ ,

Q_{Y|X}(\tau )=\inf \left\{y:F_{Y|X}(y)\geq \tau \right\}

.

We use a capital $Q$ to denote the conditional quantile to indicate that it is a random variable.

In quantile regression for the $\tau$ th quantile we make the assumption that the $\tau$ th conditional quantile is given as a linear function of the explanatory variables:

Q_{Y|X}(\tau )=X\beta _{\tau }

.

Given the distribution function of $Y$ , $\beta _{\tau }$ can be obtained by solving

\beta _{\tau }={\underset {\beta \in \mathbb {R} ^{k}}{\mbox{arg min}}}E(\rho _{\tau }(Y-X\beta )).

Solving the sample analog gives the estimator of $\beta$ .

{\hat {\beta _{\tau }}}={\underset {\beta \in \mathbb {R} ^{k}}{\mbox{arg min}}}\sum _{i=1}^{n}(\rho _{\tau }(Y_{i}-X_{i}\beta )).

Note that when $\tau =0.5$ , the loss function $\rho _{\tau }$ is proportional to the absolute value function, and thus median regression is the same as linear regression by least absolute deviations.

Computation of estimates for regression parameters

The mathematical forms arising from quantile regression are distinct from those arising in the method of least squares. The method of least squares leads to a consideration of problems in an inner product space, involving projection onto subspaces, and thus the problem of minimizing the squared errors can be reduced to a problem in numerical linear algebra. Quantile regression does not have this structure, and instead the minimization problem can be reformulated as a linear programming problem

{\underset {\beta ,u^{+},u^{-}\in \mathbb {R} ^{k}\times \mathbb {R} _{+}^{2n}}{\min }}\left\{\tau 1_{n}^{'}u^{+}+(1-\tau )1_{n}^{'}u^{-}|X\beta +u^{+}-u^{-}=Y\right\},

where

u_{j}^{+}=\max(u_{j},0)

,

u_{j}^{-}=-\min(u_{j},0).

Simplex methods^[1]^: 181 or interior point methods^[1]^: 190 can be applied to solve the linear programming problem.

Asymptotic properties

For $\tau \in (0,1)$ , under some regularity conditions, ${\hat {\beta }}_{\tau }$ is asymptotically normal:

{\sqrt {n}}({\hat {\beta }}_{\tau }-\beta _{\tau }){\overset {d}{\rightarrow }}N(0,\tau (1-\tau )D^{-1}\Omega _{x}D^{-1}),

where

D=E(f_{Y}(X\beta )XX^{\prime })

and

\Omega _{x}=E(X^{\prime }X).

Direct estimation of the asymptotic variance-covariance matrix is not always satisfactory. Inference for quantile regression parameters can be made with the regression rank-score tests or with the bootstrap methods.^[9]

Equivariance

See invariant estimator for background on invariance or see equivariance.

Scale equivariance

For any $a>0$ and $\tau \in [0,1]$

{\hat {\beta }}(\tau ;aY,X)=a{\hat {\beta }}(\tau ;Y,X),

{\displaystyle {\hat {\beta }}(\tau

Shift equivariance

For any $\gamma \in R^{k}$ and $\tau \in [0,1]$

{\hat {\beta }}(\tau ;Y+X\gamma ,X)={\hat {\beta }}(\tau ;Y,X)+\gamma .

Equivariance to reparameterization of design

Let $A$ be any $p\times p$ nonsingular matrix and $\tau \in [0,1]$

{\hat {\beta }}(\tau ;Y,XA)=A^{-1}{\hat {\beta }}(\tau ;Y,X).

Invariance to monotone transformations

If $h$ is a nondecreasing function on $\mathbb {R}$ , the following invariance property applies:

h(Q_{Y|X}(\tau ))\equiv Q_{h(Y)|X}(\tau ).

Example (1):

If $W=\exp(Y)$ and $Q_{Y|X}(\tau )=X\beta _{\tau }$ , then $Q_{W|X}(\tau )=\exp(X\beta _{\tau })$ . The mean regression does not have the same property since $\operatorname {E} (\ln(Y))\neq \ln(\operatorname {E} (Y)).$

Inference

Interpretation of the slope parameters

The linear model $Q_{Y|X}(\tau )=X\beta _{\tau }$ mis-specifies the true systematic relation $Q_{Y|X}(\tau )=f(X,\tau )$ when $f(\cdot ,\tau )$ is nonlinear. However, $Q_{Y|X}(\tau )=X\beta _{\tau }$ minimizes a weighted distanced to $f(X,\tau )$ among linear models.^[10] Furthermore, the slope parameters $\beta _{\tau }$ of the linear model can be interpreted as weighted averages of the derivatives $\nabla f(X,\tau )$ so that $\beta _{\tau }$ can be used for causal inference.^[11] Specifically, the hypothesis $H_{0}:\nabla f(x,\tau )=0$ for all $x$ implies the hypothesis $H_{0}:\beta _{\tau }=0$ , which can be tested using the estimator ${\hat {\beta _{\tau }}}$ and its limit distribution.

Goodness of fit

The goodness of fit for quantile regression for the $\tau$ quantile can be defined as:^[12] $R^{2}(\tau )=1-{\frac {{\hat {V}}_{\tau }}{{\tilde {V}}_{\tau }}},$ where ${\hat {V}}_{\tau }$ is the sum of squares of the conditional quantile, while ${\tilde {V}}_{\tau }$ is the sum of squares of the unconditional quantile.

Variants

Censored quantile regression

If the response variable is subject to censoring, the conditional mean is not identifiable without additional distributional assumptions, but the conditional quantile is often identifiable. For recent work on censored quantile regression, see: Portnoy^[21] and Wang and Wang^[22]

Example (2):

Let $Y^{c}=\max(0,Y)$ and $Q_{Y|X}=X\beta _{\tau }$ . Then $Q_{Y^{c}|X}(\tau )=\max(0,X\beta _{\tau })$ . This is the censored quantile regression model: estimated values can be obtained without making any distributional assumptions, but at the cost of computational difficulty,^[23] some of which can be avoided by using a simple three step censored quantile regression procedure as an approximation.^[24]

For random censoring on the response variables, the censored quantile regression of Portnoy (2003)^[21] provides consistent estimates of all identifiable quantile functions based on reweighting each censored point appropriately.

Censored quantile regression has close links to survival analysis.

{\displaystyle t} — Depiction of two Kaplan–Meier estimators for the survival probabilities $S(t)=1-F(t)$ of two patient groups as a function of time $t$ , where $F(t)$ is the distribution function of the deaths. The $\tau$ quantile of the deaths is $t_{\tau }=F^{-1}(\tau )$ , where $F^{-1}$ is the quantile function of the deaths. Censored quantile regression can be used to estimate these conditional quantiles individually, while survival analysis estimates the (conditional) survival function.

Share this article:

This article uses material from the Wikipedia article Quantile_regression, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

[Koenker2005-1] [1]
Koenker, Roger (2005). Quantile Regression. Cambridge University Press. pp. 146–7. ISBN 978-0-521-60827-5.

[2] [2]
Cade, Brian S.; Noon, Barry R. (2003). "A gentle introduction to quantile regression for ecologists" (PDF). Frontiers in Ecology and the Environment. 1 (8): 412–420. doi:10.2307/3868138. JSTOR 3868138.

[3] [3]
Wei, Y.; Pere, A.; Koenker, R.; He, X. (2006). "Quantile Regression Methods for Reference Growth Charts". Statistics in Medicine. 25 (8): 1369–1382. doi:10.1002/sim.2271. PMID 16143984. S2CID 7830193.

[4] [4]
Wei, Y.; He, X. (2006). "Conditional Growth Charts (with discussions)". Annals of Statistics. 34 (5): 2069–2097 and 2126–2131. arXiv:math/0702634. doi:10.1214/009053606000000623. S2CID 88516697.

[5] [5]
Stigler, S. (1984). "Boscovich, Simpson and a 1760 manuscript note on fitting a linear relation". Biometrika. 71 (3): 615–620. doi:10.1093/biomet/71.3.615.

[6] [6]
Koenker, Roger (2005). Quantile Regression. Cambridge: Cambridge University Press. pp. 2. ISBN 9780521845731.

[:0-7] [7]
Furno, Marilena; Vistocco, Domenico (2018). Quantile Regression: Estimation and Simulation. Hoboken, NJ: John Wiley & Sons. pp. xv. ISBN 9781119975281.

[8] [8]
Koenker, Roger (August 1998). "Galton, Edgeworth, Frisch, and prospects for quantile regression in economics" (PDF). UIUC.edu. Retrieved August 22, 2018.

[9] [9]
Kocherginsky, M.; He, X.; Mu, Y. (2005). "Practical Confidence Intervals for Regression Quantiles". Journal of Computational and Graphical Statistics. 14 (1): 41–55. doi:10.1198/106186005X27563. S2CID 120598656.

[10] [10]
Angrist, J.; Chernozhukov, V.; Fernandez-Val, I. (2006). "Quantile Regression under Misspecification, with an Application to the U.S. Wage Structure" (PDF). Econometrica. 74 (2): 539–563. doi:10.1111/j.1468-0262.2006.00671.x.

[11] [11]
Kato, R.; Sasaki, Y. (2017). "On Using Linear Quantile Regressions for Causal Inference". Econometric Theory. 33 (3): 664–690. doi:10.1017/S0266466616000177.

[12] [12]
Roger Koenker & José A. F. Machado (1999) Goodness of Fit and Related Inference Processes for Quantile Regression, Journal of the American Statistical Association, 94:448, 1296-1310, DOI: 10.1080/01621459.1999.10473882

[13] [13]
Kozumi, H.; Kobayashi, G. (2011). "Gibbs sampling methods for Bayesian quantile regression" (PDF). Journal of Statistical Computation and Simulation. 81 (11): 1565–1578. doi:10.1080/00949655.2010.496117. S2CID 44015988.

[14] [14]
Yang, Y.; Wang, H.X.; He, X. (2016). "Posterior Inference in Bayesian Quantile Regression with Asymmetric Laplace Likelihood". International Statistical Review. 84 (3): 327–344. doi:10.1111/insr.12114. hdl:2027.42/135059. S2CID 14947362.

[15] [15]
Yang, Y.; He, X. (2010). "Bayesian empirical likelihood for quantile regression". Annals of Statistics. 40 (2): 1102–1131. arXiv:1207.5378. doi:10.1214/12-AOS1005. S2CID 88519086.

[16] [16]
Steinwart, Ingo; Christmann, Andreas (2011). "Estimating conditional quantiles with the help of the pinball loss". Bernoulli. 17 (1). Bernoulli Society for Mathematical Statistics and Probability: 211–225. arXiv:1102.2101. doi:10.3150/10-BEJ267.

[17] [17]
Petneházi, Gábor (2019-08-21). "QCNN: Quantile Convolutional Neural Network". arXiv:1908.07978 [cs.LG].

[18] [18]
Rodrigues, Filipe; Pereira, Francisco C. (2018-08-27). "Beyond expectation: Deep joint mean and quantile regression for spatio-temporal problems". arXiv:1808.08798 [stat].

[19] [19]
Nonparametric Quantile Regression: Non-Crossing Constraints and Conformal Prediction by Wenlu Tang, Guohao Shen, Yuanyuan Lin, Jian Huang, https://arxiv.org/pdf/2210.10161.pdf

[20] [20]
Meinshausen, Nicolai (2006). "Quantile Regression Forests" (PDF). Journal of Machine Learning Research. 7 (6): 983–999.

[Portnoy2003-21] [21]
Portnoy, S. L. (2003). "Censored Regression Quantiles". Journal of the American Statistical Association. 98 (464): 1001–1012. doi:10.1198/016214503000000954. S2CID 120674851.

[22] [22]
Wang, H.; Wang, L. (2009). "Locally Weighted Censored Quantile Regression". Journal of the American Statistical Association. 104 (487): 1117–1128. CiteSeerX 10.1.1.504.796. doi:10.1198/jasa.2009.tm08230. S2CID 34494316.

[23] [23]
Powell, James L. (1986). "Censored Regression Quantiles". Journal of Econometrics. 32 (1): 143–155. doi:10.1016/0304-4076(86)90016-3.

[24] [24]
Chernozhukov, Victor; Hong, Han (2002). "Three-Step Censored Quantile Regression and Extramarital Affairs". J. Amer. Statist. Assoc. 97 (459): 872–882. doi:10.1198/016214502388618663. S2CID 1410755.

[25] [25]
Efficient Quantile Regression for Heteroscedastic Models by, Yoonsuh Jung, Yoonkyung Lee, Steven N. MacEachern, https://www.tandfonline.com/doi/abs/10.1080/00949655.2014.967244?journalCode=gscs20

[26] [26]
"quantreg(x,y,tau,order,Nboot) - File Exchange - MATLAB Central". www.mathworks.com. Retrieved 2016-02-01.

[27] [27]
"Gretl Command Reference" (PDF). April 2017.

[28] [28]
"quantreg: Quantile Regression". R Project. 2018-12-18.

[29] [29]
"gbm: Generalized Boosted Regression Models". R Project. 2019-01-14.

[30] [30]
"quantregForest: Quantile Regression Forests". R Project. 2017-12-19.

[31] [31]
"qrnn: Quantile Regression Neural Networks". R Project. 2018-06-26.

[32] [32]
"qgam: Smooth Additive Quantile Regression Models". R Project. 2019-05-23.

[33] [33]
"Quantile Regression Forests". Scikit-garden. Retrieved 3 January 2019.

[34] [34]
"Statsmodels: Quantile Regression". Statsmodels. Retrieved 15 November 2019.

[35] [35]
"An Introduction to Quantile Regression and the QUANTREG Procedure" (PDF). SAS Support.

[36] [36]
"The QUANTSELECT Procedure". SAS Support.

[37] [37]
"qreg — Quantile regression" (PDF). Stata Manual.

[38] [38]
Cameron, A. Colin; Trivedi, Pravin K. (2010). "Quantile Regression". Microeconometrics Using Stata (Revised ed.). College Station: Stata Press. pp. 211–234. ISBN 978-1-59718-073-3.

[39] [39]
"JohnLangford/vowpal_wabbit". GitHub. Retrieved 2016-07-09.

[40] [40]
"QuantileRegression.m". MathematicaForPrediction. Retrieved 3 January 2019.

[41] [41]
"QuantileRegression". Wolfram Function Repository. Retrieved 14 September 2022.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]