References: Chapters 3.7, 3.8, 4.2 Gardini A. (2007).
Let’s consider a generic uni-variate linear model with -regressors, namely  and suppose that we are interested in testing whereas the coefficient  is statistically different from a certain value  known at priori. In this case the null hypothesis can be equivalently represented using a more flexible matrix notation, i.e.   where  Hence, the linear restriction in Equation 17.1 can be written in matrix as 
 Multiple restrictions
Let’s consider multiple restrictions, i.e.   Let’s construct the vector for (1) (first column of ) and (2) (second column of ), i.e. 
 Restricted least squares
Proposition 17.1 ()
Let’s consider a linear model under the OLS assumptions and let’s consider a set of  linear hypothesis on the parameters of the model taking the form  Therefore, the optimization problem became restricted to the space of parameters that satisfies the conditions. More precisely, the space , that is a subset of the parameter space  where the linear constraint holds true, is defined as  Hence, the optimization problem in Equation 15.2 is restricted to only the parameters that satisfy the constraint.
Formally, the RLS estimator is the solution of the following minimization problem, i.e.   where  reads as in the OLS case (Equation 15.1). Notably, the analytic solution for  reads 
 
Proof. In order to solve the minimization problem in Equation 17.2, let’s construct the Lagrangian (Equation 31.11) as  Then, one obtain the following system of equation, i.e.   Let’s firstly solve explicitly  from (A), i.e.   and substitute the result in (B), i.e.   Hence, it is possible to explicit the Lagrange multipliers  as:  Finally, substituting  (Equation 17.5) in Equation 17.4 gives the optimal solution, i.e.   Note that if constraints hold true in the OLS estimate,  is true and therefore . Hence the RLS and OLS parameters are the same, i.e. .
 
 
 
 
Proposition 17.2 ()
The RLS estimator (Equation 17.3) is correct for the true parameter in population  if and only if the restrictions imposed by  are true in population, i.e. expected value is computed as:  where  only if the second component is zero, that happens only when  holds true and so .
 
Proof. Let’s apply the expected value on Equation 17.3 remembering that ,  and  are non-stochastic and that  is correct (Equation 15.8). Developing the computations gives:  Hence  is correct if and only if the restriction holds true in population, i.e. 
 
 
 
 
Proposition 17.3 ()
The variance of the RLS estimator (Equation 17.3)  It is interesting to note that the variance of the RLS estimator is always lower or equal than the variance of the OLS estimator, in fact 
 
Proof. In order to compute the variance of the RLS estimator, let’s apply the variance operator to @Equation 17.3, i.e.   Let’s denote with  the matrix  and let’s bring it outside the variance, i.e.   Moreover, substituting the expression of the variance of  (Equation 15.10) one obtain  Developing the matrix multiplication gives 
 
 
 
 
 A test for linear restrictions
Under the assumption of normality of the error terms, it is possible to derive a statistic to test the significance of the linear restrictions imposed by . Let’s test the validity of the hull hypothesis  against its alternative hypothesis , i.e.   Under normality, the OLS estimate are multivariate normal, thus applying the scaling property one obtain that the distribution under  is normal, i.e.   Thus, we can write the statistic 
If we work under , then the mean in Equation 17.7 is zero, i.e.
 Recalling the relation (Section 32.1.1) between the distribution of the quadratic form of a multivariate normal and the  distribution, then the test statistic  has  distribution, with  the number of restrictions.
Instead, under  the distribution of the linear restriction is exactly equal to Equation 17.7. Thus, applying property 4. in Section 32.1.1 one obtain that the test statistic is distributed as a non central , i.e.  where the non centrality parameter  reads  As general decision rule  is rejected if the statistic in Equation 17.9 is greater than the quantile with confidence level  of a  random variable. Such critic value, denoted with  represents the value for which the probability that a  is greater than  is exactly equal to , i.e.  
Gardini A., Costa M., Cavaliere G. 2007. 
Econometria, Volume Primo. FrancoAngeli. 
https://cris.unibo.it/handle/11585/119378.