Least-Squares Using Normal Equations
The unconstrained least-squares problem is known in statistical literature as multiple linear regression. It uses the linear model

y = b₁x₁ + b₂x₂ + ⋯ + bₚxₚ + r.
Here, b₁, …, bₚ are the unknown parameters, x₁, …, xₚ are the independent (or explanatory) variables, y is the dependent (or response) variable, and r is the random error, having expected value E(r) = 0 and variance σ².
After making n observations of y and x₁, x₂, …, xₚ, this problem can be expressed as

y = Xb + r

where y is an n-vector, X is an n × p matrix, and r is an n-vector consisting of the unknown random errors satisfying E(r) = 0 and Cov(r) = E(rrᵀ) = σ²Iₙ.
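If you would like to follow along on a computer rather than the calculator, the sketch below shows one way to stack such observations into X and y. It uses Python with NumPy, and every data value is invented purely for illustration (the column of ones models a constant term):

```python
import numpy as np

# Five illustrative observations (n = 5) of p = 2 explanatory variables.
# The first column of ones models a constant term; all values are invented.
X = np.array([[1.0, 2.0],
              [1.0, 3.0],
              [1.0, 5.0],
              [1.0, 7.0],
              [1.0, 8.0]])
y = np.array([3.1, 4.9, 9.2, 13.1, 14.8])

n, p = X.shape   # n = 5 observations, p = 2 unknown parameters
```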
If the model is correct and XᵀX has an inverse, then the calculated least-squares solution

b̂ = (XᵀX)⁻¹Xᵀy
has the following properties:
• E(b̂) = b, so that b̂ is an unbiased estimator of b.

• Cov(b̂) = E((b̂ − b)(b̂ − b)ᵀ) = σ²(XᵀX)⁻¹, the covariance matrix of the estimator b̂.

• E(r̂) = 0, where r̂ = y − Xb̂ is the vector of residuals.
• E(||y − Xb̂||_F²) = (n − p)σ², so that σ̂² = ||r̂||_F² / (n − p) is an unbiased estimator for σ².

You can estimate Cov(b̂) by replacing σ² with σ̂².
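These quantities are straightforward to compute numerically. The following Python sketch (same invented data as before, not a program from this handbook) solves the normal equations and forms σ̂² and the estimated covariance matrix:

```python
import numpy as np

# Same invented data as in the previous sketch.
X = np.array([[1.0, 2.0], [1.0, 3.0], [1.0, 5.0], [1.0, 7.0], [1.0, 8.0]])
y = np.array([3.1, 4.9, 9.2, 13.1, 14.8])
n, p = X.shape

# Solve the normal equations (X^T X) b_hat = X^T y for b_hat.
b_hat = np.linalg.solve(X.T @ X, X.T @ y)

# Residual vector and the unbiased estimate of sigma^2.
r_hat = y - X @ b_hat
sigma2_hat = (r_hat @ r_hat) / (n - p)       # ||r_hat||_F^2 / (n - p)

# Estimated covariance of b_hat: sigma2_hat * (X^T X)^(-1).
cov_b_hat = sigma2_hat * np.linalg.inv(X.T @ X)

print(b_hat)        # least-squares estimates
print(sigma2_hat)   # estimate of the error variance
print(cov_b_hat)    # estimated covariance matrix of b_hat
```

Note that forming XᵀX explicitly squares the condition number of the problem, so when X is ill-conditioned, general-purpose routines such as numpy.linalg.lstsq are usually preferred in practice.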
The total sum of squares ||y||_F² can be partitioned according to

||y||_F² = yᵀy
         = (y − Xb̂ + Xb̂)ᵀ(y − Xb̂ + Xb̂)
         = (y − Xb̂)ᵀ(y − Xb̂) + 2b̂ᵀXᵀ(y − Xb̂) + (Xb̂)ᵀ(Xb̂)
         = ||y − Xb̂||_F² + ||Xb̂||_F²
         = Residual Sum of Squares + Regression Sum of Squares.

The cross term vanishes because b̂ satisfies the normal equations XᵀXb̂ = Xᵀy, so that Xᵀ(y − Xb̂) = 0.
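You can check this partition numerically; the sketch below (same invented data as the earlier sketches) recomputes the three sums of squares and confirms they agree up to rounding:

```python
import numpy as np

# Same invented data as in the earlier sketches.
X = np.array([[1.0, 2.0], [1.0, 3.0], [1.0, 5.0], [1.0, 7.0], [1.0, 8.0]])
y = np.array([3.1, 4.9, 9.2, 13.1, 14.8])
b_hat = np.linalg.solve(X.T @ X, X.T @ y)

total_ss      = y @ y                         # ||y||_F^2
residual_ss   = np.sum((y - X @ b_hat)**2)    # ||y - X b_hat||_F^2
regression_ss = np.sum((X @ b_hat)**2)        # ||X b_hat||_F^2

# Residual SS + Regression SS recovers the total up to rounding error.
print(total_ss, residual_ss + regression_ss)
```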
When the model is correct,

E(||Xb̂||_F²) = pσ² + ||Xb||_F².
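One way to see this expectation at work is by simulation. The sketch below assumes an arbitrary true b and σ (both invented for illustration), draws many samples y = Xb + r, and compares the average of ||Xb̂||_F² with pσ² + ||Xb||_F²:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[1.0, 2.0], [1.0, 3.0], [1.0, 5.0], [1.0, 7.0], [1.0, 8.0]])
b = np.array([1.0, 1.7])       # assumed "true" parameters (invented)
sigma = 0.5                    # assumed error standard deviation (invented)
n, p = X.shape

# Average ||X b_hat||_F^2 over many simulated samples y = X b + r.
trials = 20000
total = 0.0
for _ in range(trials):
    y = X @ b + rng.normal(0.0, sigma, size=n)
    b_hat = np.linalg.solve(X.T @ X, X.T @ y)
    total += np.sum((X @ b_hat)**2)

print(total / trials)                      # simulated E(||X b_hat||_F^2)
print(p * sigma**2 + np.sum((X @ b)**2))   # p*sigma^2 + ||X b||_F^2
```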