25.4 Linear Least Squares
Octave also supports linear least squares minimization. That is, Octave can find the parameter b such that the model y = x*b fits data (x,y) as well as possible, assuming zero-mean Gaussian noise. If the noise is assumed to be isotropic, the problem can be solved using the ‘\’ or ‘/’ operators, or the ols function. In the general case where the noise is assumed to be anisotropic, the gls function is needed.
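As a minimal sketch of the isotropic case, the following fits a straight line with the ‘\’ operator and repeats the fit with ‘/’ on the transposed problem. The data and the variable names (t, x, y, b, b_row) are made up for illustration; the data are noise-free so the result is easy to check.

```octave
# Fit y = b(1) + b(2)*t by least squares.
t = (0:4)';
x = [ones(5, 1), t];   # design matrix: intercept column and slope column
y = 1 + 2*t;           # exact line with intercept 1 and slope 2

b = x \ y;             # column vector, approximately [1; 2]
b_row = y' / x';       # same fit on the transposed problem, approximately [1, 2]
```

The ‘/’ form is convenient when observations are stored as columns rather than rows.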
- : [beta, sigma, r] = ols (y, x)
-
Ordinary least squares (OLS) estimation.
OLS applies to the multivariate model y = x*b + e where y is a t-by-p matrix, x is a t-by-k matrix, b is a k-by-p matrix, and e is a t-by-p matrix.
Each row of y is a p-variate observation in which each column represents a variable. Likewise, the rows of x represent k-variate observations or possibly designed values. Furthermore, the collection of observations x must be of adequate rank, k, otherwise b cannot be uniquely estimated.
The observation errors, e, are assumed to originate from an underlying p-variate distribution with zero mean and p-by-p covariance matrix s, both constant conditioned on x. Furthermore, the matrix s is constant with respect to each observation such that mean (e) = 0 and cov (vec (e)) = kron (s, I). (For cases that don’t meet these criteria, such as autocorrelated errors, see generalized least squares, gls, for more efficient estimations.)
The return values beta, sigma, and r are defined as follows.
- beta
-
The OLS estimator for matrix b. beta is calculated directly via inv (x'*x) * x' * y if the matrix x'*x is of full rank. Otherwise, beta = pinv (x) * y where pinv (x) denotes the pseudoinverse of x.
- sigma
-
The OLS estimator for the matrix s,
sigma = (y-x*beta)' * (y-x*beta) / (t-rank(x))
- r
The matrix of OLS residuals, r = y - x*beta.
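The three return values can be illustrated with a small example. This is a sketch with made-up data: a single constant regressor (k = 1) and one response variable (p = 1), for which the OLS estimate reduces to the sample mean and sigma to the sample variance.

```octave
# t = 4 observations, k = 1 regressor (a constant), p = 1 variable.
x = ones (4, 1);
y = [1; 2; 3; 6];

[beta, sigma, r] = ols (y, x);

# beta  is approximately 3 (the mean of y)
# r     is approximately [-2; -1; 0; 3]
# sigma is approximately 14/3, i.e. r'*r / (t - rank (x)) = 14 / (4 - 1)
```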
- : [beta, v, r] = gls (y, x, o)
-
Generalized least squares (GLS) model.
Perform a generalized least squares estimation for the multivariate model y = x*b + e where y is a t-by-p matrix, x is a t-by-k matrix, b is a k-by-p matrix, and e is a t-by-p matrix.
Each row of y is a p-variate observation in which each column represents a variable. Likewise, the rows of x represent k-variate observations or possibly designed values. Furthermore, the collection of observations x must be of adequate rank, k, otherwise b cannot be uniquely estimated.
The observation errors, e, are assumed to originate from an underlying p-variate distribution with zero mean but possibly heteroscedastic observations. That is, in general, mean (e) = 0 and cov (vec (e)) = (s^2)*o in which s is a scalar and o is a t*p-by-t*p matrix.
The return values beta, v, and r are defined as follows.
- beta
-
The GLS estimator for matrix b.
- v
-
The GLS estimator for scalar s^2.
- r
The matrix of GLS residuals, r = y - x*beta.
See also: ols.
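A minimal sketch of a heteroscedastic fit follows. The data are made up, and o is assumed to be diagonal, meaning the last two observations have four times the error variance of the first two. With a constant regressor, the GLS estimate is then a weighted mean with weights proportional to 1 ./ diag (o).

```octave
# t = 4 observations, k = 1 regressor (a constant), p = 1 variable.
x = ones (4, 1);
y = [2; 3; 6; 14];
o = diag ([1, 1, 4, 4]);   # t*p-by-t*p error covariance structure

[beta, v, r] = gls (y, x, o);

# Weighted mean: (2 + 3 + 6/4 + 14/4) / (1 + 1 + 1/4 + 1/4) = 10 / 2.5
# beta is approximately 4
# r    is approximately [-2; -1; 2; 10]
```

For comparison, ols (y, x) on the same data returns the unweighted mean, 6.25, because it treats all observations as equally reliable.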
- : x = lsqnonneg (c, d)
- : x = lsqnonneg (c, d, x0)
- : x = lsqnonneg (c, d, x0, options)
- : [x, resnorm] = lsqnonneg (…)
- : [x, resnorm, residual] = lsqnonneg (…)
- : [x, resnorm, residual, exitflag] = lsqnonneg (…)
- : [x, resnorm, residual, exitflag, output] = lsqnonneg (…)
- : [x, resnorm, residual, exitflag, output, lambda] = lsqnonneg (…)
-
Minimize norm (c*x - d) subject to x >= 0.
c and d must be real matrices.
x0 is an optional initial guess for the solution x.
options is an options structure to change the behavior of the algorithm (see optimset).
lsqnonneg recognizes these options: "MaxIter", "TolX".
Outputs:
- resnorm
-
The squared 2-norm of the residual:
norm (c*x-d)^2
- residual
-
The residual:
d-c*x
- exitflag
-
An indicator of convergence. 0 indicates that the iteration count was exceeded, and therefore convergence was not reached; >0 indicates that the algorithm converged. (The algorithm is stable and will converge given enough iterations.)
- output
-
A structure with two fields:
- "algorithm": The algorithm used ("nnls")
- "iterations": The number of iterations taken.
- lambda
Undocumented output
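The effect of the constraint can be seen on a small made-up problem whose unconstrained least squares solution, [1; -1], has a negative component. This is only a sketch; the values of c and d are chosen for illustration.

```octave
c = [1, 0; 0, 1];
d = [1; -1];

[x, resnorm, residual, exitflag] = lsqnonneg (c, d);

# x        is approximately [1; 0]; the negative component is clamped to 0
# resnorm  is approximately 1, i.e. norm (c*x - d)^2
# residual is approximately [0; -1], i.e. d - c*x
# exitflag is positive, indicating convergence
```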
- : x = lscov (A, b)
- : x = lscov (A, b, V)
- : x = lscov (A, b, V, alg)
- : [x, stdx, mse, S] = lscov (…)
-
Compute a generalized linear least squares fit.
Estimate x under the model b = Ax + w, where the noise w is assumed to follow a normal distribution with covariance matrix sigma^2 * V.
If the size of the coefficient matrix A is n-by-p, the size of the vector/array of constant terms b must be n-by-k.
The optional input argument V may be an n-element vector of positive weights (inverse variances), or an n-by-n symmetric positive semi-definite matrix representing the covariance of b. If V is not supplied, the ordinary least squares solution is returned.
The alg input argument, which indicates the solution method to use, is currently ignored.
Besides the least-squares estimate matrix x (p-by-k), the function also returns stdx (p-by-k), the error standard deviation of estimated x; mse (k-by-1), the estimated data error covariance scale factors (sigma^2); and S (p-by-p, or p-by-p-by-k if k > 1), the error covariance of x.
Reference: Golub and Van Loan (1996), Matrix Computations (3rd Ed.), Johns Hopkins, Section 5.6.3
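The two forms of V follow different conventions: a vector holds weights (inverse variances), while a matrix holds the covariance itself. The sketch below uses made-up data and a constant regressor to show that a weight vector w and the covariance matrix diag (1 ./ w) describe the same problem. The names w, V, x_ols, x_w, and x_V are illustrative only.

```octave
A = ones (4, 1);
b = [2; 3; 6; 14];

x_ols = lscov (A, b);            # no V: ordinary least squares, approximately 6.25

w = [1; 1; 0.25; 0.25];          # weights, i.e. inverse variances
x_w = lscov (A, b, w);           # approximately 4

V = diag ([1, 1, 4, 4]);         # the same information as a covariance matrix
x_V = lscov (A, b, V);           # approximately 4
```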
- : optimset ()
- : options = optimset ()
- : options = optimset (par, val, …)
- : options = optimset (old, par, val, …)
- : options = optimset (old, new)
-
Create options structure for optimization functions.
When called without any input or output arguments, optimset prints a list of all valid optimization parameters.
When called with one output and no inputs, return an options structure with all valid option parameters initialized to [].
When called with a list of parameter/value pairs, return an options structure with only the named parameters initialized.
When the first input is an existing options structure old, the values are updated from either the par/val list or from the options structure new.
Valid parameters are:
- AutoScaling
- ComplexEqn
- Display
-
Request verbose display of results from optimizations. Values are:
- "off" [default]: No display.
- "iter": Display intermediate results for every loop iteration.
- "final": Display the result of the final loop iteration.
- "notify": Display the result of the final loop iteration if the function has failed to converge.
- FinDiffType
- FunValCheck
-
When enabled, display an error if the objective function returns an invalid value (a complex number, NaN, or Inf). Must be set to "on" or "off" [default]. Note: the functions fzero and fminbnd correctly handle Inf values and only complex values or NaN will cause an error in this case.
- GradObj
-
When set to "on", the function to be minimized must return a second argument which is the gradient, or first derivative, of the function at the point x. If set to "off" [default], the gradient is computed via finite differences.
- Jacobian
-
When set to "on", the function to be minimized must return a second argument which is the Jacobian, or first derivative, of the function at the point x. If set to "off" [default], the Jacobian is computed via finite differences.
- MaxFunEvals
-
Maximum number of function evaluations before optimization stops. Must be a positive integer.
- MaxIter
-
Maximum number of algorithm iterations before optimization stops. Must be a positive integer.
- OutputFcn
-
A user-defined function executed once per algorithm iteration.
- TolFun
-
Termination criterion for the function output. If the difference in the calculated objective function between one algorithm iteration and the next is less than TolFun, the optimization stops. Must be a positive scalar.
- TolX
-
Termination criterion for the function input. If the difference in x, the current search point, between one algorithm iteration and the next is less than TolX, the optimization stops. Must be a positive scalar.
- TypicalX
- Updating
See also: optimget.
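The calling forms that take inputs can be sketched as follows. The option values are arbitrary and chosen only for illustration.

```octave
# Parameter/value pairs: only the named parameters are initialized.
opts = optimset ("MaxIter", 200, "TolX", 1e-8);

# Existing structure first: update it from a further par/val list.
opts = optimset (opts, "Display", "off");

# Existing structure first: update it from a second options structure.
new = optimset ("MaxIter", 500);
opts = optimset (opts, new);     # MaxIter is now 500; TolX and Display are kept
```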
- : optimget (options, parname)
- : optimget (options, parname, default)
-
Return the specific option parname from the optimization options structure options created by optimset.
If parname is not defined then return default if supplied, otherwise return an empty matrix.
See also: optimset.
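The three possible outcomes can be sketched with a structure in which only "TolX" has been set. The values and variable names are illustrative.

```octave
opts = optimset ("TolX", 1e-6);

tolx   = optimget (opts, "TolX");           # defined: returns 1e-6
maxit  = optimget (opts, "MaxIter", 500);   # not defined: returns the default, 500
tolfun = optimget (opts, "TolFun");         # not defined, no default: returns []
```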
© 1996–2020 John W. Eaton
Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies.
Permission is granted to copy and distribute modified versions of this manual under the conditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one.
Permission is granted to copy and distribute translations of this manual into another language, under the above conditions for modified versions.
https://octave.org/doc/v6.3.0/Linear-Least-Squares.html