QR decomposition explained (2024)

In linear algebra, a QR decomposition, also known as a QR factorization or QU factorization, is a decomposition of a matrix A into a product A=QR of an orthonormal matrix Q and an upper triangular matrix R. QR decomposition is often used to solve the linear least squares (LLS) problem and is the basis for a particular eigenvalue algorithm, the QR algorithm.

Cases and definitions

Square matrix

Any real square matrix A may be decomposed as

A=QR,

where Q is an orthogonal matrix (its columns are orthogonal unit vectors meaning and R is an upper triangular matrix (also called right triangular matrix). If A is invertible, then the factorization is unique if we require the diagonal elements of R to be positive.

If A has n linearly independent columns, then the first n columns of Q form an orthonormal basis for the column space of A. More generally, the first k columns of Q form an orthonormal basis for the span of the first k columns of A for any .[1] The fact that any column k of A only depends on the first k columns of Q corresponds to the triangular form ofR.[1]

Rectangular matrix

More generally, we can factor a complex m×n matrix A, with, as the product of an m×m unitary matrix Q and an m×n upper triangular matrix R. As the bottom (mn) rows of an m×n upper triangular matrix consist entirely of zeroes, it is often useful to partition R, or both R and Q:

A=QR=Q\begin{bmatrix}R1\ 0\end{bmatrix}=\begin{bmatrix}Q1&Q2\end{bmatrix}\begin{bmatrix}R1\ 0\end{bmatrix}=Q1R1,

where R1 is an n×n upper triangular matrix, 0 is an zero matrix, Q1 is m×n, Q2 is, and Q1 and Q2 both have orthogonal columns.

call Q1R1 the thin QR factorization of A; Trefethen and Bau call this the reduced QR factorization.[1] If A is of full rank n and we require that the diagonal elements of R1 are positive then R1 and Q1 are unique, but in general Q2 is not. R1 is then equal to the upper triangular factor of the Cholesky decomposition of A A (=ATA if A is real).

QL, RQ and LQ decompositions

Analogously, we can define QL, RQ, and LQ decompositions, with L being a lower triangular matrix.

Computing the QR decomposition

There are several methods for actually computing the QR decomposition, such as the Gram–Schmidt process, Householder transformations, or Givens rotations. Each has a number of advantages and disadvantages.

Using the Gram–Schmidt process

\langlev,w\rangle=vsf{T}w

(or

\langlev,w\rangle=v\daggerw

for the complex case).

Define the projection:

\operatorname{proj}ua=

\left\langleu,a\right\rangle
\left\langleu,u\right\rangle

{u

}

then:

\begin{align}u1&=a1,&e1&=

u1
\|u1\|

\\u2&=a2-

\operatorname{proj}
u1

a2,&e2&=

u2
\|u2\|

\\u3&=a3-

\operatorname{proj}
u1

a3-

\operatorname{proj}
u2

a3,&e3&=

u3
\|u3\|

\\&  \vdots&&  \vdots\\uk&=ak-

k-1
\sum
j=1
\operatorname{proj}
uj

ak,&ek&=

uk
\|uk\|

\end{align}

We can now express the

ai

s over our newly computed orthonormal basis:

\begin{align}a1&=\left\langlee1,a1\right\ranglee1\\a2&=\left\langlee1,a2\right\ranglee1+\left\langlee2,a2\right\ranglee2\\a3&=\left\langlee1,a3\right\ranglee1+\left\langlee2,a3\right\ranglee2+\left\langlee3,a3\right\ranglee3\\&  \vdots\\ak&=

k
\sum
j=1

\left\langleej,ak\right\rangleej\end{align}

where This can be written in matrix form:

A=QR

where:

Q=\begin{bmatrix}e1&&en\end{bmatrix}

and

R=\begin{bmatrix}\langlee1,a1\rangle&\langlee1,a2\rangle&\langlee1,a3\rangle&&\langlee1,an\rangle\\0&\langlee2,a2\rangle&\langlee2,a3\rangle&&\langlee2,an\rangle\\0&0&\langlee3,a3\rangle&&\langlee3,an\rangle\\\vdots&\vdots&\vdots&\ddots&\vdots\\0&0&0&&\langleen,an\rangle\\\end{bmatrix}.

Example

Consider the decomposition of

A=\begin{bmatrix}12&-51&4\\6&167&-68\\-4&24&-41\end{bmatrix}.

Recall that an orthonormal matrix

Q

has the property

Then, we can calculate

Q

by means of Gram–Schmidt as follows:

\begin{align}U=\begin{bmatrix}u1&u2&u3\end{bmatrix}&=\begin{bmatrix}12&-69&-58/5\\6&158&6/5\\-4&30&-33\end{bmatrix};\\Q=\begin{bmatrix}

u1
\|u1\|

&

u2
\|u2\|

&

u3
\|u3\|

\end{bmatrix}&=\begin{bmatrix}6/7&-69/175&-58/175\\3/7&158/175&6/175\\-2/7&6/35&-33/35\end{bmatrix}.\end{align}

Thus, we have

\begin{align}Qsf{T}A&=Qsf{T}QR=R;\\R&=Qsf{T}A=\begin{bmatrix}14&21&-14\\0&175&-70\\0&0&35\end{bmatrix}.\end{align}

Relation to RQ decomposition

The RQ decomposition transforms a matrix A into the product of an upper triangular matrix R (also known as right-triangular) and an orthogonal matrix Q. The only difference from QR decomposition is the order of these matrices.

QR decomposition is Gram–Schmidt orthogonalization of columns of A, started from the first column.

RQ decomposition is Gram–Schmidt orthogonalization of rows of A, started from the last row.

Advantages and disadvantages

The Gram-Schmidt process is inherently numerically unstable. While the application of the projections has an appealing geometric analogy to orthogonalization, the orthogonalization itself is prone to numerical error. A significant advantage is the ease of implementation.

Using Householder reflections

A Householder reflection (or Householder transformation) is a transformation that takes a vector and reflects it about some plane or hyperplane. We can use this operation to calculate the QR factorization of an m-by-n matrix

A

with .

Q can be used to reflect a vector in such a way that all coordinates but one disappear.

Let

x

be an arbitrary real m-dimensional column vector of

A

such that

\|x\|=|\alpha|

for a scalar α. If the algorithm is implemented using floating-point arithmetic, then α should get the opposite sign as the k-th coordinate of where

xk

is to be the pivot coordinate after which all entries are 0 in matrix As final upper triangular form, to avoid loss of significance. In the complex case, set

\alpha=

i\argxk
-e

\|x\|

and substitute transposition by conjugate transposition in the construction of Q below.

Then, where

e1

is the vector [1 0 ⋯ 0]T, ||·|| is the Euclidean norm and

I

is an m×m identity matrix, set

\begin{align}u&=x+\alphae1,\\v&=

u
\|u\|

,\\Q&=I-2vvsf{T}.\end{align}

Or, if

A

is complex

Q=I-2vv\dagger.

Q

is an m-by-m Householder matrix, which is both symmetric and orthogonal (Hermitian and unitary in the complex case), and

Qx=\begin{bmatrix}\alpha\ 0\\vdots\ 0\end{bmatrix}.

This can be used to gradually transform an m-by-n matrix A to upper triangular form. First, we multiply A with the Householder matrix Q1 we obtain when we choose the first matrix column for x. This results in a matrix Q1A with zeros in the left column (except for the first row).

Q1A=\begin{bmatrix}\alpha1&\star&&\star\\0&&&\\\vdots&&A'&\\0&&&\end{bmatrix}

This can be repeated for A′ (obtained from Q1A by deleting the first row and first column), resulting in a Householder matrix Q2. Note that Q2 is smaller than Q1. Since we want it really to operate on Q1A instead of A′ we need to expand it to the upper left, filling in a 1, or in general:

Qk=\begin{bmatrix}Ik-1&0\\0&Qk'\end{bmatrix}.

After

t

iterations of this process,

R=QtQ2Q1A

is an upper triangular matrix. So, with

\begin{align}Qsf{T}&=QtQ2Q1,\\Q&=

sf{T}
Q
2
sf{T},\\
Q
t

&=Q1Q2Qt,\end{align}

A=QR

is a QR decomposition of

A

.

This method has greater numerical stability than the Gram–Schmidt method above.

The following table gives the number of operations in the k-th step of the QR-decomposition by the Householder transformation, assuming a square matrix with size n.

OperationNumber of operations in the k-th step
Multiplications

2(n-k+1)2

Additions

(n-k+1)2+(n-k+1)(n-k)+2

Division

1

Square root

1

Summing these numbers over the steps (for a square matrix of size n), the complexity of the algorithm (in terms of floating point multiplications) is given by

2
3

n3+n2+

1
3

n-2=O\left(n3\right).

Example

Let us calculate the decomposition of

A=\begin{bmatrix}12&-51&4\\6&167&-68\\-4&24&-41\end{bmatrix}.

First, we need to find a reflection that transforms the first column of matrix A, vector into

Now,

u=x-\alphae1,

and

v=

u
\|u\|

.

Here,

\alpha=14

and

x=a1=\begin{bmatrix}12&6&-4\end{bmatrix}sf{T}

Therefore

u=\begin{bmatrix}-2&6&-4\end{bmatrix}sf{T}=2\begin{bmatrix}-1&3&-2\end{bmatrix}sf{T}

and and then

\begin{align}Q1={}&I-

2
\sqrt{14

\sqrt{14}}\begin{bmatrix}-1\ 3\ -2\end{bmatrix}\begin{bmatrix}-1&3&-2\end{bmatrix}\\={}&I-

1
7

\begin{bmatrix}1&-3&2\\-3&9&-6\\2&-6&4\end{bmatrix}\\={}&\begin{bmatrix}6/7&3/7&-2/7\\3/7&-2/7&6/7\\-2/7&6/7&3/7\\\end{bmatrix}.\end{align}

Now observe:

Q1A=\begin{bmatrix}14&21&-14\\0&-49&-14\\0&168&-77\end{bmatrix},

so we already have almost a triangular matrix. We only need to zero the (3, 2) entry.

Take the (1, 1) minor, and then apply the process again to

A'=M11=\begin{bmatrix}-49&-14\\168&-77\end{bmatrix}.

By the same method as above, we obtain the matrix of the Householder transformation

Q2=\begin{bmatrix}1&0&0\\0&-7/25&24/25\\0&24/25&7/25\end{bmatrix}

after performing a direct sum with 1 to make sure the next step in the process works properly.

Now, we find

Q=

sf{T}=
Q
2

\begin{bmatrix}6/7&-69/175&58/175\\3/7&158/175&-6/175\\-2/7&6/35&33/35\end{bmatrix}.

Or, to four decimal digits,

\begin{align}Q&=

sf{T}=
Q
2

\begin{bmatrix}0.8571&-0.3943&0.3314\\0.4286&0.9029&-0.0343\\-0.2857&0.1714&0.9429\end{bmatrix}\\R&=Q2Q1A=Qsf{T}A=\begin{bmatrix}14&21&-14\\0&175&-70\\0&0&-35\end{bmatrix}.\end{align}

The matrix Q is orthogonal and R is upper triangular, so is the required QR decomposition.

Advantages and disadvantages

The use of Householder transformations is inherently the most simple of the numerically stable QR decomposition algorithms due to the use of reflections as the mechanism for producing zeroes in the R matrix. However, the Householder reflection algorithm is bandwidth heavy and not parallelizable, as every reflection that produces a new zero element changes the entirety of both Q and R matrices.

Using Givens rotations

QR decompositions can also be computed with a series of Givens rotations. Each rotation zeroes an element in the subdiagonal of the matrix, forming the R matrix. The concatenation of all the Givens rotations forms the orthogonal Q matrix.

In practice, Givens rotations are not actually performed by building a whole matrix and doing a matrix multiplication. A Givens rotation procedure is used instead which does the equivalent of the sparse Givens matrix multiplication, without the extra work of handling the sparse elements. The Givens rotation procedure is useful in situations where only relatively few off-diagonal elements need to be zeroed, and is more easily parallelized than Householder transformations.

Example

Let us calculate the decomposition of

A=\begin{bmatrix}12&-51&4\\6&167&-68\\-4&24&-41\end{bmatrix}.

First, we need to form a rotation matrix that will zero the lowermost left element, We form this matrix using the Givens rotation method, and call the matrix

G1

. We will first rotate the vector to point along the X axis. This vector has an angle We create the orthogonal Givens rotation matrix,

G1

:

\begin{align}G1&=\begin{bmatrix}\cos(\theta)&0&-\sin(\theta)\\0&1&0\\\sin(\theta)&0&\cos(\theta)\end{bmatrix}\\&\begin{bmatrix}0.94868&0&-0.31622\\0&1&0\\0.31622&0&0.94868\end{bmatrix}\end{align}

And the result of

G1A

now has a zero in the

a31

element.

G1A\begin{bmatrix}12.64911&-55.97231&16.76007\\6&167&-68\\0&6.64078&-37.6311\end{bmatrix}

We can similarly form Givens matrices

G2

and which will zero the sub-diagonal elements

a21

and forming a triangular matrix The orthogonal matrix

Qsf{T}

is formed from the product of all the Givens matrices Thus, we have and the QR decomposition is

Advantages and disadvantages

The QR decomposition via Givens rotations is the most involved to implement, as the ordering of the rows required to fully exploit the algorithm is not trivial to determine. However, it has a significant advantage in that each new zero element

aij

affects only the row with the element to be zeroed (i) and a row above (j). This makes the Givens rotation algorithm more bandwidth efficient and parallelizable than the Householder reflection technique.

Connection to a determinant or a product of eigenvalues

We can use QR decomposition to find the determinant of a square matrix. Suppose a matrix is decomposed as

A=QR

. Then we have\det A = \det Q \det R.

Q

can be chosen such that

\detQ=1

. Thus,\det A = \det R = \prod_i r_

where the

rii

are the entries on the diagonal of

R

. Furthermore, because the determinant equals the product of the eigenvalues, we have\prod_ r_ = \prod_ \lambda_

where the

λi

are eigenvalues of

A

.

We can extend the above properties to a non-square complex matrix

A

by introducing the definition of QR decomposition for non-square complex matrices and replacing eigenvalues with singular values.

Start with a QR decomposition for a non-square matrix A:

A=Q\begin{bmatrix}R\ 0\end{bmatrix},    Q\daggerQ=I

where

denotes the zero matrix and

Q

is a unitary matrix.

From the properties of the singular value decomposition (SVD) and the determinant of a matrix, we have

|\prodirii|=\prodi\sigmai,

where the

\sigmai

are the singular values of

Note that the singular values of

A

and

R

are identical, although their complex eigenvalues may be different. However, if A is square, then

{\prodi\sigmai}=|\prodiλi|.

It follows that the QR decomposition can be used to efficiently calculate the product of the eigenvalues or singular values of a matrix.

Column pivoting

Pivoted QR differs from ordinary Gram-Schmidt in that it takes the largest remaining column at the beginning of each new step—column pivoting—[2] and thus introduces a permutation matrix P:

AP=QR\iffA=QRPsf{T}

Column pivoting is useful when A is (nearly) rank deficient, or is suspected of being so. It can also improve numerical accuracy. P is usually chosen so that the diagonal elements of R are non-increasing:

\left|r11\right|\ge\left|r22\right|\ge\ge\left|rnn\right|

. This can be used to find the (numerical) rank of A at lower computational cost than a singular value decomposition, forming the basis of so-called rank-revealing QR algorithms.

Using for solution to linear inverse problems

Compared to the direct matrix inverse, inverse solutions using QR decomposition are more numerically stable as evidenced by their reduced condition numbers.[3]

To solve the underdetermined linear problem

Ax=b

where the matrix

A

has dimensions

m x n

and rank first find the QR factorization of the transpose of where Q is an orthogonal matrix (i.e. and R has a special form:

R=\left[\begin{smallmatrix}R1\ 0\end{smallmatrix}\right]

. Here

R1

is a square

m x m

right triangular matrix, and the zero matrix has dimension After some algebra, it can be shown that a solution to the inverse problem can be expressed as:

x=Q

sf{T}\right)
\left[\begin{smallmatrix}\left(R
1

-1b\\0\end{smallmatrix}\right]

where one may either find

-1
R
1

by Gaussian elimination or compute

sf{T}\right)
\left(R
1

-1b

directly by forward substitution. The latter technique enjoys greater numerical accuracy and lower computations.

To find a solution

\hat{x}

to the overdetermined problem

Ax=b

which minimizes the norm first find the QR factorization of The solution can then be expressed as where

Q1

is an

m x n

matrix containing the first

n

columns of the full orthonormal basis

Q

and where

R1

is as before. Equivalent to the underdetermined case, back substitution can be used to quickly and accurately find this

\hat{x

} without explicitly inverting (

Q1

and

R1

are often provided by numerical libraries as an "economic" QR decomposition.)

Generalizations

Iwasawa decomposition generalizes QR decomposition to semi-simple Lie groups.

See also

  • Polar decomposition
  • Eigenvalue decomposition
  • Spectral decomposition
  • LU decomposition
  • Singular value decomposition

References

  1. Book: Trefethen . Lloyd N. . Bau . David III . Nick Trefethen . Numerical linear algebra . 1997 . . Philadelphia, PA . 978-0-898713-61-9.
  2. Book: Strang . Gilbert . Linear Algebra and Learning from Data . 2019 . Wellesley Cambridge Press . Wellesley . 978-0-692-19638-0 . 143 . 1st.
  3. Book: Parker, Robert L. . Geophysical Inverse Theory . 1994 . Princeton University Press . 978-0-691-20683-7 . Princeton, N.J. . 1134769155 . Section 1.13 .

Further reading

  • .

External links

This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "QR decomposition".

Except where otherwise indicated, Everything.Explained.Today is © Copyright 2009-2024, A B Cryer, All Rights Reserved. Cookie policy.

QR decomposition explained (2024)

FAQs

What is the point of QR decomposition? ›

It follows that the QR decomposition can be used to efficiently calculate the product of the eigenvalues or singular values of a matrix.

What are the different types of QR decomposition? ›

There are really two kinds of QR decomposition: there is the full QR decomposition, where A is m×n, Q is m×m, and R is m×n. There is also the reduced QR decomposition, where A is m×n with rank r, Q is m×r, and R is r×n.

What is R in QR decomposition? ›

A QR decomposition of. a real square matrix A is a decomposition of A as. A = QR, where Q is an orthogonal matrix (i.e. QT Q = I) and R is an upper triangular matrix. If A is nonsingular, then this factorization is unique.

Is QR decomposition is the same as LU decomposition? ›

Another decomposition method is QR decomposition. One advantage of QR decomposition over LU decomposition is that this method does not require that the decomposition be carried out on a square matrix.

What are the disadvantages of QR? ›

They're not super secure. Anyone can access the information stored in a QR code. Be smart about what you link to – and make sure you don't put any confidential info in whatever you're linking to. Some people are also cautious about scanning QR codes because they don't know where the link is going to take them.

Who invented QR decomposition? ›

The QR algorithm was developed in the late 1950s by John G. F. Francis and by Vera N. Kublanovskaya, working independently. The basic idea is to perform a QR decomposition, writing the matrix as a product of an orthogonal matrix and an upper triangular matrix, multiply the factors in the reverse order, and iterate.

Is a QR decomposition unique? ›

The QR decomposition is unique. satisfying the stated properties are unique. Thus, the two matrices involved in the QR decomposition are unique.

What are the four types of decomposition? ›

There are 3 different types of decomposition reactions thermal, electrolytic, and photolytic.

Does every matrix have a QR factorization? ›

Theorem. Every matrix has a QR-decomposition, though R may not always be invertible.

What is R in QR code? ›

An R QR code is a type of QR code (Quick Response code) that is used to store data in a two-dimensional barcode. It is used to store information such as URLs, contact information, and other data.

What does the Q in the R stand for in QR code? ›

Well, the QR – which stands for “quick response” – code is basically a barcode on steroids. While the barcode holds information horizontally, the QR code does so both horizontally and vertically. This enables the QR code to hold over a hundred times more information.

Does QR decomposition work on non-square matrices? ›

which can be solved directly by back-substitution. QR-decomposition can also be performed on non-square matrices with few long columns.

Does QR decomposition always exist? ›

The QR decompostion always exists, even if the matrix does not have full rank, so the constructor will never fail. The primary use of the QR decomposition is in the least squares solution of nonsquare systems of simultaneous linear equations.

Why is LU decomposition faster? ›

LU decomposition stands for Lower Upper triangular decomposition. The triangular matrices speed up the process of back substitution in root finding.

Is the LU decomposition unique Why or why not? ›

The LU decomposition may not exist for a matrix A . If the LU decomposition exists then it is unique. The LU decomposition provides an efficient means of solving linear equations. The reason that L has all diagonal entries set to 1 is that this means the LU decomposition is unique.

What is the purpose of the decomposition method? ›

Decomposition method is a generic term for solutions of various problems and design of algorithms in which the basic idea is to decompose the problem into subproblems. The term may specifically refer to: Decomposition method (constraint satisfaction) in constraint satisfaction.

What is the purpose of problem decomposition? ›

It involves breaking down a complex problem or system into smaller parts that are more manageable and easier to understand. The smaller parts can then be examined and solved, or designed individually, as they are simpler to work with.

What is the point of spectral decomposition? ›

We can use spectral decomposition to more easily solve systems of equations. For example, in OLS estimation, our goal is to solve the following for b. The orthogonal P matrix makes this computationally easier to solve. Moreover, since D is a diagonal matrix, D−1 is also easy to compute.

What is the point of time series decomposition? ›

By decomposing a time series, you gain a clearer understanding of the underlying patterns and trends, making it easier to analyze and forecast the data accurately.

Top Articles
Latest Posts
Recommended Articles
Article information

Author: Foster Heidenreich CPA

Last Updated:

Views: 6457

Rating: 4.6 / 5 (56 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Foster Heidenreich CPA

Birthday: 1995-01-14

Address: 55021 Usha Garden, North Larisa, DE 19209

Phone: +6812240846623

Job: Corporate Healthcare Strategist

Hobby: Singing, Listening to music, Rafting, LARPing, Gardening, Quilting, Rappelling

Introduction: My name is Foster Heidenreich CPA, I am a delightful, quaint, glorious, quaint, faithful, enchanting, fine person who loves writing and wants to share my knowledge and understanding with you.