This note uses Named tensor notation.

The singular values decomposition of a matrix ARfgleft×fgright is a complete description of how it scales different directions in the spaces Rfgleft and Rfgright. Any matrix A can be expressed as the sum of r:=rank(A) outer products

A=fgsingσUV,

where

  • URfgsing×fgleft is made of r orthonormal vectors that span A within Rfgleft;
  • VRfgsing×fgright is made of r orthonormal vectors that span A within Rfgright;
  • the singular values σRfgsing describes how these vectors are scaled by A.

Notes:

  • This shows that any matrix is diagonal up to a rotation of the spaces it acts on!
  • Unlike for eigenvalues, the vectors Ufgsing(i) and Vfgsing(i) do not need to point in the same direction (and indeed they’re not part of the same space!).
  • Since we limited the sum to r terms, σ only contains only the nonzero singular values (this is known as the compact SVD).

What it says about A as a transformation

Linear map

When we apply A to a vector xRfgleft, x’s component in the direction of Ufgsing(i) gets scaled by a factor σi and transformed into direction Vfgsing(i):

Afgfgleftx=(fgsingσUV)fgfgleftx=fgsingσscaling(Ufgfgleftx)``coordinates of x''Vnew directions.

By symmetry, Afgfgrighty transforms direction Vfgsing(i) into direction Ufgsing(i) and scales by σi.

Geometric interpretation

In particular, the transformation xAfgfgleftx turns the unit ball

B:={xRfgleft|xfgleft1}

into an ellipsoid of dimension r within Rfgright, whose semi-axes are σiVfgsing(i). In particular, the length of the semi-axes are given by the singular values. Indeed, we have

Afgfgleftx=(fgsingσUV)fgfgleftx=(Ufgfgleftx)fgfgsing(σV),

and the only constraint that x being in B imposes on UfgfgleftxRfgsing is that it should have norm at most 1.

Bilinear form

In a similar vein, when we look at the components of x and y relative to U and V, the singular values give the coefficients of the bilinear form associated with A:

xfgfgleftAfgfgrighty=xfgfgleft(fgsingσUV)fgfgrighty=fgsingσ(Ufgfgleftx)``coordinates of x'' (Vfgfgrighty)``coordinates of y''.

Relation to eigenvalues

The singular values of A are the square roots of the eigenvalues of

AfgfgrightAT(fgleft)Rfgleft×fgleft,

which corresponds to the linear map that uses A to take a vector xRfgleft into Rfgright and then back to Rfgleft. Indeed,1

AfgfgrightAT(fgleft)=(fgsingσUV)fgfgright(fgsingσUT(fgleft)V)=fgsingfgsingσUVfgfgright(σUT(fgleft)V)T(fgsing)=fgsingfgsingσU(σUT(fgleft))T(fgsing)(VfgfgrightVT(fgsing))Ifgsing,fgsing=fgsingσ2UUT(fgleft),

which is an eigendecomposition since the vectors of U (in Rfgleft) are linearly independent.

  1. This particular computation is admittedly simpler in the usual matrix multiplication notation: taking ARn×m, we have AAT=(UΣVT)(UΣVT)T=UΣVTVΣUT=UΣImΣUT=UΣ2UT