Matrix
Intro
Many of the ideas about matrices build on what we have already learned about vectors. Recall that we can express a pair of simultaneous equations as the multiplication of a matrix and a vector.

$$2a + 3b = 12$$
$$5a + b = 17$$

$$\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} \begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} 12 \\ 17 \end{pmatrix}$$
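Although the solution method only comes later in the chapter, we can already sketch this system numerically. Assuming NumPy is available (the library is our addition, not part of the text), `np.linalg.solve` finds the vector $(a, b)$ satisfying both equations:

```python
import numpy as np

# Coefficient matrix and right-hand side of the simultaneous equations
A = np.array([[2, 3],
              [5, 1]])
v = np.array([12, 17])

# Solve A @ x = v for x = (a, b)
x = np.linalg.solve(A, v)
print(x)  # a = 3, b = 2
```

Substituting back: 2(3) + 3(2) = 12 and 5(3) + 2 = 17, so the solution checks out.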
In this chapter, we are going to look under the hood to understand what exactly happens during this multiplication operation. It’s intriguing to see how powerful matrices are in terms of transforming vectors. This concept is also at the heart of linear algebra.
We will then dive into the solution of simultaneous equations by calculating the matrix inverse and determinant. Understanding the matrix inverse then unlocks a general way to convert between vector spaces. Compared with what we learned previously about changing basis, we will be able to convert a vector to any space, no longer restricted to spaces with orthogonal basis vectors.
We are also going to look at two interesting extensions of matrices. One is the orthogonal matrix, the most convenient kind of matrix to invert. We will construct such a matrix with the Gram-Schmidt process. The other extension is eigenvectors and eigenvalues. Eigenvectors have special properties under matrix transformation, and knowing the eigenvectors of a matrix also makes repeated multiplication of that matrix by itself much simpler.
Let’s get started!
Matrix as a Transformation
We start by multiplying the matrix $\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix}$ by the basis vectors $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$.

$$\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} 2 \\ 5 \end{pmatrix}$$

$$\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} = \begin{pmatrix} 3 \\ 1 \end{pmatrix}$$

To read this result: the matrix $\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix}$ transforms the vector $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ into a new vector $\begin{pmatrix} 2 \\ 5 \end{pmatrix}$, and transforms the vector $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ into a new vector $\begin{pmatrix} 3 \\ 1 \end{pmatrix}$. Graphically, we get the transformation below from $e_1$ to $e_1'$ and from $e_2$ to $e_2'$.

We learned in the previous chapter on vectors that $e_1$ and $e_2$ form a basis of the vector space. After multiplication by the matrix $\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix}$, the original vector space changes to a new one described by $e_1'$ and $e_2'$. If we call $e_1$ and $e_2$ our input vectors and $e_1'$ and $e_2'$ our output vectors, what matrix multiplication does is therefore equivalent to a vector space transformation.
This part is tricky, but very critical for us to form a right intuition of matrix multiplication. Let’s do a matrix multiplication in a step-by-step manner to illustrate this idea of vector space transformation.
$$
\begin{aligned}
\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} \begin{pmatrix} 3 \\ 2 \end{pmatrix}
&= \begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} \left( 3 \begin{pmatrix} 1 \\ 0 \end{pmatrix} + 2 \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= \begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} (3 e_1 + 2 e_2) \\
&= 3 \begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} e_1 + 2 \begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix} e_2 \\
&= 3 \begin{pmatrix} 2 \\ 5 \end{pmatrix} + 2 \begin{pmatrix} 3 \\ 1 \end{pmatrix} \\
&= 3 e_1' + 2 e_2'
\end{aligned}
$$
Note here we make use of the associative and distributive properties of matrix multiplication.
The vector $\begin{pmatrix} 3 \\ 2 \end{pmatrix}$ is originally expressed in the basis vectors $e_1$ and $e_2$. After multiplication by the matrix $\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix}$, we have projected it onto the new basis vectors $e_1'$ and $e_2'$, i.e., $3e_1' + 2e_2'$. So we can think of matrix multiplication as the vector sum of the transformed basis vectors. The matrix $\begin{pmatrix} 2 & 3 \\ 5 & 1 \end{pmatrix}$ tells us where the original basis vectors $e_1$ and $e_2$ go after the transformation. In addition, we can see that the columns of the transformation matrix are just the new basis vectors: $e_1' = \begin{pmatrix} 2 \\ 5 \end{pmatrix}$, $e_2' = \begin{pmatrix} 3 \\ 1 \end{pmatrix}$.
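The observation that the columns of a matrix are the transformed basis vectors is easy to verify numerically. A minimal NumPy sketch (NumPy itself is our assumption, not part of the original text):

```python
import numpy as np

A = np.array([[2, 3],
              [5, 1]])
e1 = np.array([1, 0])
e2 = np.array([0, 1])

# The columns of A are exactly where the basis vectors land
print(A @ e1, A[:, 0])  # e1' = (2, 5) in both cases
print(A @ e2, A[:, 1])  # e2' = (3, 1) in both cases

# A @ (3, 2) equals the vector sum 3*e1' + 2*e2'
v = np.array([3, 2])
print(A @ v)                        # (12, 17)
print(3 * (A @ e1) + 2 * (A @ e2))  # (12, 17) again
```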
If we treat a matrix as a way to transform a vector space, how many ways can we transform it? Next, we will discuss some possible matrix transformations in a 2-dimensional space.
Identity Matrix
$\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ is the identity matrix. It keeps the vector space unchanged: a vector is not altered at all when it is multiplied by the identity matrix. We can look at an example with the vector $\begin{pmatrix} 3 \\ 2 \end{pmatrix}$.

$$
\begin{aligned}
\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 3 \\ 2 \end{pmatrix}
&= \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \left( 3 \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \left( 2 \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= 3 \left( \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + 2 \left( \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= 3 \begin{pmatrix} 1 \\ 0 \end{pmatrix} + 2 \begin{pmatrix} 0 \\ 1 \end{pmatrix} = \begin{pmatrix} 3 \\ 2 \end{pmatrix}
\end{aligned}
$$
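As a sanity check (again a NumPy sketch of our own, not from the text), `np.eye` builds an identity matrix of any size:

```python
import numpy as np

I = np.eye(2)             # 2x2 identity matrix
v = np.array([3.0, 2.0])
print(I @ v)              # (3, 2): the vector is unchanged
```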
Scaling Matrix
A scaling matrix has the form $\begin{pmatrix} a & 0 \\ 0 & b \end{pmatrix}$. It scales areas in the vector space by a factor of $ab$. Let's look at the example below.

$$
\begin{aligned}
\begin{pmatrix} 5 & 0 \\ 0 & 7 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 5 & 0 \\ 0 & 7 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 5 & 0 \\ 0 & 7 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 5 & 0 \\ 0 & 7 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 5 & 0 \\ 0 & 7 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 5 \\ 0 \end{pmatrix} + y \begin{pmatrix} 0 \\ 7 \end{pmatrix}
\end{aligned}
$$

The basis vectors for $\begin{pmatrix} x \\ y \end{pmatrix}$ have shifted from $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ to $\begin{pmatrix} 5 \\ 0 \end{pmatrix}$ and $\begin{pmatrix} 0 \\ 7 \end{pmatrix}$. This stretches the vector space from a 1×1 square to a 5×7 rectangle, as shown in the graph below.

Similarly, we can also have a scaling matrix with fractional values. This squashes the original vector space into a smaller one.
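Both the stretching and the area factor $ab$ can be confirmed in NumPy (a sketch of our own; `np.linalg.det` computes the determinant, which this chapter formally introduces later):

```python
import numpy as np

S = np.diag([5, 7])            # scaling matrix (5 0; 0 7)
print(S @ np.array([1, 0]))    # e1 -> (5, 0)
print(S @ np.array([0, 1]))    # e2 -> (0, 7)

# The unit square's area is scaled by a * b = 35
print(np.linalg.det(S))        # 35, up to floating-point error

# Fractional values squash the space instead of stretching it
S_small = np.diag([0.5, 0.5])
print(S_small @ np.array([4.0, 2.0]))  # (2, 1)
```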
Flipping Matrix
We can use the matrix $\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$ to flip the horizontal basis vector $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ to the other side. Here is an illustration.

$$
\begin{aligned}
\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} -1 \\ 0 \end{pmatrix} + y \begin{pmatrix} 0 \\ 1 \end{pmatrix}
\end{aligned}
$$

Graphically, this flips the basis vector $e_1$ across the vertical axis.

Similarly, we can use the matrix $\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$ to flip the basis vectors across the horizontal axis.

$$
\begin{aligned}
\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 1 \\ 0 \end{pmatrix} + y \begin{pmatrix} 0 \\ -1 \end{pmatrix}
\end{aligned}
$$
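Both flips side by side, as a small NumPy sketch (our own illustration):

```python
import numpy as np

F_vert = np.array([[-1, 0],    # flips across the vertical axis
                   [ 0, 1]])
F_horiz = np.array([[1,  0],   # flips across the horizontal axis
                    [0, -1]])

v = np.array([3, 2])
print(F_vert @ v)   # (-3, 2): x-coordinate negated
print(F_horiz @ v)  # (3, -2): y-coordinate negated
```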

Inverse Matrix
When we flip the basis vectors in both the vertical and horizontal directions, we get a new vector space in which both basis vectors have their values negated. This matrix is $\begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} = -I$. Note that this sign-inverting matrix is not the same thing as the matrix inverse $A^{-1}$ we will compute later in the chapter; geometrically, $-I$ is a 180° rotation.

$$
\begin{aligned}
\begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} -1 \\ 0 \end{pmatrix} + y \begin{pmatrix} 0 \\ -1 \end{pmatrix}
\end{aligned}
$$
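We can check in NumPy (our own sketch) that composing the two flips really gives this matrix:

```python
import numpy as np

F_vert = np.array([[-1, 0], [0, 1]])    # flip across vertical axis
F_horiz = np.array([[1, 0], [0, -1]])   # flip across horizontal axis

# Applying both flips negates both coordinates
N = F_horiz @ F_vert
print(N)                     # [[-1, 0], [0, -1]]
print(N @ np.array([3, 2]))  # (-3, -2)
```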

Diagonal Mirroring Matrix
We can also mirror the vector space along the 45° line with the matrix $\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$.

$$
\begin{aligned}
\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 0 \\ 1 \end{pmatrix} + y \begin{pmatrix} 1 \\ 0 \end{pmatrix}
\end{aligned}
$$

The basis vector $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$ is shifted to $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ and the basis vector $\begin{pmatrix} 0 \\ 1 \end{pmatrix}$ is shifted to $\begin{pmatrix} 1 \\ 0 \end{pmatrix}$, as shown in the graph.

If we combine the matrix $\begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}$ from the previous section and the diagonal mirroring matrix, we get a mirroring of the vector space along the −45° line. This matrix is $\begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix}$.

$$
\begin{aligned}
\begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 0 & -1 \\ -1 & 0 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 0 \\ -1 \end{pmatrix} + y \begin{pmatrix} -1 \\ 0 \end{pmatrix}
\end{aligned}
$$
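A NumPy sketch (our own) of both mirrors, including a check that the −45° mirror is indeed the composition of $-I$ and the +45° mirror:

```python
import numpy as np

M_pos = np.array([[0, 1],    # mirror along the +45° line
                  [1, 0]])
M_neg = np.array([[ 0, -1],  # mirror along the -45° line
                  [-1,  0]])

v = np.array([3, 2])
print(M_pos @ v)   # (2, 3): coordinates swapped
print(M_neg @ v)   # (-2, -3): swapped and negated

# M_neg is -I composed with the +45° mirror
print(np.array_equal(M_neg, (-np.eye(2)) @ M_pos))  # True
```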

Shearing Matrix
Our basis vectors can also be transformed in more interesting ways, such as shearing, where the vector space is no longer a square or rectangle. Let's see an example with the shearing transformation matrix $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$.

$$
\begin{aligned}
\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 1 \\ 0 \end{pmatrix} + y \begin{pmatrix} 1 \\ 1 \end{pmatrix}
\end{aligned}
$$

The new vector space after the transformation is a parallelogram. Shearing is not limited to just one basis vector: we can shear both $e_1$ and $e_2$ at the same time.
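A NumPy sketch (our own) of this shear; note the determinant is 1, so the parallelogram has the same area as the original unit square:

```python
import numpy as np

H = np.array([[1, 1],   # shear: e2 is pushed to (1, 1)
              [0, 1]])
print(H @ np.array([1, 0]))  # e1 -> (1, 0), unchanged
print(H @ np.array([0, 1]))  # e2 -> (1, 1)

# A shear preserves area: the determinant is 1
print(np.linalg.det(H))
```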
Rotation Matrix
Rotation is another common type of transformation. We can rotate the basis vectors by 90° anti-clockwise using the transformation matrix $\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$.

$$
\begin{aligned}
\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}
&= \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \left( x \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \left( y \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \left( \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1 \\ 0 \end{pmatrix} \right) + y \left( \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 0 \\ 1 \end{pmatrix} \right) \\
&= x \begin{pmatrix} 0 \\ 1 \end{pmatrix} + y \begin{pmatrix} -1 \\ 0 \end{pmatrix}
\end{aligned}
$$

The more general case of rotating the basis vectors by any angle $\theta$ anti-clockwise is given by $\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$.
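The general rotation matrix is easy to build as a small helper (our own NumPy sketch; the `rotation` function name is our choice). Setting $\theta = 90°$ recovers the matrix above:

```python
import numpy as np

def rotation(theta):
    """Matrix that rotates vectors by theta radians anti-clockwise."""
    return np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])

R90 = rotation(np.pi / 2)
print(np.round(R90 @ np.array([1, 0])))  # e1 -> (0, 1)
print(np.round(R90 @ np.array([0, 1])))  # e2 -> (-1, 0)
```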
Composite Transformation Matrix
It is also possible to apply transformations to the basis vectors more than once. For example, we can first do a 90° anti-clockwise rotation using $\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$, followed by a flip across the vertical axis using $\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$. The composite transformation is just the matrix product of the two basic transformation matrices.

$$\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$

We can add more matrices to the expression for more transformation steps. Note that the order of operations in matrix multiplication matters, because matrix multiplication is not commutative, i.e. $\begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \neq \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}$. The matrix for the later transformation always sits to the left of the matrix for the earlier transformation in the product.
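Both the composite product and the failure of commutativity can be checked directly (our own NumPy sketch):

```python
import numpy as np

R = np.array([[0, -1],   # 90° anti-clockwise rotation
              [1,  0]])
F = np.array([[-1, 0],   # flip across the vertical axis
              [ 0, 1]])

# Rotate first, then flip: the later matrix F goes on the left
print(F @ R)   # [[0, 1], [1, 0]], the +45° mirror matrix

# The reverse order gives a different transformation
print(R @ F)   # [[0, -1], [-1, 0]]
```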
Matrix transformation is a very important step in many machine learning applications, especially image-related tasks. One good example is face recognition, where you need to center and align the subject's face in the captured image.
(Inspired by Mathematics for Machine Learning lecture series from Imperial College London)