Let \(V\) be a vector space over the field \(\mathbb{F}\) having dimension \(n\). Recall that given an ordered basis \((v_1,\ldots,v_n)\) for \(V\) and a vector \(u \in V\), the tuple representation of \(u\) with respect to \((v_1,\ldots,v_n)\), denoted by \([u]_{(v_1,\ldots, v_n)}\), is the tuple \(\begin{bmatrix} \lambda_1 \\ \lambda_2 \\ \vdots \\ \lambda_n\end{bmatrix}\) such that \(\displaystyle u = \sum_{i=1}^n \lambda_i v_i\). (Note that \(\lambda_1,\ldots,\lambda_n\) are uniquely determined since \(v_1,\ldots,v_n\) form a basis for \(V\).) As a result, one can work with tuples instead of the vectors in the original vector space.
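For example, let \(V = \mathbb{R}^2\) and let \(\Delta = \left(\begin{bmatrix} 1\\1\end{bmatrix}, \begin{bmatrix} 1\\-1\end{bmatrix}\right)\). Then for \(u = \begin{bmatrix} 3\\1\end{bmatrix}\), we have \(u = 2\begin{bmatrix} 1\\1\end{bmatrix} + 1\begin{bmatrix} 1\\-1\end{bmatrix}\), and so \([u]_{\Delta} = \begin{bmatrix} 2\\1\end{bmatrix}\).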

We will now see that we can express linear transformations as matrices as well. Hence, one can simply focus on studying linear transformations of the form \(T(x) = Ax\) where \(A\) is a matrix.

As with tuple representations of vectors, the matrix representation of a linear transformation depends on the choice of an ordered basis for the domain and one for the codomain.

Matrix representations of linear transformations

Let \(V\) and \(W\) be vector spaces over some field \(\mathbb{F}\). Let \(\Gamma = (v_1,\ldots,v_n)\) be an ordered basis for \(V\) and let \(\Omega = (w_1,\ldots,w_m)\) be an ordered basis for \(W\).

Let \(T:V \rightarrow W\) be a linear transformation. We can give a matrix representation of \(T\) as follows.

For each \(j \in \{1,\ldots,n\}\), \(T(v_j)\) is a vector in \(W\). Hence, we can write \(T(v_j)\) as a linear combination of \(w_1,\ldots,w_m\). Therefore, there are scalars \(a_{1,j}, a_{2,j},\ldots, a_{m,j}\) such that \(T(v_j) = a_{1,j}w_1 + a_{2,j}w_2 + \cdots + a_{m,j}w_m\).

Take an arbitrary \(v \in V\). Then, there exist scalars \(\lambda_1,\ldots,\lambda_n\) such that \(v = \lambda_1 v_1 +\cdots + \lambda_n v_n\). So \([v]_\Gamma = \begin{bmatrix} \lambda_1 \\ \vdots \\ \lambda_n\end{bmatrix}\).

Hence, \begin{eqnarray*} T(v) & = & T(\lambda_1 v_1 +\cdots + \lambda_n v_n) \\ & = & \sum_{j=1}^n \lambda_j T(v_j) \\ & = & \sum_{j=1}^n \lambda_j \left (\sum_{i=1}^m a_{i,j} w_i \right)\\ & = & \sum_{j=1}^n \sum_{i=1}^m (a_{i,j} \lambda_j)w_i \\ & = & \sum_{i=1}^m \left(\sum_{j=1}^n a_{i,j}\lambda_j\right ) w_i. \end{eqnarray*}

Thus, \([T(v)]_{\Omega} = \displaystyle\begin{bmatrix} \sum_{j=1}^n a_{1,j}\lambda_j \\ \sum_{j=1}^n a_{2,j}\lambda_j \\ \vdots \\ \sum_{j=1}^n a_{m,j}\lambda_j \end{bmatrix} = \begin{bmatrix} a_{1,1} & a_{1,2} & \cdots & a_{1,n} \\ a_{2,1} & a_{2,2} & \cdots & a_{2,n} \\ \vdots & \vdots & \ddots & \vdots \\ a_{m,1} & a_{m,2} & \cdots & a_{m,n} \end{bmatrix} \begin{bmatrix} \lambda_1 \\ \lambda_2 \\ \vdots \\ \lambda_n\end{bmatrix}. \) So, if we let \(A\) be the \(m\times n\) matrix such that \(A_{i,j} = a_{i,j}\), then \([T(v)]_{\Omega}\) is precisely \(A [v]_{\Gamma}\).

Hence, given any \(u \in V\), we can obtain the tuple representation of \(T(u)\) with respect to \(\Omega\) by computing \(A[u]_{\Gamma}\). The matrix \(A\) is called the matrix representation of \(T\) with respect to \(\Gamma\) and \(\Omega\) and is denoted by \([T]_{\Gamma}^{\Omega}\).

Note that column \(j\) of \([T]_\Gamma^\Omega\) is given by \([T(v_j)]_{\Omega}\).
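Here is a minimal computational sketch of this recipe (in Python with numpy; the function name and the encoding conventions are ours, not from any particular library). It assumes \(V\) and \(W\) have each been identified with a coordinate space, so that \(T\) can be evaluated on standard coordinate vectors and the vectors in \(\Gamma\) and \(\Omega\) are given in standard coordinates; expressing \(T(v_j)\) in the basis \(\Omega\) then amounts to solving a linear system.

```python
import numpy as np

def matrix_representation(T, Gamma, Omega):
    """Column j of the result is [T(Gamma[j])]_Omega.

    T     : a function mapping standard coordinate vectors of V
            to standard coordinate vectors of W
    Gamma : list of basis vectors of V, in standard coordinates
    Omega : list of basis vectors of W, in standard coordinates
    """
    # B has the vectors of Omega as its columns, so solving
    # B x = w expresses w in the basis Omega.
    B = np.column_stack(Omega)
    columns = [np.linalg.solve(B, T(v)) for v in Gamma]
    return np.column_stack(columns)
```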

Example

Let \(P_2\) denote the vector space of polynomials in \(x\) with real coefficients having degree at most 2. Let \(T:\mathbb{R}^3\rightarrow P_2\) be the linear transformation given by \[T\left(\begin{bmatrix} a\\b\\c\end{bmatrix}\right) = (a-b)x^2 + cx + (a+b+c).\] Let \(\Gamma = \left ( \begin{bmatrix} 1\\0\\0\end{bmatrix}, \begin{bmatrix} 1\\1\\0\end{bmatrix}, \begin{bmatrix} 0 \\-1\\1\end{bmatrix} \right)\) and \(\Omega = (x+1,x^2 - x, x^2 + x -1)\). We now find \([T]_{\Gamma}^{\Omega}\).

One can easily check that \(\Gamma\) and \(\Omega\) are ordered bases for \(\mathbb{R}^3\) and \(P_2\), respectively.
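One way to verify this is to identify a polynomial \(a_0 + a_1 x + a_2 x^2\) with its coefficient vector \((a_0, a_1, a_2)\) and check that each set of three vectors is linearly independent. A minimal numpy sketch of this check (the coefficient encoding is our own convention):

```python
import numpy as np

# Columns of G are the vectors of Gamma; columns of O are the
# coefficient vectors (constant, x, x^2) of x+1, x^2-x, x^2+x-1.
G = np.column_stack([[1, 0, 0], [1, 1, 0], [0, -1, 1]])
O = np.column_stack([[1, 1, 0], [0, -1, 1], [-1, 1, 1]])

print(np.linalg.det(G), np.linalg.det(O))  # 1.0 and -3.0: both nonzero
```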

We first find scalars \(\alpha_1,\alpha_2,\alpha_3,\beta_1,\beta_2,\beta_3, \gamma_1,\gamma_2,\gamma_3\) such that \begin{eqnarray*} T\left ( \begin{bmatrix} 1\\0\\0\end{bmatrix}\right) & = & \alpha_1(x+1) + \beta_1(x^2-x) + \gamma_1(x^2+x-1) \\ T\left ( \begin{bmatrix} 1\\1\\0\end{bmatrix}\right) & = & \alpha_2(x+1) + \beta_2(x^2-x) + \gamma_2(x^2+x-1) \\ T\left ( \begin{bmatrix} 0\\-1\\1\end{bmatrix}\right) & = & \alpha_3(x+1) + \beta_3(x^2-x) + \gamma_3(x^2+x-1). \end{eqnarray*} Then \([T]_{\Gamma}^{\Omega} = \begin{bmatrix} \alpha_1 & \alpha_2 & \alpha_3 \\ \beta_1 & \beta_2 & \beta_3 \\ \gamma_1 & \gamma_2 & \gamma_3 \\ \end{bmatrix}.\)

Now, \(T\left ( \begin{bmatrix} 1\\0\\0\end{bmatrix}\right) = x^2 + 1\). Hence, we need \[x^2 + 1 = \alpha_1(x+1) + \beta_1(x^2-x) + \gamma_1(x^2+x-1),\] or equivalently, \[x^2 + 1 = (\beta_1+\gamma_1)x^2 + (\alpha_1-\beta_1+\gamma_1)x + (\alpha_1-\gamma_1).\] Comparing coefficients gives \begin{eqnarray*} 1 & = & \beta_1 + \gamma_1 \\ 0 & = & \alpha_1 -\beta_1 + \gamma_1 \\ 1 & = & \alpha_1 - \gamma_1. \\ \end{eqnarray*} Solving gives \(\alpha_1 = 1\), \(\beta_1 = 1\), \(\gamma_1 = 0\).
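In matrix form, the system reads \[\begin{bmatrix} 0 & 1 & 1 \\ 1 & -1 & 1 \\ 1 & 0 & -1 \end{bmatrix} \begin{bmatrix} \alpha_1 \\ \beta_1 \\ \gamma_1 \end{bmatrix} = \begin{bmatrix} 1 \\ 0 \\ 1 \end{bmatrix}.\] Observe that the coefficient matrix depends only on \(\Omega\); the two systems below differ from this one only in their right-hand sides.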

Similarly, we obtain \[T\left ( \begin{bmatrix} 1\\1\\0\end{bmatrix}\right) = 2 = \frac{4}{3}(x+1) + \frac{2}{3}(x^2-x) - \frac{2}{3}(x^2+x-1)\] and \[T\left ( \begin{bmatrix} 0\\-1\\1\end{bmatrix}\right) = x^2 + x = \frac{2}{3}(x+1) + \frac{1}{3}(x^2-x) + \frac{2}{3}(x^2+x-1).\]

Hence, \([T]_{\Gamma}^{\Omega} = \begin{bmatrix} 1 & \frac{4}{3} & \frac{2}{3} \\ 1 & \frac{2}{3} & \frac{1}{3} \\ 0 & -\frac{2}{3} & \frac{2}{3} \end{bmatrix}.\)
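Using the matrix_representation sketch from earlier, together with the same coefficient-vector encoding of \(P_2\) (in the order constant, \(x\), \(x^2\), our own convention), this matrix can be reproduced numerically:

```python
import numpy as np

# T([a,b,c]) = (a-b)x^2 + cx + (a+b+c), returned as a coefficient
# vector in the order (constant, x, x^2).
T = lambda v: np.array([v[0] + v[1] + v[2], v[2], v[0] - v[1]])

Gamma = [np.array([1, 0, 0]), np.array([1, 1, 0]), np.array([0, -1, 1])]
Omega = [np.array([1, 1, 0]),   # x + 1
         np.array([0, -1, 1]),  # x^2 - x
         np.array([-1, 1, 1])]  # x^2 + x - 1

print(matrix_representation(T, Gamma, Omega))
# agrees with [[1, 4/3, 2/3], [1, 2/3, 1/3], [0, -2/3, 2/3]]
```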

We check this against a couple of vectors. Let \(u = \begin{bmatrix} 0 \\ 1 \\0\end{bmatrix}\) and \(v = \begin{bmatrix} 0 \\ 0 \\ 1\end{bmatrix}\). Then \(T(u) = -x^2 + 1\) and \(T(v) = x + 1\), while \([u]_\Gamma = \begin{bmatrix} -1 \\ 1 \\0\end{bmatrix}\) and \([v]_\Gamma = \begin{bmatrix} -1\\1 \\ 1\end{bmatrix}\).

Let \(A = [T]_{\Gamma}^{\Omega}\).

Then \(A [u]_\Gamma = \begin{bmatrix} \frac{1}{3}\\-\frac{1}{3}\\-\frac{2}{3} \end{bmatrix}\), which is the tuple representation of \(\frac{1}{3}(x+1)-\frac{1}{3}(x^2-x)-\frac{2}{3}(x^2+x-1) = -x^2+1 = T(u)\).

And \(A [v]_\Gamma = \begin{bmatrix} 1\\0\\0\end{bmatrix}\), which is the tuple representation of \(x+1 = T(v)\).
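These two checks can also be scripted; a small sketch, using the same numpy conventions as above:

```python
import numpy as np

A = np.array([[1, 4/3, 2/3],
              [1, 2/3, 1/3],
              [0, -2/3, 2/3]])
u_Gamma = np.array([-1, 1, 0])
v_Gamma = np.array([-1, 1, 1])

print(A @ u_Gamma)  # [ 1/3, -1/3, -2/3 ] -> -x^2 + 1 = T(u)
print(A @ v_Gamma)  # [ 1, 0, 0 ]         ->  x + 1   = T(v)
```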

Remark: It is perhaps a bit confusing that tuple representations of vectors in \(\mathbb{R}^3\) are also vectors in \(\mathbb{R}^3\). But why would one want \(3\)-tuple representations of vectors in \(\mathbb{R}^3\)? The answer might not be obvious at first sight. However, when one works with \(\mathbb{R}^n\) for large \(n\), it might be possible to choose an ordered basis in such a way that the vectors one is interested in have sparse tuple representations (i.e., tuple representations in which very few entries are nonzero). Sparsity can drastically improve certain kinds of computations and is one of the main ingredients on which current digital audio and image compression algorithms are based.

Summary

With tuple representations for vectors and matrix representations for linear transformations, we have a unifying framework for computations over all finite-dimensional vector spaces. In other words, thinking about an \(n\)-dimensional vector space is essentially thinking about \(n\)-tuples and thinking about linear transformations is essentially thinking about matrices. The usefulness of this fact cannot be overstated, especially when it comes to developing computer software that works with vector spaces.

Exercises

Let \(T:\mathbb{R}^2 \rightarrow \mathbb{R}^2\) be a linear transformation. Let \(\Gamma = \left ( \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \begin{bmatrix} 1\\ 1 \end{bmatrix}\right) \) and \(\Omega = \left ( \begin{bmatrix} 2 \\ 1 \end{bmatrix}, \begin{bmatrix} 1\\ 0 \end{bmatrix}\right) \) be ordered bases for \(\mathbb{R}^2\).

  1. Suppose that \(T\) is given by \(T\left(\begin{bmatrix} x_1 \\ x_2 \end{bmatrix}\right)= \begin{bmatrix} 2x_1 - x_2 \\ x_1 + x_2 \end{bmatrix}\). Find \([T]_\Gamma^\Omega\).  

  2. Suppose that \(T\left(\begin{bmatrix} 1 \\ 0 \end{bmatrix}\right)= \begin{bmatrix} 2 \\ 1 \end{bmatrix}\) and \(T\left(\begin{bmatrix} 0 \\ 1 \end{bmatrix}\right)= \begin{bmatrix} 1 \\ -1 \end{bmatrix}\). Find \([T]_\Gamma^\Omega\).