A matrix-valued function, or simply a matrix function,
can either be interpreted as a function
taking
matrices as values or as an
matrix with
(scalar-valued) functions as entires, say
, and its
differential is obtained by simply taking the differentials
of its entries:
; here we use the superscript
and the
subscript
to indicate the
th row and
th column.
Example 8.1. Find
at
for the matrix function
.
Solution.
. At
, we have
Suppose that we have anothe matrix function, say
, which is
. Then
is an
matrix function with its
entry is given by
. Notice that
Rule No. M1 (Product rule for matrix function).
.
Caveat:
the order of the product
CANNOT be reversed. For scalar functions
and
we do have
. But this is no longer valid for matrix functions.
Example 8.2. Let
be an invertible
matrix
function. (``Invertible'' here means that there is a matrix function
, called the inverse of
such that
, where
is the identity matrix.)
Verify that
Solution. Differntiate both sides of
,
the RHS becomes
because
is a constant matrix and the LHS is
. Thus
. Multiply both sides on the right
by
, we have
.
A particular case of Rule No. M1 is
, where
is a
column vector. This is because a column vector here is nothing but
an
matrix. We will identify a column vector with a usual vector
with components displayed in a row by means of the following
convention: any row of objects surround by the round brackets
``('' and ``)'' is the
same as the column of same objects, following the same order,
embraced by square brackets ``['' and ``]''. For example, if
and
, then
We say that a collection
of
complex matrices form a matrix group if it satisfies the following
conditions:
(MG1) The identity matrix
belongs to
.
Our purpose of introducing matrix groups is to let the reader have
a taste of Lie groups, which is an important
mathematical subject. (``Lie'' here is pronounced as ``Lee'').
Example 8.3. Show that all invertible real
matrices form a matrix group. This matrix group is denoted by
, called
the general linear group.
Solution. We have to check conditions (MG1) to (MG3) for
. Clearly the identity matrix is invertible.
If
and
are invertible
matrices, then so is
with
. Indeed
and similarly
.
It remains to show MG3). This follows from the fact that if
is invertible, then so is
and the inverse
of
is just
itself. (As you can see, the important thing here
is to know what you must check. The actual checking is easy and rather
insipid. This situation is a bit like dealing with bureaucrats - it is
important to know the right person to talk to, but you do not expect
the conversation to be interesting.)
Example 8.4. Show that the set
of all real
matrices
satisfying
is a matrix group. This matrix
group is called the orthogonal group. (Recall that
is the
transpose of
. A matrix
satisfying
the condition
(that is,
)
is called an orthogonal matrix.)
Solution. We have to check that the set
satisfies conditions (MG1) to (MG3).
Since
, it follows that the identity matrix
is orthogonal. Let
and
be orthogonal matrices. Then
and
. Hence
and
similarly
; (here we have used the well known identity
in matrix theory). Finally, suppose that
is an orthogonal matrix. Then
and hence
and simialrly
, showing that
is also orthogonal.
Now we introduce an important concept: the Lie algebra
of a (closed) matrix group
.
An element in
is an
matrix produced in the following
way. Take any parametric curve
in
such that
.
Differentiate
and evaluate the derivative
at
.
The resulting matrix
is an element in
.
For example,
Fact. If
is a (closed) matrix group, then
is a Lie
algebra, and we have
Example 8.5. A matrix Lie group
is said to be commutative or
abelian if
for all
and
in
. Show that, if
is an
abelian matrix group, then its ajoint representaion is trivial in
the sense that
is the identity mapping
(that is
for all
) for all
and
its Lie algebra
is also abelian in the
sense that
for all
and
in
.
Solution: Let
and
.
Then there is a parametric curve
in
such that
and
.
Since
is commutative, we have
.
Differentiating the last identity with respect to
and then
evaluating at
, we arrive at
. Thus
. Next, let
be another element of
the Lie algebra
and
be a corresponding parametric
curve in
such that
and
. Then
we have
. Differentiating the last identity and
setting
, we have
. Therefore
.
In our definition of the Lie algebra
of a matrix group, we
say that
is in it if and only if there exists a curve
in
such that
and
. A natural question is:
is there a good choice of
for us to check this ? The answer is YES.
It is
. It turns out that
is in
if and
only if
is in
for all
. But what is the meaning of
?
To define the matrix exponential
,
we mimick the scalar case
, where
is a
real number. You probably have never worried about how to define
rigorously. But now we have to. Recall from elementary calculus that
(
is just
when
) is the solution to
the initial value problem
,
. So, if we can
establish the existence and uniqueness of solution to this
initial value problem, then we can define
to be the unique
its solution. We adopt the same tactic in our treatment for matrix
exponentials. Take any
(constant) matrix
. Consider
the matrix differential equation
, where the unknown
is an
matrix function, with the initial condition
.
Thus we are considering the following initial value problem for a matrix
equation:
Example 8.6. Find
for
.
Solution: A simple computation shows that
,
and
. Thus the sequence
(
) repeats
the pattern of
. So
Fact.
is in
if and only if
is in
for all
.
Suppose that
is in
for all
. Then
is a parametric curve in
with
and hence
is in
. This proves the ``if'' part.
The ``only if'' part is much harder to prove. We only give a very brief
sketch here.
Suppose that
. Then there is a smooth parametric
curve
such that
and
.
Thus, near
, we have
, where
shrinks to
as
. Now fix
. For large
,
Example 8.7. Determine the Lie algebra
of the orthogonal
group
.
Solution: Let
be a matrix in
. Then there is
a path
in
such that
and
. That
is orthogonal means
. Differentiate both sides, applying the product rule:
.
Next, evaluate at
, taking
and
into account:
, that is,
is skew symmetric. Conversely, given
a skew symmetric matrix
(
), consider the path
. Since
is the
solution to the initial value problem (8.2) above, we have
and
. Thus, in order
to show that
belongs to the Lie algebra of
, it is enough to
check that
belongs to
.
Indeed,
Exercises
,
(a)
and
, where
. Take
.
(b)
and
, (for
, see Exercise 10 above).