Let \(C \subseteq \mathbb{R}^n\) be a polyhedral cone.
A nonzero \(\mathbf{d} \in C\) is an extreme ray of \(C\)
if there do not exist linearly independent \(\mathbf{u},\mathbf{v}\in C\)
and positive scalars \(\lambda\) and \(\gamma\) such that
\(\mathbf{d} = \lambda \mathbf{u} + \gamma \mathbf{v}\).
Note that if \(\mathbf{d}\) is an extreme ray, then
\(\lambda \mathbf{d}\) is also an extreme ray for all \(\lambda > 0\).
We say that two extreme rays are equivalent if one is a positive
scalar multiple of the other.
Hence, when enumerating extreme rays of a polyhedral cone, we often
enumerate one representative from each equivalence class of extreme rays.
One way to achieve this is to impose a normalization condition such as
restricting to unit vectors with respect to some vector norm.
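As a quick numerical illustration of this convention (a minimal sketch in Python with NumPy; the particular direction vector is made up for the example), every positive multiple of a ray direction normalizes to the same unit-length representative:

```python
import numpy as np

d = np.array([1.0, 2.0, 3.0])        # some ray direction (made-up example)
for lam in (0.5, 1.0, 4.0):
    r = lam * d                       # equivalent rays: positive multiples of d
    print(r / np.linalg.norm(r))      # each prints the same unit vector
```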
Proposition 10.2.
Let \(C\subseteq \mathbb{R}^n\) be given by
\(\{\mathbf{x} \in \mathbb{R}^n: \mathbf{A}\mathbf{x} \geq \mathbf{0}\}\)
for some \(\mathbf{A} \in \mathbb{R}^{m \times n}\).
Let \(\mathbf{d} \in C\) be nonzero.
Let \(\mathbf{A}^= \mathbf{x} \geq \mathbf{0}\) denote
the subsystem of \(\mathbf{A}\mathbf{x} \geq \mathbf{0}\) consisting
of all the inequalities that are active (that is, satisfied with equality) at \(\mathbf{d}\).
Then \(\mathbf{d}\) is an extreme ray of \(C\) if and only if
\(\operatorname{rank}(\mathbf{A}^=) = n-1\).
Proof.
Suppose first that \(\mathbf{d}\) is not an extreme ray; that is,
\(\mathbf{d} = \lambda \mathbf{u} + \gamma\mathbf{v}\)
for some linearly independent \(\mathbf{u},\mathbf{v}\in C\)
and scalars \(\lambda,\gamma \gt 0\).
Then
\[\mathbf{0} = \mathbf{A}^=\mathbf{d}
= \lambda \mathbf{A}^=\mathbf{u} + \gamma\mathbf{A}^=\mathbf{v}
\geq \mathbf{0}.\]
Hence, equality holds throughout.
Since \(\lambda, \gamma \gt 0\), we must have
\(\mathbf{A}^=\mathbf{u} = \mathbf{A}^=\mathbf{v} = \mathbf{0}\).
Hence, \(\mathbf{u}\) and \(\mathbf{v}\) are linearly independent
vectors in the nullspace of \(\mathbf{A}^=\), implying that
\(\operatorname{rank}(\mathbf{A}^=) \lt n-1\).
Now suppose that \(\operatorname{rank}(\mathbf{A}^=) \lt n-1\).
There exists a nonzero vector \(\mathbf{y}\) in the nullspace of
\(\mathbf{A}^=\) such that \(\mathbf{d}\) and \(\mathbf{y}\) are
linearly independent. For a sufficiently small \(\epsilon \gt 0\),
\(\mathbf{d} \pm \epsilon\mathbf{y} \in C\): the inequalities in
\(\mathbf{A}^=\mathbf{x} \geq \mathbf{0}\) remain satisfied with equality since
\(\mathbf{A}^=\mathbf{y} = \mathbf{0}\), while the remaining inequalities
hold strictly at \(\mathbf{d}\) and are therefore preserved for small \(\epsilon\).
Note that \(\mathbf{d} - \epsilon\mathbf{y}\) and
\(\mathbf{d} + \epsilon\mathbf{y}\) are linearly independent and
\(\mathbf{d} = \tfrac{1}{2}\left(\mathbf{d} - \epsilon\mathbf{y}\right)
+\tfrac{1}{2}\left(\mathbf{d} + \epsilon\mathbf{y}\right)\), implying that
\(\mathbf{d}\) is not an extreme ray.
Finally, since \(\mathbf{d}\) is a nonzero vector in the nullspace of
\(\mathbf{A}^=\), we always have \(\operatorname{rank}(\mathbf{A}^=) \leq n-1\);
combining this with the two implications above establishes the equivalence.
Example.
Consider
\(C = \left\{ \mathbf{x} \in \mathbb{R}^3 : \mathbf{A}\mathbf{x} \geq \mathbf{0}
\right\}\) where
\(\mathbf{A} = \begin{bmatrix}
3 & 0 & -1 \\
1 & -2 & 1 \\
-1 & 2 & -1 \\
\end{bmatrix}\).
Then \(\mathbf{d} = \begin{bmatrix} 1 \\ 2 \\ 3 \end{bmatrix}\)
is an extreme ray since \(\mathbf{A}^= = \mathbf{A}\)
and \(\operatorname{rank}(\mathbf{A}^=) = 2\).
Incidentally, we did not need all three rows: the first and third rows of
\(\mathbf{A}\) are already linearly independent, which is enough to conclude that
\(\operatorname{rank}(\mathbf{A}^=) = 2\).
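The rank condition of Proposition 10.2 is also easy to check numerically. Below is a minimal sketch (Python with NumPy; the helper name is_extreme_ray and the tolerance are our own choices, not part of these notes) that verifies the example above:

```python
import numpy as np

def is_extreme_ray(A, d, tol=1e-9):
    """Test the rank condition of Proposition 10.2 for a nonzero d with A d >= 0."""
    A = np.asarray(A, dtype=float)
    d = np.asarray(d, dtype=float)
    active = A[np.abs(A @ d) <= tol]              # rows active (tight) at d
    n = A.shape[1]
    return np.linalg.matrix_rank(active, tol=tol) == n - 1

A = np.array([[ 3,  0, -1],
              [ 1, -2,  1],
              [-1,  2, -1]])
d = np.array([1, 2, 3])
print(is_extreme_ray(A, d))   # True: all three rows are active and their rank is 2 = n - 1
```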
Corollary 10.3.
The number of nonequivalent extreme rays of a polyhedral cone is finite.
Proof.
By Proposition 10.2, every extreme ray is a nonzero vector in the nullspace of a
subsystem \(\mathbf{A}^=\mathbf{x} \geq \mathbf{0}\) of rank \(n-1\), and such a
nullspace is one-dimensional, so any two extreme rays sharing the same subsystem
are equivalent. Since there are only finitely many subsystems of a finite system
of linear inequalities, there can only be a finite number of nonequivalent
extreme rays.
We say that \(\mathbf{d}^1,\ldots,\mathbf{d}^k\) form a
complete set of extreme rays of \(C\) if
\(\mathbf{d}^i\) is an extreme ray of \(C\) for
\(i = 1,\ldots,k\) and every extreme ray of \(C\) is equivalent to
\(\mathbf{d}^i\) for some \(i \in \{1,\ldots,k\}\).
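Proposition 10.2 and Corollary 10.3 suggest a brute-force way to compute a complete set of extreme rays of \(C = \{\mathbf{x} : \mathbf{A}\mathbf{x} \geq \mathbf{0}\}\): scan every subsystem of \(n-1\) inequalities whose coefficient rows have rank \(n-1\), and keep the directions of its one-dimensional nullspace that lie in \(C\). The following rough sketch (Python with NumPy; the function names, the SVD-based nullspace computation, and the tolerances are our own choices) implements this idea:

```python
import itertools
import numpy as np

def nullspace(M, tol=1e-9):
    """Orthonormal basis of the nullspace of M, computed from the SVD."""
    _, s, vh = np.linalg.svd(M)
    rank = int(np.sum(s > tol))
    return vh[rank:].T

def complete_extreme_rays(A, tol=1e-9):
    """One unit-length representative per extreme ray of {x : A x >= 0} (a sketch)."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    rays = []
    for rows in itertools.combinations(range(m), n - 1):
        N = nullspace(A[list(rows)], tol)
        if N.shape[1] != 1:                      # the chosen rows must have rank n - 1
            continue
        for d in (N[:, 0], -N[:, 0]):            # try both directions along the nullspace line
            if np.all(A @ d >= -tol):            # keep a direction only if it lies in C
                d = d / np.linalg.norm(d)
                if not any(np.allclose(d, r, atol=1e-7) for r in rays):
                    rays.append(d)               # record one representative per class
    return rays
```

Any direction kept this way makes the chosen rows active, so the full active subsystem at it has rank exactly \(n-1\), and Proposition 10.2 guarantees that it is indeed an extreme ray.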
Worked examples
Let \(C = \left \{ \begin{bmatrix}
x_1 \\ x_2 \end{bmatrix} \in \mathbb{R}^2:
\begin{array}{r}
x_1 + 2x_2 \geq 0 \\
-3x_1 + x_2 \geq 0
\end{array}
\right \}\).
Find all extreme rays of \(C\) of unit length.
Solving this problem graphically is certainly an option, but we can also apply
Proposition 10.2 directly. Since \(C \subseteq \mathbb{R}^2\), an extreme ray must
have an active subsystem of rank \(n - 1 = 1\), so we only need to consider
subsystems consisting of a single inequality.
Setting the inequality \(x_1 + 2x_2\geq 0\) to equality, we
obtain the general solution \(\lambda \begin{bmatrix} -2 \\ 1\end{bmatrix}\).
Since \(\begin{bmatrix} -2 \\ 1\end{bmatrix}\) also satisfies the other
inequality, normalizing it to a unit vector gives the extreme ray
\(\begin{bmatrix} -\frac{2}{\sqrt{5}} \\ \frac{1}{\sqrt{5}}\end{bmatrix}\).
Setting the inequality \(-3x_1 + x_2\geq 0\) to equality, we
obtain the general solution \(\lambda \begin{bmatrix} 1\\ 3\end{bmatrix}\).
Since \(\begin{bmatrix} 1 \\ 3\end{bmatrix}\) also satisfies the other
inequality, normalizing it to a unit vector gives the extreme ray
\(\begin{bmatrix} \frac{1}{\sqrt{10}} \\ \frac{3}{\sqrt{10}}\end{bmatrix}\).
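Continuing the sketch from earlier (the helper complete_extreme_rays is our own, not part of these notes), the same answer comes out numerically:

```python
A = np.array([[ 1, 2],
              [-3, 1]])
for r in complete_extreme_rays(A):
    print(r)
# Expected output, up to floating-point rounding:
#   [-0.894...  0.447...]   i.e. (-2, 1) / sqrt(5)
#   [ 0.316...  0.948...]   i.e. (1, 3)  / sqrt(10)
```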
Let \(\mathbf{d}^1,\ldots,\mathbf{d}^k \in \mathbb{R}^n\) be such that
\(C = \operatorname{cone}\left(\left\{ \mathbf{d}^1,\ldots,\mathbf{d}^k
\right\}\right)\) is pointed.
Prove that if for some \(\bar{i} \in \{1,\ldots,k\}\),
\(\mathbf{d}^{\bar{i}}\) is not an extreme ray of \(C\), then
\(C = \operatorname{cone}\left(\left\{
\mathbf{d}^i : i \in \{1,\ldots,k\}\setminus\{\bar{i}\}
\right\}\right)\).
Without loss of generality, we may assume that \(\bar{i} = k\).
Since \(\mathbf{d}^k\) is not an extreme ray,
there exist linearly independent \(\mathbf{u},\mathbf{v} \in C\)
and scalars \(\lambda, \gamma \gt 0\) such that
\[\mathbf{d}^k = \lambda\mathbf{u} + \gamma\mathbf{v}.\]
Let \(\alpha_1,\ldots,\alpha_{k} \geq 0\) be scalars
such that \(\mathbf{u} = \sum_{i=1}^k \alpha_i \mathbf{d}^i\)
and let \(\beta_1,\ldots,\beta_{k}\geq 0\) be scalars
such that \(\mathbf{v} = \sum_{i=1}^k \beta_i \mathbf{d}^i\).
Then,
\[(1-\lambda\alpha_k-\gamma\beta_k)\mathbf{d}^k =
\sum_{i=1}^{k-1} (\lambda\alpha_i + \gamma\beta_i)\mathbf{d}^i.\]
If
\(1-\lambda\alpha_k-\gamma\beta_k \gt 0\), then
\[\mathbf{d}^k =
\sum_{i=1}^{k-1} \frac{\lambda\alpha_i + \gamma\beta_i}
{1-\lambda\alpha_k-\gamma\beta_k} \mathbf{d}^i,\]
implying that \(\mathbf{d}^k \in \operatorname{cone}\left(\left\{
\mathbf{d}^1,\ldots,\mathbf{d}^{k-1} \right\}\right)\). It follows
that \(C = \operatorname{cone}\left(\left\{
\mathbf{d}^1,\ldots,\mathbf{d}^{k-1} \right\}\right)\).
Suppose that \(1-\lambda\alpha_k-\gamma\beta_k \leq 0\).
Then
\[\sum_{i=1}^k \zeta_i \mathbf{d}^i = \mathbf{0}\]
where \(\zeta_i = \lambda\alpha_i + \gamma\beta_i\) for \(i = 1,\ldots, k-1\),
and \(\zeta_k = \lambda\alpha_k + \gamma\beta_k - 1\).
The important point is that \(\zeta_i \geq 0\) for \(i = 1,\ldots,k\).
Note that we cannot have \(\zeta_i = 0\) for all \(i =1,\ldots,k-1\):
otherwise, since \(\lambda, \gamma \gt 0\) and the \(\alpha_i, \beta_i\) are
nonnegative, we would have \(\alpha_i = \beta_i = 0\) for \(i = 1,\ldots,k-1\),
making \(\mathbf{u}\) and \(\mathbf{v}\) both scalar multiples of
\(\mathbf{d}^k\) and contradicting that they are linearly independent.
Without loss of generality, we may assume that \(\zeta_1 \gt 0\).
Thus
\[ -\zeta_1\mathbf{d}^1 = \sum_{i=2}^k \zeta_i \mathbf{d}^i.\]
Dividing both sides by \(\zeta_1\) gives
\[ -\mathbf{d}^1 = \sum_{i=2}^k \frac{\zeta_i}{\zeta_1} \mathbf{d}^i,\]
implying that \(-\mathbf{d}^1 \in C\).
Since we also have \(\mathbf{d}^1 \in C\), it follows that
\(C\) contains the line \(\{\lambda \mathbf{d}^1 : \lambda \in \mathbb{R}\}\).
By Theorem 8.6, \(C\) cannot be pointed, which is a contradiction.
Hence, we must have \(1-\lambda\alpha_k-\gamma\beta_k \gt 0\), and the result follows.
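As a small sanity check of this fact (an illustration of our own, not part of the notes): in \(\mathbb{R}^2\), the cone generated by \(\mathbf{d}^1 = (1,0)\), \(\mathbf{d}^2 = (0,1)\), and \(\mathbf{d}^3 = (1,1)\) is the nonnegative orthant \(\{\mathbf{x} : \mathbf{I}\mathbf{x} \geq \mathbf{0}\}\). The generator \(\mathbf{d}^3 = \mathbf{d}^1 + \mathbf{d}^2\) is not an extreme ray, and dropping it leaves the cone unchanged. The is_extreme_ray sketch from earlier agrees:

```python
I2 = np.eye(2)                                  # the nonnegative orthant {x : I x >= 0}
print(is_extreme_ray(I2, np.array([1, 0])))     # True:  one active row, rank 1 = n - 1
print(is_extreme_ray(I2, np.array([0, 1])))     # True
print(is_extreme_ray(I2, np.array([1, 1])))     # False: no rows are active at (1, 1)
```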