# Graphs and Markov chains

## Learning Objectives

• Express a graph as a sparse matrix.
• Identify the performance benefits of a sparse matrix.

## Graphs

#### Undirected Graphs:

The following is an example of an undirected graph:

The adjacency matrix, $${\bf A}$$, for undirected graphs is always symmetric and is defined as:

$a_{ij} = \begin{cases} 1 \quad \mathrm{if} \ \mathrm{node}_i \leftrightarrow \mathrm{node}_j \\ 0 \quad \mathrm{otherwise} \end{cases},$

where $$a_{ij}$$ is the $$(i,j)$$ element of $${\bf A}$$. The adjacency matrix which describes the example graph above is:

${\bf A} = \begin{bmatrix} 0 & 1 & 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 & 1 & 0 \\ 1 & 0 & 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 1 \end{bmatrix}.$
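Tying back to the learning objectives, an adjacency matrix is mostly zeros, so it is natural to store it in a sparse format. Below is a minimal sketch using SciPy, assuming 0-indexed nodes; the edge list is read off the matrix above:

```python
import scipy.sparse as sparse

# Undirected edges read off the matrix above (0-indexed nodes).
# Nodes 1 and 5 have self-loops, matching a_{22} = a_{66} = 1.
edges = [(0, 1), (0, 2), (0, 3), (1, 1), (1, 4), (2, 3), (3, 4), (3, 5), (5, 5)]

rows, cols, vals = [], [], []
for i, j in edges:
    rows.append(i); cols.append(j); vals.append(1)
    if i != j:  # mirror off-diagonal entries so A stays symmetric
        rows.append(j); cols.append(i); vals.append(1)

A = sparse.coo_matrix((vals, (rows, cols)), shape=(6, 6)).tocsr()
print(A.toarray())  # reproduces the dense matrix shown above
```

The CSR format stores only the 16 nonzero entries instead of all 36, which is where the memory and matrix-vector-product savings come from on large graphs.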

#### Directed Graphs:

The following is an example of a directed graph:

The adjacency matrix, $${\bf A}$$, for directed graphs is defined as:

$a_{ij} = \begin{cases} 1 \quad \mathrm{if} \ \mathrm{node}_i \leftarrow \mathrm{node}_j \\ 0 \quad \mathrm{otherwise} \end{cases},$

where $$a_{ij}$$ is the $$(i,j)$$ element of $${\bf A}$$. The adjacency matrix which describes the example graph above is:

${\bf A} = \begin{bmatrix} 0 & 0 & 0 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 1 \end{bmatrix}.$
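The $$\mathrm{node}_i \leftarrow \mathrm{node}_j$$ convention is easy to get backwards: an edge from node $$j$$ to node $$i$$ sets row $$i$$, column $$j$$. A short sketch under the same assumptions as above (SciPy, 0-indexed nodes):

```python
import scipy.sparse as sparse

# Directed edges (source, target), 0-indexed, read off the example above.
edges = [(3, 0), (0, 1), (1, 1), (0, 2), (2, 3), (4, 3), (1, 4), (3, 5), (5, 5)]

# An edge j -> i contributes a_ij = 1: row i (target), column j (source).
rows = [i for (j, i) in edges]
cols = [j for (j, i) in edges]
A = sparse.coo_matrix(([1] * len(edges), (rows, cols)), shape=(6, 6)).tocsr()
print(A.toarray())  # reproduces the dense matrix shown above
```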

#### Weighted Directed Graphs:

The following is an example of a weighted directed graph:

The adjacency matrix, $${\bf A}$$, for weighted directed graphs is defined as:

$a_{ij} = \begin{cases} w_{ij} \quad \mathrm{if} \ \mathrm{node}_i \leftarrow \mathrm{node}_j \\ 0 \quad \ \ \mathrm{otherwise} \end{cases},$

where $$a_{ij}$$ is the $$(i,j)$$ element of $${\bf A}$$, and $$w_{ij}$$ is the weight of the edge from node $$j$$ to node $$i$$. The adjacency matrix which describes the example graph above is:

${\bf A} = \begin{bmatrix} 0 & 0 & 0 & 0.4 & 0 & 0 \\ 0.1 & 0.5 & 0 & 0 & 0 & 0 \\ 0.9 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1.0 & 0 & 1.0 & 0 \\ 0 & 0.5 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0.6 & 0 & 1.0 \end{bmatrix}.$

Typically, when we discuss weighted directed graphs, it is in the context of transition matrices for Markov chains, where the link weights in each column sum to $$1$$.
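This column-sum property can be checked numerically for the weighted matrix above; a quick sketch with NumPy:

```python
import numpy as np

A = np.array([[0.0, 0.0, 0.0, 0.4, 0.0, 0.0],
              [0.1, 0.5, 0.0, 0.0, 0.0, 0.0],
              [0.9, 0.0, 0.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, 0.0, 1.0, 0.0],
              [0.0, 0.5, 0.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 0.0, 0.6, 0.0, 1.0]])

# Column j holds the outgoing weights of node j, which form a
# probability distribution for a transition matrix.
print(A.sum(axis=0))  # [1. 1. 1. 1. 1. 1.]
```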

## Markov Chain

A Markov chain is a stochastic model where the probability of the future (next) state depends only on the most recent (current) state. This memoryless property of a stochastic process is called the Markov property. From a probability perspective, the Markov property implies that the conditional probability distribution of the future state (conditioned on both past and current states) depends only on the current state.

## Markov Matrix

A Markov matrix (also called a transition or stochastic matrix) is a square matrix used to describe the transitions of a Markov chain. Each of its entries is a non-negative real number representing a probability. By the Markov property, the next state vector $${\bf x}_{k+1}$$ is obtained by multiplying the current state vector $${\bf x}_k$$ on the left by the Markov matrix $${\bf M}$$:

${\bf x}_{k+1} = {\bf M} {\bf x}_k$

In this course, unless specifically stated otherwise, we define the transition matrix $${\bf M}$$ as a left Markov matrix where each column sums to $$1$$.

Note: Alternative definitions in outside resources may present $${\bf M}$$ as a right Markov matrix, where each row of $${\bf M}$$ sums to $$1$$ and the next state is obtained by right-multiplying by $${\bf M}$$, i.e. $${\bf x}_{k+1}^T = {\bf x}_k^T {\bf M}$$.
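As a concrete illustration of the two conventions, here is a sketch with a hypothetical 2-state left Markov matrix (columns sum to $$1$$):

```python
import numpy as np

M = np.array([[0.5, 0.3],
              [0.5, 0.7]])  # hypothetical 2-state left Markov matrix

x0 = np.array([1.0, 0.0])   # start in state 0 with probability 1
x1 = M @ x0                 # left convention: x_{k+1} = M x_k
print(x1)                   # [0.5 0.5]

# The right convention uses M.T (whose rows sum to 1) and row vectors:
# x_{k+1}^T = x_k^T (M^T) gives the same result.
print(x0 @ M.T)             # [0.5 0.5]
```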

A steady state vector $${\bf x}^*$$ is a probability vector (its entries are non-negative and sum to $$1$$) that is unchanged by multiplication with the Markov matrix $${\bf M}$$, i.e.

${\bf M} {\bf x}^* = {\bf x}^*$

Therefore, the steady state vector $${\bf x}^*$$ is an eigenvector of $${\bf M}$$ corresponding to the eigenvalue $$\lambda=1$$. If there is more than one linearly independent eigenvector with $$\lambda=1$$, then any convex combination of the corresponding steady state vectors is also a steady state vector. Hence the steady state of a Markov chain may not be unique and may depend on the initial state vector.
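One way to find a steady state numerically is through an eigendecomposition; a minimal sketch, reusing the hypothetical 2-state matrix from above:

```python
import numpy as np

M = np.array([[0.5, 0.3],
              [0.5, 0.7]])

# The steady state is the eigenvector for lambda = 1, rescaled so its
# entries sum to 1 (eigenvectors are only defined up to a scale factor).
w, v = np.linalg.eig(M)
k = np.argmin(np.abs(w - 1.0))   # index of the eigenvalue closest to 1
x_star = v[:, k].real
x_star = x_star / x_star.sum()   # normalize into a probability vector
print(x_star)                    # [0.375 0.625]
print(np.allclose(M @ x_star, x_star))  # True
```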

## Markov Chain Example

Suppose we want to build a Markov chain model for predicting the weather at UIUC during the summer. We observed that:

• a sunny day is $$60\%$$ likely to be followed by another sunny day, $$10\%$$ likely to be followed by a rainy day, and $$30\%$$ likely to be followed by a cloudy day;
• a rainy day is $$40\%$$ likely to be followed by another rainy day, $$20\%$$ likely to be followed by a sunny day, and $$40\%$$ likely to be followed by a cloudy day;
• a cloudy day is $$40\%$$ likely to be followed by another cloudy day, $$30\%$$ likely to be followed by a rainy day, and $$30\%$$ likely to be followed by a sunny day.

The state diagram is shown below:

The Markov matrix is

${\bf M} = \begin{bmatrix} 0.6 & 0.2 & 0.3 \\ 0.1 & 0.4 & 0.3 \\ 0.3 & 0.4 & 0.4 \end{bmatrix}.$

If the weather on day $$0$$ is known to be rainy, then

${\bf x}_0 = \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix};$

and we can determine the probability vector for day $$1$$ by

${\bf x}_1 = {\bf M} {\bf x}_0.$

The probability distribution for the weather on day $$n$$ is given by

${\bf x}_n = {\bf M}^{n} {\bf x}_0.$
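Iterating this update shows the weather distribution settling down; a short sketch (the limit $${\bf x}^* = [24/61,\ 15/61,\ 22/61]^T$$ can be verified by checking $${\bf M}{\bf x}^* = {\bf x}^*$$):

```python
import numpy as np

M = np.array([[0.6, 0.2, 0.3],
              [0.1, 0.4, 0.3],
              [0.3, 0.4, 0.4]])
x = np.array([0.0, 1.0, 0.0])  # day 0: rainy with probability 1

for day in range(1, 4):
    x = M @ x                  # x_{k+1} = M x_k
    print(day, x)              # day 1: [0.2 0.4 0.4], the "rainy" column of M

# Because every entry of M is positive, x_n converges to the unique
# steady state x* = [24/61, 15/61, 22/61] ~ [0.393, 0.246, 0.361]
# for any starting distribution x_0.
print(np.linalg.matrix_power(M, 100) @ np.array([0.0, 1.0, 0.0]))
```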