Lecture Notes for Simulation

27 April 2005 - Markov Chains


The idea is to unroll the equality a random variable at a time, working backwards from t to 1. The probability

P({ X1, ..., Xt } = { s1, ..., st })

is equivalent to the conditional probability

P({ X2, ..., Xt } = { s2, ..., st } | X1 = s1)

by the Markov property. The equality

P(intersection(A, B) | C) = P(A | P(B | intersection(A, C))

provides a tool to pick apart the conditional probability by defining

A = { X2, ..., Xt-1 } = { s2, ..., st-1 }

B = { Xt = st }

C = X1 = s1

intersection(A, B) represents the intersection of all Markov-chain instances that have st as their t-th element and all Markov-chain instances that have s2 through st-1 as their second through t-1th elements; that is, it's the Markov chain { X2, ..., Xt } = { s2, ..., st }.

The conditional probability can be rewritten as

P({ X2, ..., Xt } = { s2, ..., st } | X1 = s1) =
P({ X2, ..., Xt-1 } = { s2, ..., st-1 } | X1 = s1)
P(Xt = st | { X1, ..., Xt-1 } = { s1, ..., st-1 })

By the Markov property,

P(Xt = st | { X1, ..., Xt-1 } = { s1, ..., st-1 }) = P(Xt = st | Xt-1 } = st-1)

and P(Xt = st | Xt-1 = st-1) is equal to the single-step transition probability pst-1st. The conditional probability

P({ X2, ..., Xt } = { s2, ..., st } | X1 = s1)

can now be rewritten as

P({ X2, ..., Xt-1 } = { s2, ..., st-1 } | X1 = s1)pst-1st

and that's the first turn of the crank. Turning the crank t-2 more times reduces the conditional probability to

P(X1 = s1) ps1s2 ... pst-1st

P(X1 = s1) is given by the Markov chain's initial distribution.


This page last modified on 2 March 2005.