Question: Activity 4 . 5 . 6 . Consider the Internet with eight web pages, shown in Figure 4 . 5 . 8 . Figure 4
Activity Consider the Internet with eight web pages, shown in Figure
Figure A simple model of the Internet with eight web pages.
a Construct the Google matrix G for this Internet. Then use a Markov chain to find the steadystate PageRank vector mathbfx
def markovchainA x N:
for i in rangeN:
x Ax
print xnumericalapproxdigits
## define the matrix G and x
G
x
markovchainG x
b What does this vector tell us about the relative quality of the pages in this Internet? Which page has the highest quality and which the lowest? Figure A model of the Internet with five web pages.
What happens when you begin the Markov chain with the vector mathbfxleftbeginarraylendarrayright Explain why this behavior is consistent with the Perron
Frobenius theorem.
d What do you think the PageRank vector for this Internet should be Is any one page of a higher quality than another?
e Now consider the Internet with eight web pages, shown in Figure
Figure Another model of the Internet with eight web pages.
Notice that this version of the Internet is identical to the first one that we saw in this activity, except that a single link from page to page has been removed. We can therefore find its Google matrix G by slightly modifying the earlier matrix.
What is the longterm behavior of a Markov chain defined by G and why is this behavior not desirable? How is this behavior consistent with the PerronFrobenius theorem?
The PerronFrobenius theorem Theorem.. tells us that a Markov chain mathbfxkG mathbfxk converges to a unique steadystate vector when the matrix G is positive. This means that G or some power of G should have only positive entries. Clearly, this is not the case for the matrix formed from the Internet in Figure
We can understand the problem with the Internet shown in Figure. by adding a box around some of the pages as shown in Figure Here we see that the pages outside of the box give up all of their PageRank to the pages inside the box. This is not desirable because the PageRanks of the pages outside of the box are found to be zero. Once again, the Google matrix G is not a positive matrix.
Figure The pages outside the box give up all of their PageRank to the pages inside the box.
Google solves this problem by slightly modifying the Google matrix G to obtain a positive matrix Gprime To understand this, think of the entries in the Google matrix as giving the probability that an Internet user follows a link from one page of another. To create a positive matrix, we will allow that user to randomly jump to any other page on the Internet with a small probability.
To make sense of this, suppose that there are N pages on our internet. The matrix
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
