Question: Activity 4 . 5 . 6 . Consider the Internet with eight web pages, shown in Figure 4 . 5 . 8 . Figure 4

Activity 4.5.6. Consider the Internet with eight web pages, shown in Figure 4.5.8.
Figure 4.5.8. A simple model of the Internet with eight web pages.
a. Construct the Google matrix \( G \) for this Internet. Then use a Markov chain to find the steady-state PageRank vector \(\mathbf{x}\).
```
def markov_chain(A, x0, N):
for i in range(N):
x0= A*x0
print (x0.numerical_approx(digits=3))
## define the matrix G and x0
G =
x0=
markov_chain(G, x0,20)
```
b. What does this vector tell us about the relative quality of the pages in this Internet? Which page has the highest quality and which the lowest? Figure 4.5.9. A model of the Internet with five web pages.
What happens when you begin the Markov chain with the vector \(\mathbf{x}_{0}=\left[\begin{array}{l}1\\0\\0\\0\\0\end{array}\right]\)? Explain why this behavior is consistent with the Perron-
Frobenius theorem.
d. What do you think the PageRank vector for this Internet should be? Is any one page of a higher quality than another?
e. Now consider the Internet with eight web pages, shown in Figure.4.5.10.
Figure 4.5.10. Another model of the Internet with eight web pages.
Notice that this version of the Internet is identical to the first one that we saw in this activity, except that a single link from page 7 to page 1 has been removed. We can therefore find its Google matrix \( G \) by slightly modifying the earlier matrix.
What is the long-term behavior of a Markov chain defined by \( G \) and why is this behavior not desirable? How is this behavior consistent with the Perron-Frobenius theorem?
The Perron-Frobenius theorem Theorem...4.5.6 tells us that a Markov chain \(\mathbf{x}_{k+1}=G \mathbf{x}_{k}\) converges to a unique steady-state vector when the matrix \( G \) is positive. This means that \( G \) or some power of \( G \) should have only positive entries. Clearly, this is not the case for the matrix formed from the Internet in Figure.4.5.9.
We can understand the problem with the Internet shown in Figure. 4.5.10 by adding a box around some of the pages as shown in Figure 4.5.5.11. Here we see that the pages outside of the box give up all of their PageRank to the pages inside the box. This is not desirable because the PageRanks of the pages outside of the box are found to be zero. Once again, the Google matrix \( G \) is not a positive matrix.
Figure 4.5.11. The pages outside the box give up all of their PageRank to the pages inside the box.
Google solves this problem by slightly modifying the Google matrix \( G \) to obtain a positive matrix \( G^{\prime}\). To understand this, think of the entries in the Google matrix as giving the probability that an Internet user follows a link from one page of another. To create a positive matrix, we will allow that user to randomly jump to any other page on the Internet with a small probability.
To make sense of this, suppose that there are \( N \) pages on our internet. The matrix
Activity 4 . 5 . 6 . Consider the Internet with

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!