This will give you two different representations of the network: fr.n is a network object, while...
Fantastic news! We've Found the answer you've been seeking!
Question:
![image text in transcribed](https://s3.amazonaws.com/si.experts.images/answers/2024/05/6647efe7a487b_5036647efe77e468.jpg)
Transcribed Image Text:
This will give you two different representations of the network: fr.n is a network object, while fr is a 71 71 matrix where a 1 in cell (i, j) indicates lawyer i likes lawyer j (directed edge). (a) We would like to test whether popularity (indegree) is statistically different between different lawyers, in other words whether all indegree differences may be due to chance. We will do it in two ways: i. First, fit both a p statistical model with indegree, outdegree, mutuality, and one with outde- gree, mutuality only. Assuming the likelihood calculations are correct, perform a generalized likelihood ratio test (GLRT) between them and conclude whether the inclusion of the indegree is statistically useful. ii. Second, perform a permutation test using the matrix fr. You can choose how to do it, here's a suggestion: condition the analysis on the number of outgoing edges from each lawyer, and under the null these outgoing edges are randomly divided between the other 70 lawyers. Thus, each permuted matrix should: Preserve row sums Randomly shuffle columns for each row Avoid self-friending (no 1's on diagonal) Perform 104 permutations, or as many as you can. You will also have to choose a test statistic to represent how non-uniform the incoming edges distribution is, then calculate it once on the real matrix, and on each permuted matrix. The p-value is the percentage of permuted matrices that give a higher value than the original matrix. You can try more than one statistic. Justify your choice. Summarize your conclusion from both approaches: is popularity (in-degree) uniform or non- uniform for this data? Confirm your conclusion using the plot of the network or other relevant simple analysis. (b) The second task we would like to perform on this data is identify groups and structure by using a latent variable model. i. Fit at least four different models, with varying dimension, with/without a gaussian mixture structure with different numbers of components, etc. State your conclusions about: Number of clusters in the data Other structures and patterns you observe that are of interest Make sure to use both statistical and heuristic/graphical arguments to support your claims. ii. The objects fr.big, fr.big.n contain the same network, with 8 rows corresponding to lawyers with 0 ties (either outgoing or incoming) removed. Repeat the previous analysis on this data. Do your conclusions change? This will give you two different representations of the network: fr.n is a network object, while fr is a 71 71 matrix where a 1 in cell (i, j) indicates lawyer i likes lawyer j (directed edge). (a) We would like to test whether popularity (indegree) is statistically different between different lawyers, in other words whether all indegree differences may be due to chance. We will do it in two ways: i. First, fit both a p statistical model with indegree, outdegree, mutuality, and one with outde- gree, mutuality only. Assuming the likelihood calculations are correct, perform a generalized likelihood ratio test (GLRT) between them and conclude whether the inclusion of the indegree is statistically useful. ii. Second, perform a permutation test using the matrix fr. You can choose how to do it, here's a suggestion: condition the analysis on the number of outgoing edges from each lawyer, and under the null these outgoing edges are randomly divided between the other 70 lawyers. Thus, each permuted matrix should: Preserve row sums Randomly shuffle columns for each row Avoid self-friending (no 1's on diagonal) Perform 104 permutations, or as many as you can. You will also have to choose a test statistic to represent how non-uniform the incoming edges distribution is, then calculate it once on the real matrix, and on each permuted matrix. The p-value is the percentage of permuted matrices that give a higher value than the original matrix. You can try more than one statistic. Justify your choice. Summarize your conclusion from both approaches: is popularity (in-degree) uniform or non- uniform for this data? Confirm your conclusion using the plot of the network or other relevant simple analysis. (b) The second task we would like to perform on this data is identify groups and structure by using a latent variable model. i. Fit at least four different models, with varying dimension, with/without a gaussian mixture structure with different numbers of components, etc. State your conclusions about: Number of clusters in the data Other structures and patterns you observe that are of interest Make sure to use both statistical and heuristic/graphical arguments to support your claims. ii. The objects fr.big, fr.big.n contain the same network, with 8 rows corresponding to lawyers with 0 ties (either outgoing or incoming) removed. Repeat the previous analysis on this data. Do your conclusions change?
Expert Answer:
Posted Date:
Students also viewed these mathematics questions
-
Suppose that two linear equations are graphed on the same set of coordinate axes. Sketch what the graph might look like if the system has the given description. (a) The system has a single solution....
-
Maria Bell and J. R. Green are forming a partnership to which Bell will devote one- third time and Green will devote full time. They have discussed the following alternative plans for sharing income...
-
Hot water at an average temperature of 70C is flowing through a 15-m section of a cast iron pipe (k = 52 W/mK) whose inner and outer diameters are 4 cm and 4.6 cm, respectively. The outer surface of...
-
Which of the following workload models cannot be used with a simulation system model? a. Benchmark b. Instruction mix c. Synthetic job d. Probabilistic
-
The measured volumetric flow rate of ethane at 10.0 atm absolute and 35C is 1.00 x 10 3 L/h. Using an estimated value of the second virial coefficient in the truncated virial equation (Equation...
-
Questions Answer all of the questions below. (Marks: 100) Q.1 What is the difference between Coaxial cable and Fiber-optic cable? (10) Q.2 What are the most common types of software license...
-
1 1. Which of thefollowing is not one of the three types of specifications discussed in thetext? Answer Design specifications Material specifications Performance specifications...
-
The resistance of a conductor wire of length 26 meters is known to be R = 100ohm This wire is bent into a right trapezoid and made into a closed frame as shown in the figure. The frame is kept...
-
The solid V is the cube -1 x 1, -1 y 1, -1 < <1. The closed surface S is the boundary of V. The vector field F is defined by F(x, y, z) = yx2i+yj+zk i) State the Divergence Theorem. ii) Apply the...
-
Three forces act on a hook attached to the horizontal ground. The hook is at the origin of the x y plane. The x axis points horizontally to the right. The y axis points vertically upward. Force F...
-
You measure the velocity of falling object at several times given in the table below. Time (s) Velocity (m/s) 0.00 0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 1.00 The acceleration is the slope of...
-
41. 42. The following table lists data collected during a recent experiment. In this case, x represents age (in years) and y represents the given diameter (in inches). 3 5 7 11 13 y 5.73 6.44 7.01...
-
A company is considering the purchase of a piece of equipment to be used to manufac- ture a new product. Four machines are being considered. The following table summa- rizes the purchase cost of each...
-
The packaging division of a company having considered several alternative package designs for the company's new product has finally brought down their choices to two designs of which only one has to...
![Mobile App Logo](https://dsd5zvtm8ll6.cloudfront.net/includes/images/mobile/finalLogo.png)
Study smarter with the SolutionInn App