Question: (5 points) Consider a 2 attribute dataset with 2 classes such that attribute 21, 22 are taken from the set V E 10, 20, 30,

(5 points) Consider a 2 attribute dataset with 2 classes such

(5 points) Consider a 2 attribute dataset with 2 classes such that attribute 21, 22 are taken from the set V E 10, 20, 30, 40,50. For each class, assume each attribute fol- lows a multinomial distribution such that all attribute values have non-zero probability (pli = vC) >0 Vi, v). In a pythen notebook, seed the random number generator With a value oft and immediately generate such a dataset with 60k instances (equally distributed among the two classes), picking values of px = v ). Plot the resulting data to visualize what attribute distributions you have created. One way (not neces- Sarily the best) is in 2-D with different color points for each class (you'll have a lot of points, so consider using the marker style :"). If you do this, then many points will lie on top of one another because the attribute values are chosen from a small set), so you'll want to spread them out by adding in some random jitter, and sample a subset of them so that they aren't packed too densely (also consider using an alpha i 1 to make the points transparent). Bestire just to do this for the visualization, or for the real data. Play with your the attribute value distributions until you feel that you can fusually" predict where one class should occur in the 2-D space. (5 points) Continuing from above, use your ook-stratified dataset to generate 20 smaller datasets of size 30,60,100,300,600,1000,3000 using non-overlapping slices of the full data set. Now, perform a set of experiments to estimate the values p(= vC) used to generate the sample. Show how the estimates improve as the partitions grow. For simplicity, just pick one class to present, either C. org. (5 points) Consider a 2 attribute dataset with 2 classes such that attribute 21, 22 are taken from the set V E 10, 20, 30, 40,50. For each class, assume each attribute fol- lows a multinomial distribution such that all attribute values have non-zero probability (pli = vC) >0 Vi, v). In a pythen notebook, seed the random number generator With a value oft and immediately generate such a dataset with 60k instances (equally distributed among the two classes), picking values of px = v ). Plot the resulting data to visualize what attribute distributions you have created. One way (not neces- Sarily the best) is in 2-D with different color points for each class (you'll have a lot of points, so consider using the marker style :"). If you do this, then many points will lie on top of one another because the attribute values are chosen from a small set), so you'll want to spread them out by adding in some random jitter, and sample a subset of them so that they aren't packed too densely (also consider using an alpha i 1 to make the points transparent). Bestire just to do this for the visualization, or for the real data. Play with your the attribute value distributions until you feel that you can fusually" predict where one class should occur in the 2-D space. (5 points) Continuing from above, use your ook-stratified dataset to generate 20 smaller datasets of size 30,60,100,300,600,1000,3000 using non-overlapping slices of the full data set. Now, perform a set of experiments to estimate the values p(= vC) used to generate the sample. Show how the estimates improve as the partitions grow. For simplicity, just pick one class to present, either C. org

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Requirements for PART I Do you see any potential areas of risk that the partners in the Briarwood City office should consider regarding RCC? In your response, consider business risk to JKN, en-...

Code the function greedy_predicator without using numpy/pandas Please include explanation of the code & the computational complexity To see the description of the function: Scroll down the...

Please help with add and remove methods Implement the AVL class (a subclass of BST) by completing the provided skeleton code in the file avl.py. Once completed, your implementation will include...

Ms. Excel Practice Assignment 5 1. Create a Ms. Excel file using the Project Timeline template and enter the following into the template. Change the dates per your judgement. Delete the extra tasks...

* PROBLEM #1 DOWN BELOW * ** PROBLEM #2 ARE THE EXERCISES CIRCLED DOWN BELOW ** Ms. Excel Practice Assignment 5 1. Create a Ms. Excel file using the Project Timeline template and enter the following...

Hello again, this is another class and I need your help again. I misunderstood the start date of the class so I'm a bit late for this first week so I'm sorry for having to ask you to finish some of...

Chrome File Edit View History Bookmarks Profiles Tab Window Help Q 8 . Tue Feb 8 5:52 PM Question 4 - Chapter 2 Homew X BUIS-475 Online Session - O X Course Hero X Bb Discussion Board - ECON351-! X...

You will submit a final report, written in Word (or similar word processing software), based on your findings and submissions from parts 1-4. It is highly suggested you not submit this paper without...

Answer Questions in Handout 2-5 Resource needed for Handout Questions Included Below: Handout 2-5 Please respond to the following questions. 1. Please read page 95. Look closely at the chart for...

Discuss the factors that would make the power plant cycle described in Problem 6.99 an irreversible cycle.

When a viscous fluid is confined between two long concentric cylinders as in Fig. 4.17, the torque per unit length T required to turn the inner cylinder at angular velocity is a function of ,...

Exercise 3.7 Lets look in more detail at division. We will use the octal numbers in the following table. A B a. 50 23 b. 25 44 3.7.1 [20] Using a table similar to that shown in Figure 3.11,...

Campbell Inc. produces and sells outdoor equipment. On July 1, 20Y1, Campbell issued $30,000,000 of 10-year, 10% bonds at a market (effective) interest rate of 9%, receiving cash of $31,951,110....

KEY QUESTION What is meant when economists say that the Federal Reserve Banks are central banks, quasi-public banks, and bankers banks? What are the seven basic functions of the Federal Reserve...

KEY QUESTION When a commercial bank makes loans, it creates money; when loans are repaid, money is destroyed. Explain.

KEY QUESTION How do economists distinguish between the absolute and relative sizes of the public debt? Why is the distinction important? Distinguish between refinancing the debt and retiring the...