You are building a ML model to predict whether houses for sale in a particular neighbour-...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
You are building a ML model to predict whether houses for sale in a particular neighbour- hood will be sold in the next month or not. You have a dataset of 10,000 houses, their sale prices, date of purchase and other relevant attributes. (a) What will be your training- test data set split? (b) You get 20,000 new samples as test data for your model evaluation. You notice that your model correctly predicts Yes 70% of the time and correctly predicts No, 30% of the time. If there are 18,000 actual YES samples in the dataset, compute the confusion matrix, Type I errors and Type II errors and accuracy. [3] (c) You tune hyper-parameters for your model and get the following ROC curve. Identify the parameter configuration (1, 2, 3 or 4) will you choose and justify the choice. [1] You are building a ML model to predict whether houses for sale in a particular neighbour- hood will be sold in the next month or not. You have a dataset of 10,000 houses, their sale prices, date of purchase and other relevant attributes. (a) What will be your training- test data set split? (b) You get 20,000 new samples as test data for your model evaluation. You notice that your model correctly predicts Yes 70% of the time and correctly predicts No, 30% of the time. If there are 18,000 actual YES samples in the dataset, compute the confusion matrix, Type I errors and Type II errors and accuracy. [3] (c) You tune hyper-parameters for your model and get the following ROC curve. Identify the parameter configuration (1, 2, 3 or 4) will you choose and justify the choice. [1]
Expert Answer:
Answer rating: 100% (QA)
a The proper preparation test informational collection split will rely upon the size of the dataset ... View the full answer
Related Book For
Statistics Unlocking The Power Of Data
ISBN: 9780470601877
1st Edition
Authors: Robin H. Lock, Patti Frazer Lock, Kari Lock Morgan, Eric F. Lock, Dennis F. Lock
Posted Date:
Students also viewed these general management questions
-
A mail-order catalog company has developed a classification model to predict whether a prospective customer will place an order if he or she receives a catalog in the mail. If the prospective...
-
The website www.domain.com.au lists houses for sale in Australia. The asking prices of established houses with 4 bedrooms, 2 bathrooms and 2 car spaces for sale in August 2009 in Bundaberg (central...
-
You are building a model to determine the best location for a single pharmaceutical plant to make a new drug to serve global demand. Why is modeling every country as a single demand point probably...
-
I t was Swiss hotelier Csar Ritz, founder of the Parisian Htel Ritz back in 1898, who coined the familiar phrase the customer is always right. For the modern hospitality manager who must constantly...
-
As a result of a temperature rise of 32 C o , a bar with a crack at its center buckles upward (Figure). If the fixed distance L 0 is 3.77 m and the coefficient of linear expansion of the bar is 25 x...
-
Corporation DS owns assets worth $550,000 and has $750,000 outstanding debts. One of DS's creditors just informed DS that it is writing off a $15,000 account receivable from DS because it believes...
-
How might aspects of hospitality, tourism and events overlap or work together?
-
Purchasing Department cost drivers, activity-based costing, simple regression analysis. Fashion Flair operates a chain of 10 retail department stores. Each department store makes its own purchasing...
-
Green Rock Corporation had the following transaction: On: Oct .31: It declared cash dividend of $72,500 Dec. 15: It set date of record Jan. 25: Will pay dividend. Make the journal entries to record...
-
At the beginning of 2022, Donna Harp was employed as a cinematographer by Farah Movie, Inc., a motion picture company in Los Angeles, California. In June, she accepted a new job with Ocala Production...
-
1. Consider a linear system as follows: 0 x+ ---- -3 -5 y = [1 0]x X = a. Design a controller u=-Kx to yield a 10% overshoot and a settling time of 0.5 second, i.e., s+16s+183.1-0. (5%) b. Evaluate...
-
What are the main limitations of the EOQ model?
-
What steps can be taken to minimize bad debts?
-
During 2010, the City of Coyotes General Fund received \($10,000,\) which was recorded as a general revenue when it was actually a program revenue earned by its park program. a. What was the correct...
-
Working capital should be managed to maximize profitability. Explain and comment.
-
How can a company operate with minimum levels of working capital?
-
our atte P8-21 (similar to) Betas Answer the questions below for assets A to D shown in the table: EFF a. What impact would a 13% increase in the market return be expected to have on each asset's...
-
What are three disadvantages of using the direct write-off method?
-
Null and alternative hypotheses for a test are given. Give the notation (x, for example) for a sample statistic we might record for each simulated sample to create the randomization distribution. H 0...
-
Mean 5 and standard deviation 0.5. Sketch a curve showing a distribution that is symmetric and bell-shaped and has approximately the given mean and standard deviation. In each case, draw the curve on...
-
Over 30,000 people participated in an online poll on cnn.com asking Have you ever driven with a pet on your lap? The results show that 34% answered yes and 66% answered no. Can we conclude that 34%...
-
Assume that United Technologies Corporation is evaluating a proposal to change the companys manual design system to a computer-aided design (CAD) system. The proposed system is expected to save...
-
In 2008 the City of Chicago agreed to lease 35,000 parking meters to a Morgan Stanley-led partnership Morgan Stanley for a one-time sum of \($1.15\) billion. The lease has been criticized as an...
-
Pure White Automatic Laundry must either have a complete overhaul of its current dry-cleaning system or purchase a new one. Its cost of capital is 18 percent. Pure Whites accountant has developed the...
Computer Aided Verification 13th International Conference 1st Edition - ISBN: 3540423451 - Free Book
Study smarter with the SolutionInn App