1. Why would I choose to use an Extra Trees model vs. a Random Forest model? Select the best answer.
Because it is easier to explain.
Because it doesn't add noise by resampling with replacement.
Because it improves accuracy.
Because it reduces variance and works faster.
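The practical difference between the two ensembles shows up directly in scikit-learn's defaults (a minimal sketch, assuming scikit-learn is installed; the dataset is synthetic):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier

X, y = make_classification(n_samples=200, n_features=8, random_state=0)

# Random Forest: each tree sees a bootstrap resample (sampling with
# replacement) and searches for the best threshold within a random
# subset of features.
rf = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

# Extra Trees: no bootstrap by default (each tree sees the full sample),
# and split thresholds are drawn at random, trading a little bias for
# lower variance and faster training.
et = ExtraTreesClassifier(n_estimators=50, random_state=0).fit(X, y)

print(rf.get_params()["bootstrap"], et.get_params()["bootstrap"])  # True False
```

The `bootstrap` defaults (True for Random Forest, False for Extra Trees) are exactly why Extra Trees avoids the extra noise introduced by resampling with replacement.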
2. I am a real estate agent building a support vector regressor to determine the price of a house I'm listing. Why should I use this model vs. an OLS model? Select the best answer.
SVR models only consider errors falling outside of a given range, creating some error tolerance.
SVR models work better for problems with high variability, and offer a more precise prediction.
SVR models work well with non-linear data.
SVR models can produce multiple outputs.
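The "error tolerance" idea can be seen in SVR's `epsilon` parameter: residuals inside the epsilon tube contribute no loss, whereas OLS penalizes every residual. A minimal sketch on synthetic data (assumes scikit-learn; the linear relationship and noise level are made up for illustration):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = 3.0 * X.ravel() + rng.normal(0, 1, size=100)  # linear trend + noise

# epsilon=1.0 means residuals smaller than 1.0 are ignored entirely.
svr = SVR(kernel="linear", epsilon=1.0).fit(X, y)
ols = LinearRegression().fit(X, y)

# Only the points falling outside the epsilon tube become support vectors;
# OLS, by contrast, uses every observation.
print(len(svr.support_), "support vectors out of", len(X))
```

With a nonlinear kernel (e.g. `kernel="rbf"`), the same model also handles non-linear data, which is the other distractor in this question.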
3. When would the choice between Gradient Boosting and XGBoost not matter?
When scalability is not a concern
When explainability is not a concern
When working with only continuous data points
When the accuracy score from both models is the same
4. In which of the following scenarios would you expect K-means models to struggle?
When data consists of anisotropic clusters
When data consists of circular clusters
When data is noisy
When data consists of cylindrical clusters
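K-means assumes roughly isotropic, equal-variance clusters, so stretching spherical blobs into elongated (anisotropic) shapes degrades it. A sketch of that failure mode (assumes scikit-learn and NumPy; the shear matrix is arbitrary):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_rand_score

X, y = make_blobs(n_samples=300, centers=3, random_state=170)
X_aniso = X @ np.array([[0.6, -0.6], [-0.4, 0.8]])  # shear the blobs

labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X_aniso)

# Agreement with the true labels drops once the clusters are elongated.
print(f"ARI on anisotropic data: {adjusted_rand_score(y, labels):.2f}")
```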
5. Which of the following is not a strength of hierarchical clustering?
Hierarchical clustering takes a bottom-up approach to establish clusters and can work well with non-linearly separable data
Hierarchical clustering works well with clearly defined and separable clusters
Hierarchical clustering works best with clusters of varied spreads
6. Which of the following business metrics are useful to assess the effectiveness of recommender systems?
Accuracy Score
Customer Lifetime Value
Mean Squared Error
Product Uptake
7. How does a Random Forest model compute feature importance?
By calculating the average value of Gini impurity within each split.
By calculating a weighted impurity for each node, for each feature.
By calculating the weighted impurity for each tree, for each node.
By calculating the average number of splits until nodes are pure.
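In scikit-learn this is exposed as `feature_importances_`: the impurity decrease at each split, weighted by the fraction of samples reaching that node, averaged over all trees and normalized to sum to 1. A minimal sketch on the Iris dataset (assumes scikit-learn):

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

data = load_iris()
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(data.data, data.target)

# Mean decrease in impurity per feature, normalized to sum to 1.
for name, imp in zip(data.feature_names, model.feature_importances_):
    print(f"{name}: {imp:.3f}")
```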
8. Which of the following is a strength for stacked ensembles?
Since models are diverse, the strengths from each approach can lead to a better prediction
They process data in parallel through many models for a faster output
They give us the ability to combine classifiers and regressors in a single process
Since models are usually simplistic, the outputs are explainable and easy to compute
9. In which scenario below is soft voting better than hard voting? Select the best answer.
Predicting whether to mine in a particular geographical region
Predicting whether a credit card transaction is fraudulent
Predicting which product to recommend to a customer
Predicting whether someone has a disease
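A toy illustration of why soft voting matters in a high-stakes case like disease screening: averaging probabilities preserves each model's confidence, while hard voting throws it away (the three probabilities below are hypothetical):

```python
# P(disease) from three hypothetical classifiers.
probs = [0.45, 0.45, 0.90]

# Hard voting: threshold each model first, then take the majority.
hard_votes = [p >= 0.5 for p in probs]            # [False, False, True]
hard_decision = sum(hard_votes) > len(probs) / 2  # majority says "no disease"

# Soft voting: average the probabilities, then threshold once.
soft_decision = sum(probs) / len(probs) >= 0.5    # mean 0.60 says "disease"

print(hard_decision, soft_decision)  # False True
```

The two near-miss models (0.45 each) pull the average up, so the one highly confident model is not outvoted.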
10. Collaborative filtering can use cosine similarity to find similar customers.
True
False
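Cosine similarity between two customers' rating vectors is just the dot product divided by the product of their norms; a self-contained sketch (the two rating vectors are made up, with 0 meaning "not rated"):

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical item-rating vectors for two customers.
alice = [5, 3, 0, 1]
bob   = [4, 2, 0, 1]

print(round(cosine_similarity(alice, bob), 3))  # 0.996
```

A value near 1 means the two customers rate items in nearly the same proportions, so Bob's ratings can inform recommendations for Alice.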
11. Ensemble learning is used because ________. Select the best answer.
It reduces computational inefficiencies.
It provides a better accuracy score.
It is better than other models for highly complex problems.
It reduces overfitting by layering many models together.
