Question: I am having a hard time understanding which variables are continuous that best predict num_people. I have added the question 1 below. event_ID: used to

I am having a hard time understanding which variables are continuous that best predict num_people. I have added the question 1 below.

  • event_ID: used to label each event
  • data_type: the type of data breached
  • num_people (in millions): the number of people impacted by a data breach, expressed in millions
  • num_people_v2: coded version of the variable num_people
  • num_records (in millions): the number of records breached, expressed in millions
  • per_sensitive: percent of sensitive data breached
  • per_sensitive_v2: coded version of the variable per_sensitive
  • dys_impact: the length of the negative financial impact from the data breach
  • dys_detect: the number of days it takes to detect the breach
  • cost_controls (in millions): the amount of money spent on security controls, expressed in millions

Question 1. Use the best subsets approach to determine which variable(s) would best predict num_people. Please be sure to only use the continuous variables in your data to predict your outcome and exclude event_ID. The variables are listed above.

Remember: Continuous variable refers to entities that get a distinct score. It gives us a score for each entity and can take on any value on the measurement scale that we are using.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Mathematics Questions!