I am using RStudio to run an ANOVA test on a data set with 14 columns and 89350 rows. I am assessing the variation between WEATHER and FATALS (fatalities in traffic accidents), where FATALS is the dependent variable.

First I use the function lm() to fit a linear model of FATALS on WEATHER, as shown here:

weather.lm <- lm(formula = dat$FATALS ~ dat$WEATHER, data = dat)

Second, I use anova() to analyze the variation between the test variable (WEATHER) and the base variable (FATALS). I don't have a problem here.

(a <- anova(dui.mod2, dui.mod3))

Lastly, I use predict() to predict values of FATALS given values of WEATHER.

weather.new <- data.frame(WEATHER = c(1, 2, 3)) # dataframe of new WEATHER data

# Predict FATALS for the new WEATHER values

wet <- predict(object = weather.lm,    # the weather.lm regression model
               newdata = weather.new)  # data frame of new data

My question: I am giving predict() only 3 values of WEATHER, so I expect only 3 predicted values of FATALS. Instead, the function returns a prediction for each of the 89350 rows of my main table, and I notice that the predicted value is identical for every row that has the same value of WEATHER.

How do I predict one for one, i.e. I give 1 test value and get 1 prediction, or I give 2 and get 2, and so on?
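
One likely cause, offered as a sketch and not a definitive diagnosis: the formula in lm() is written with dat$FATALS and dat$WEATHER. The model then stores the term literally as dat$WEATHER, so predict() cannot match it to the WEATHER column in newdata and falls back to the original dat, giving one fitted value per training row (usually with a warning that 'newdata' had 3 rows but the variables found have many more). Writing the formula with bare column names and passing data = dat lets predict() look WEATHER up in newdata. The data below is synthetic and hypothetical (200 made-up rows standing in for the 89350-row table), and the name bad.lm is introduced only for illustration.

```r
# Synthetic stand-in for the asker's data (hypothetical values, illustration only)
set.seed(1)
dat <- data.frame(WEATHER = sample(1:3, 200, replace = TRUE))
dat$FATALS <- 1 + 0.1 * dat$WEATHER + rnorm(200, sd = 0.2)

weather.new <- data.frame(WEATHER = c(1, 2, 3))  # three new WEATHER values

# Pattern from the question: the model term is literally `dat$WEATHER`,
# so the WEATHER column of newdata is never used
bad.lm <- lm(dat$FATALS ~ dat$WEATHER, data = dat)
bad.pred <- suppressWarnings(predict(bad.lm, newdata = weather.new))
length(bad.pred)  # one value per row of dat, not per row of weather.new

# Bare column names: predict() can now find WEATHER in newdata
weather.lm <- lm(FATALS ~ WEATHER, data = dat)
wet <- predict(object = weather.lm, newdata = weather.new)
length(wet)       # one prediction per row of weather.new
```

Seeing the same prediction for every row that shares a WEATHER value is expected in either version, because WEATHER is the only predictor in the model.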
