Question: Help with coding in R: cyl

Help with coding in R:

cyl<-factor(scan(text= "6 6 4 6 8 6 8 4 4 6 6 8 8 8 8 8 8 4 4 4 4 8 8 8 8 4 4 4 8 6 8 4")) am<-factor(scan(text= "1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 0 0 0 1 1 1 1 1 1 1"))

1. Using the data `cyl` and `am` (transmission type) from Part II, group vehicles based into 8 cylinder and less than 8 cyl. Test whether there is evidence of association between many cylinders and automatic transmissions. (_Hint:_ use `levels()` to re-level `cyl` and then use `chisq.test()`).

2. The built in dataset `faithful` records the time between eruptions and the length of the prior eruption for 272 inter-eruption intervals (load the data with `data(faithful)`). Examine the distribution of each of these variables with `stem()` or `hist()`. Plot these variables against each other with the length of each eruption (`eruptions`) on the x axis. How would you describe the relationship?

3) Fit a regression of `waiting` as a function of `eruptions`. What can we say about this regression? Compare the distribution of the residuals (`model$resid` where `model` is your lm object) to the distribution of the variables.

4) Is this data well suited to regression? Create a categorical variable from `eruptions` to separate long eruptions from short eruptions (2 groups) and fit a model (ANOVA) of `waiting` based on this. (_Hint:_ use `cut()` to make the categorical variable, and `lm()` to fit the model). How did you choose the point at which to cut the data? How might changing the cutpoint change the results?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

I need help on coding for R program using RStudio. I just need the coding part. Rstudio packages: gmodels gdata https://drive.google.com/file/d/14ufW_2oafqnBcXTXhS5ag2Mrhu0sfOQ8/view?usp=sharing Here...

I need help with coding in R for this question. Please Help! #Given the data y below, what is the probability AND log likelihood that the data are drawn from the following distributions? x

I have this project and added the link to data. I need help with coding in R. https://docs.google.com/spreadsheets/d/13Gsyt9yUyVR-qCPixsvOCqmS2edfsAvbmWu4tCB_kog/edit?usp=sharing question: Prompts...

Need help with coding in R (13. In this problem, we implement and compare support vector machines {SVMs} under various settings. We study a simulated dataset which contains a binary response variable...

I need help with coding in R: Consider a loaded dice such that the probability to obtain an outcome of 1 is 2p/3, the probability of obtaining 2, 3, 4 or 5 is p each, and the probability of obtaining...

Help please by using R studio. Edit View History Bookmarks Window Help O . . . O Quiz 1 - labdelhalim411@hcc.edu You will use RStudio to generate all statistics and graphs. . VERY IMPORTANT: You must...

critique of statistical validity must address two main points, (1) effect size and (2) statistical significance. Effect Size: Whether/How do they report how large their effect was or how much...

Part I: Maximizing The Number of Exposures of Ads Amber has recently faced quite a reduced number of customers due to the Corona pandemic. Vijay has decided to develop a promotional campaign for...

Q2 (Adaptive Modulation and Coding) Suppose that the bit error rate (BER) performance can be approximated by P = 0.2e-1.5yG(r)/(M-1), where y is the SNR, Gc (r) denotes the coding gain of the coding...

The U.S. government would like to help the American auto industry compete against foreign automakers that sell trucks in the United States. It can do this by imposing an excise tax on each foreign...

Create a simple smart home smart city IoT network using cisco packet tracer Minimum of three new IoT devices Appropriate device configuration Reflects real world scenarios/purpose.

The cost formula for a company can be modeled by C 66050x03x2 where x represents the number of items made A formula for the companys income is modeled with R103x07where is the number of items sold A...

Find the radius of gyration of a plate covering the region bounded by x=3, x=5, y=0, and y=4 with respect to the y-axis.

1. What business and social problems does data center power consumption cause? Whats too hot to handle? It might very well be your companys data center, which can easily consume more than 100 times...

5. What are the challenges of managing IT infrastructure and management solutions?

2. What solutions are available for these problems? Are they management, organizational, or technology solutions? Explain your answer. Whats too hot to handle? It might very well be your companys...