World Health Organisation's (WHO) specialised cancer agency, the International Agency for Research on Cancer (IARC) has...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
World Health Organisation's (WHO) specialised cancer agency, the International Agency for Research on Cancer (IARC) has designated fine particulate matter (PM2.5) as carcinogenic to human beings. PM2.5 particles have a diameter of 2.5 micrometers (0.0025 mm) or smaller and they are small enough for people to breath them deeply into lungs and sometimes PM2.5 particles can even enter the bloodstream. Research indicates that temperature in degrees (°C), relative humidity in percentage (%), wind speed in kilometers per hour (km/h), and precipitation in millimeters (mm) are potential predictors for PM2.5 concentration in milligram per cubic meter (µg/m³). A random sample of the annual mean temperature, humidity, wind, precipitation and PM2.5 concentration at 56 test locations was collected. The data is available in the file pm25.csv on iLearn. It is located under Assessment →→ Assignment →→ Assignment datasets. Variable temperature humidity wind precipitation pm25 Description The annual mean temperature in degrees The annual mean relative humidity in percentage The annual mean wind speed in kilometers per hour The annual mean precipitation in millimeters The annual mean PM2.5 concentration in milligram per cubic meter a. [7 marks] Produce a plot and a correlation matrix of the data. Comment on possible relationships between the response and predictors and relationships between the predictors themselves. b. [6 marks] . Fit a model using all the predictors to explain the pm25 response. Using the full model, estimate the impact of humidity on PM2.5 concentration. Do this by producing a 95% confidence interval that quantifies the change in PM2.5 concentration for each extra percentage of relative humidity and comment. c. [14 marks] Conduct an F-test for the overall regression i.e. is there any relationship between the response and the predictors. In your answer: • Write down the mathematical multiple regression model for this situation, defining all appropriate parameters. Write down the Hypotheses for the Overall ANOVA test of multiple regression. Produce an ANOVA table for the overall multiple regression model (One combined regression SS source is sufficient). Compute the F statistic for this test. State the Null distribution for the test statistic. Compute the P-Value State your conclusion (both statistical conclusion and contextual conclusion). ● ● ● d. [10 marks] Validate the full model and comment on whether the full regression model is appropriate to explain the PM2.5 concentration at various test locations. e. [2 marks] Find the R2 and comment on what it means in the context of this dataset. f. [3 marks] Using model selection procedures discussed in the course, find the best multiple regression model that explains the data. State the final fitted regression model. g. [3 marks] Comment on the R² and adjusted R²2 in the full and final model you chose in part f. In particular explain why those goodness of fitness measures change but not in the same way. World Health Organisation's (WHO) specialised cancer agency, the International Agency for Research on Cancer (IARC) has designated fine particulate matter (PM2.5) as carcinogenic to human beings. PM2.5 particles have a diameter of 2.5 micrometers (0.0025 mm) or smaller and they are small enough for people to breath them deeply into lungs and sometimes PM2.5 particles can even enter the bloodstream. Research indicates that temperature in degrees (°C), relative humidity in percentage (%), wind speed in kilometers per hour (km/h), and precipitation in millimeters (mm) are potential predictors for PM2.5 concentration in milligram per cubic meter (µg/m³). A random sample of the annual mean temperature, humidity, wind, precipitation and PM2.5 concentration at 56 test locations was collected. The data is available in the file pm25.csv on iLearn. It is located under Assessment →→ Assignment →→ Assignment datasets. Variable temperature humidity wind precipitation pm25 Description The annual mean temperature in degrees The annual mean relative humidity in percentage The annual mean wind speed in kilometers per hour The annual mean precipitation in millimeters The annual mean PM2.5 concentration in milligram per cubic meter a. [7 marks] Produce a plot and a correlation matrix of the data. Comment on possible relationships between the response and predictors and relationships between the predictors themselves. b. [6 marks] . Fit a model using all the predictors to explain the pm25 response. Using the full model, estimate the impact of humidity on PM2.5 concentration. Do this by producing a 95% confidence interval that quantifies the change in PM2.5 concentration for each extra percentage of relative humidity and comment. c. [14 marks] Conduct an F-test for the overall regression i.e. is there any relationship between the response and the predictors. In your answer: • Write down the mathematical multiple regression model for this situation, defining all appropriate parameters. Write down the Hypotheses for the Overall ANOVA test of multiple regression. Produce an ANOVA table for the overall multiple regression model (One combined regression SS source is sufficient). Compute the F statistic for this test. State the Null distribution for the test statistic. Compute the P-Value State your conclusion (both statistical conclusion and contextual conclusion). ● ● ● d. [10 marks] Validate the full model and comment on whether the full regression model is appropriate to explain the PM2.5 concentration at various test locations. e. [2 marks] Find the R2 and comment on what it means in the context of this dataset. f. [3 marks] Using model selection procedures discussed in the course, find the best multiple regression model that explains the data. State the final fitted regression model. g. [3 marks] Comment on the R² and adjusted R²2 in the full and final model you chose in part f. In particular explain why those goodness of fitness measures change but not in the same way.
Expert Answer:
Answer rating: 100% (QA)
a 7 marks Produce a plot and a correlation matrix of the data Comment on possible relationships between the response and predictors and relationships between the predictors themselves Here is the plot ... View the full answer
Related Book For
Business And Professional Ethics
ISBN: 9780357441886
9th Edition
Authors: Leonard J Brooks, Paul Dunn
Posted Date:
Students also viewed these economics questions
-
Joe transfers land with a FMV of $ 5 0 0 , 0 0 0 and basis of $ 1 0 0 , 0 0 0 for land owned by John. Land is subject to a mortgage of $ 1 0 0 , 0 0 0 ; John transfers $ 1 0 0 , 0 0 0 cash and land...
-
Managing Scope Changes Case Study Scope changes on a project can occur regardless of how well the project is planned or executed. Scope changes can be the result of something that was omitted during...
-
The A-36 steel wires AB and AD each have a diameter of 2 mm and the unloaded lengths of each wire are LAC = 1.60 m and LAB = LAD = 2.00 m. Determine the required diameter of wire AC so that each wire...
-
Construct the general solution of x ' = Ax involving complex eigenfunctions and then obtain the general real solution. Describe the shapes of typical trajectories. =[ A = 3. -2 1]
-
In 2014, Trish, a self-employed CPA and calendar year taxpayer, acquires and places in service an automobile and a personal computer. Pertinent data include the following: Or each asset, calculate...
-
You have $1,225 in a savings account which earns 8.4% compounded monthly and $1,300 in an account which earns 6% compounded monthly. How many years will it be until the two accounts have the same...
-
What accounting procedures must be completed before the partnership liquidation process begins?
-
Average flow velocity in turbulent tube flow. (a) For the turbulent flow in smooth circular tubes, the function 1 is sometimes useful for curve-fitting purposes: near Re = 4 x 10 3 , n = 6; near Re =...
-
When writing out a diagnostic clinical impression in psychology sometimes the use of z codes/other factors is pertinent in order to include any information that may be clinically relevant to the...
-
A Statement of Cash flows helps with a. Assessing management performance b. Assessing past cash flow only c. Assessing past cash flow and projecting future cash flows d. Projecting future cash flow...
-
solve areas in yellow. Both labor variance and parts variance should equal 80,025 MRO Department - 2020 Results Change in Gross Margin 80,025 Revenues (billed out): Budget Actual Fav./(Unfav.) -...
-
A bond has a par value of $1,000, pays $70 semiannually and has a maturity of 15 years. 1) If the bond earns 12% per year, what is the price of the bond? Rate Nper PMT FV Type PV 2) What is the yield...
-
Calculate the Cash Flow balance (as per balance sheet) in year 3. Assume all missing data is zero Year 1 Year 2 Year 3 P&L Revenues 200.0 250.0 300.1 COGS (75.0) (90.0) (105.0) OPEX (12.0) (15.0)...
-
3 Walking Bear Resources Inc. Equity Section of the Balance Sheet March 31, 2023 Contributed capital: Preferred shares, $17 cumulative, 2,900 shares authorized, issued, and outstanding 812,000...
-
Apply Slack and Lewis (S/L) Operations Strategy (OS) framework and Matrix to a real-world company of your choice. The following structure is recommended. Introduce the organisation and the state of...
-
Provide an analysis of your career strategy as (HR Manager in a multinational company) broken down according to: Goals and values The environment Resources and capabilities Strategy Implementation...
-
In a large midwestern university, 30% of the students live in apartments. If 200 students are randomly selected, find the probability that the number of them living in apartments will be between 55...
-
If you were a management accountant, would you buy a product from a supplier for personal use at 25% off list?
-
What is the most important contribution of a corporate code of conduct?
-
Is the 2019 Business Roundtable Statement (BRS) redefining the purpose of corporations likely to make any difference to boards of directors and to activists?
-
Consider the problem of generating samples from \(Y \sim \operatorname{Gamma}(2,10)\). (a) Direct simulation: Let \(U_{1}, U_{2} \sim\) idd \(\mathscr{U}(0,1)\). Show that \(-\ln \left(U_{1} ight) /...
-
Let \(U, V \sim_{\text {iid }} \mathscr{U}(0,1)\). The reason why in Example 3. 7 the sample mean and sample median behave very differently is that \(\mathbb{E}[U / V]=\infty\), while the median of...
-
As a generalization of Example C.9, consider a random walk on an arbitrary undirected connected graph with a finite vertex set \(\mathscr{V}\). For any vertex \(v \in \mathscr{V}\), let \(d(v)\) be...
Study smarter with the SolutionInn App