Question: Now that you've learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorrectly. Or, even worse,

Now that youve learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorr

Part A) Use the entire dataset to determine whether Nefarians layout is an improvement over the original layout. Use an appr

Part C) Bummer. But Nefarian really wants his design to be an improvement, so whats a little bad science? What if he can fin

Now that you've learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorrectly. Or, even worse, maliciously. Usually it involves manipulating the data or the test in such a way to produce a desired result. There's many methods for this, and they've got some cool names like p-hacking and data dredging. In this problem, we will focus on the idea of using subsets of data to find a desired result. Nefarian just landed his first data science position as an intern at a new e-commerce company. His project was the design and test a new website layout that would lead to more purchases. To test his new layout, the company gathered four different groups of 50 customers and recorded how many of those ended up purchasing an item. This test was then repeated on multiple days. The effectiveness of Nefarian's layout is measured by the number of customers that made a purchase. This data is stored in the data frame purchases. Nefarian wants to land a permanent position at the company after his internship is over, so he really wants to impress his supervisors with his new layout. He knows that the site has an average purchase rate of 0.8 and wants to see if his layout is an improvement. purchases purchases = purchases[,-1] names (purchases) = c("group", "num_purchases") head (purchases)| read.csv("purchases.csv") A data.frame: 6 x 2 group num_purchases a 36 2 a 42 a 41 a 40 a 36 a 42 Part A) Use the entire dataset to determine whether Nefarian's layout is an improvement over the original layout. Use an appropriate hypothesis test and a significance level of a = 0.05. Store the p-value for this test in the variable p3.a and round your answer to two decimal places. Note: In case you haven't see a data frame before, think of it like a spreadsheet where each row is an instance each data and each column is a vector of specific values. To access the values in the "num_purchases" column, use purchases$num_purchases . # your code here p3.a = NA Part C) Bummer. But Nefarian really wants his design to be an improvement, so what's a little bad science? What if he can find a subset of data that supports his claim? Thinking back, Nefarian remembers that Group C supposedly contained some very impulsive customers. Using the same hypothesis from Part A, determine if Nafarian's layout was a statistically significant improvement at the a = 0.05 significance level, if he only looks at sampels from Group C. Save the p-value of this test as p3.c, rounded to three decimal places. Note: To filter the dataframe to only contain data for Group C, use purchases[purchases$group=="c",]. # your code here p3.c = NA Now that you've learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorrectly. Or, even worse, maliciously. Usually it involves manipulating the data or the test in such a way to produce a desired result. There's many methods for this, and they've got some cool names like p-hacking and data dredging. In this problem, we will focus on the idea of using subsets of data to find a desired result. Nefarian just landed his first data science position as an intern at a new e-commerce company. His project was the design and test a new website layout that would lead to more purchases. To test his new layout, the company gathered four different groups of 50 customers and recorded how many of those ended up purchasing an item. This test was then repeated on multiple days. The effectiveness of Nefarian's layout is measured by the number of customers that made a purchase. This data is stored in the data frame purchases. Nefarian wants to land a permanent position at the company after his internship is over, so he really wants to impress his supervisors with his new layout. He knows that the site has an average purchase rate of 0.8 and wants to see if his layout is an improvement. purchases purchases = purchases[,-1] names (purchases) = c("group", "num_purchases") head (purchases)| read.csv("purchases.csv") A data.frame: 6 x 2 group num_purchases a 36 2 a 42 a 41 a 40 a 36 a 42 Part A) Use the entire dataset to determine whether Nefarian's layout is an improvement over the original layout. Use an appropriate hypothesis test and a significance level of a = 0.05. Store the p-value for this test in the variable p3.a and round your answer to two decimal places. Note: In case you haven't see a data frame before, think of it like a spreadsheet where each row is an instance each data and each column is a vector of specific values. To access the values in the "num_purchases" column, use purchases$num_purchases . # your code here p3.a = NA Part C) Bummer. But Nefarian really wants his design to be an improvement, so what's a little bad science? What if he can find a subset of data that supports his claim? Thinking back, Nefarian remembers that Group C supposedly contained some very impulsive customers. Using the same hypothesis from Part A, determine if Nafarian's layout was a statistically significant improvement at the a = 0.05 significance level, if he only looks at sampels from Group C. Save the p-value of this test as p3.c, rounded to three decimal places. Note: To filter the dataframe to only contain data for Group C, use purchases[purchases$group=="c",]. # your code here p3.c = NA Now that you've learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorrectly. Or, even worse, maliciously. Usually it involves manipulating the data or the test in such a way to produce a desired result. There's many methods for this, and they've got some cool names like p-hacking and data dredging. In this problem, we will focus on the idea of using subsets of data to find a desired result. Nefarian just landed his first data science position as an intern at a new e-commerce company. His project was the design and test a new website layout that would lead to more purchases. To test his new layout, the company gathered four different groups of 50 customers and recorded how many of those ended up purchasing an item. This test was then repeated on multiple days. The effectiveness of Nefarian's layout is measured by the number of customers that made a purchase. This data is stored in the data frame purchases. Nefarian wants to land a permanent position at the company after his internship is over, so he really wants to impress his supervisors with his new layout. He knows that the site has an average purchase rate of 0.8 and wants to see if his layout is an improvement. purchases purchases = purchases[,-1] names (purchases) = c("group", "num_purchases") head (purchases)| read.csv("purchases.csv") A data.frame: 6 x 2 group num_purchases a 36 2 a 42 a 41 a 40 a 36 a 42 Part A) Use the entire dataset to determine whether Nefarian's layout is an improvement over the original layout. Use an appropriate hypothesis test and a significance level of a = 0.05. Store the p-value for this test in the variable p3.a and round your answer to two decimal places. Note: In case you haven't see a data frame before, think of it like a spreadsheet where each row is an instance each data and each column is a vector of specific values. To access the values in the "num_purchases" column, use purchases$num_purchases . # your code here p3.a = NA Part C) Bummer. But Nefarian really wants his design to be an improvement, so what's a little bad science? What if he can find a subset of data that supports his claim? Thinking back, Nefarian remembers that Group C supposedly contained some very impulsive customers. Using the same hypothesis from Part A, determine if Nafarian's layout was a statistically significant improvement at the a = 0.05 significance level, if he only looks at sampels from Group C. Save the p-value of this test as p3.c, rounded to three decimal places. Note: To filter the dataframe to only contain data for Group C, use purchases[purchases$group=="c",]. # your code here p3.c = NA Now that you've learned about hypothesis testing and p-values, you should also be aware that these methods can be used incorrectly. Or, even worse, maliciously. Usually it involves manipulating the data or the test in such a way to produce a desired result. There's many methods for this, and they've got some cool names like p-hacking and data dredging. In this problem, we will focus on the idea of using subsets of data to find a desired result. Nefarian just landed his first data science position as an intern at a new e-commerce company. His project was the design and test a new website layout that would lead to more purchases. To test his new layout, the company gathered four different groups of 50 customers and recorded how many of those ended up purchasing an item. This test was then repeated on multiple days. The effectiveness of Nefarian's layout is measured by the number of customers that made a purchase. This data is stored in the data frame purchases. Nefarian wants to land a permanent position at the company after his internship is over, so he really wants to impress his supervisors with his new layout. He knows that the site has an average purchase rate of 0.8 and wants to see if his layout is an improvement. purchases purchases = purchases[,-1] names (purchases) = c("group", "num_purchases") head (purchases)| read.csv("purchases.csv") A data.frame: 6 x 2 group num_purchases a 36 2 a 42 a 41 a 40 a 36 a 42 Part A) Use the entire dataset to determine whether Nefarian's layout is an improvement over the original layout. Use an appropriate hypothesis test and a significance level of a = 0.05. Store the p-value for this test in the variable p3.a and round your answer to two decimal places. Note: In case you haven't see a data frame before, think of it like a spreadsheet where each row is an instance each data and each column is a vector of specific values. To access the values in the "num_purchases" column, use purchases$num_purchases . # your code here p3.a = NA Part C) Bummer. But Nefarian really wants his design to be an improvement, so what's a little bad science? What if he can find a subset of data that supports his claim? Thinking back, Nefarian remembers that Group C supposedly contained some very impulsive customers. Using the same hypothesis from Part A, determine if Nafarian's layout was a statistically significant improvement at the a = 0.05 significance level, if he only looks at sampels from Group C. Save the p-value of this test as p3.c, rounded to three decimal places. Note: To filter the dataframe to only contain data for Group C, use purchases[purchases$group=="c",]. # your code here p3.c = NA

Step by Step Solution

★★★★★

3.49 Rating (169 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

Ans Hypothesis Test The purchase 3 ie p3 is 41 p341 A... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!

You are working as an intern at Coral Gables Products, a privately owned manufacturing company. Shortly after you read Chapter 13 in this book, you got into a discussion with the Chief Financial...

A business is prospering in such a way that its total (accumulated) profit after t years is 1000t2 dollars. (a) How much did the business make during the third year (between t = 2 and t = 3)? (b)...

A tennis ball is struck in such a way that it leaves the racket with a speed of 4.87 m/s in the horizontal direction. When the ball hits the court, it is a horizontal distance of 1.95 m from the...

Sharp Paper Inc. has three paper mills, one of which is located in Memphis, Tennessee. The Memphis mill produces 300 different types of coated and uncoated specialty printing papers. This large...

Primo Paper, Inc., has three paper mills, one of which is located in Seattle, Washington. The Seattle mill produces 200 different types of coated and uncoated specialty printing papers. This large...

Obtain Colgate-Palmolives (C-P) Form 8-K dated December 6, 2004. Companies file an 8-K with the SEC when they want to announce a special event has occurred at their business. As is often the case...

There are a number of sources of economic and demographic information that can assist the management accountant. The information includes financial information such as interest rates, employment,...

Describe how the graph of f varies as varies. Graph several members of the family to illustrate the trends that you discover. In particular, you should investigate how maximum and minimum points and...

Brooks Enterprises has never paid a dividend. Free cash flow is projected to be $80,000 and $100,000 for the next 2 years, respectively, and after the second year it is expected to grow at a constant...

Brooks Enterprises has never paid a dividend. Free cash flow is projected to be $80,000 and $100,000 for the next 2 years, respectively; after the second year, FCF is expected to grow at a constant...

Why might a failing project not be terminated?

Two 20-year bonds are identical in all respects except that one allows the issuer to call the bond in return for $1,000 cash at any time after five years while the other contains no call provisions....

The series 7 8 - 7 1 0 7 1 2 - 7 1 4 7 1 6 - dots can be rewritten as n = 1 ( - 1 ) n - 1

Proof by Natural Deduction - Predicate Logic. Use a direct proof to show that the following argument is valid. Premise 1: (3x)Kx - (x)(Lx Mx) Premise 2: Kc Lc Conclusion: Mc

1. Connect 8 LEDs with PORTD of the microcontroller. 2. All LEDs need to be grounded through current limiting resistors of 1 K ohm. 3. C-program the microcontroller such that it shows the 8-bit...

Julie has just retired. Her company's retirement program has two options as to how retirement benefits can be received. Under the first option, Julie would receive a lump sum of $126,000 immediately...

(b) Estimate the energy difference between the stable and unstable chair conformations of each of the following tetramethylcyclohexanes. Which is more stable isomer? Me Me (i) (ii) .. Me ..Me Versus...

Roy is a resident in Singapore who came to Hong Kong in March 2018 for holiday. He was introduced by a HK property agent to visit a residential property in Tai Koo Shing. The property was owned by a...

2. Think of the prison as a criminal justice organization. How does political power play out among line staff, supervisors and administrators? Do inmates participate in the political structure

plz i need answer with explanation ther Major Purchase L. Reply what you've learned - Auto Purchase S a ma CH /00 only income of which that it went to your pas more than credit Of You to drive the de...

pls answer accordenly Using the Rosen Shingle Creek. E.property as the subject of this exercise. Apply what you've learned in this module and textbook chapters as it relates to any of the following:...

SKILL ANALYSIS CASE INVOLVING POWER AND INFLUENCE ANALYSIS Dynica Software Solutions Dynica Technologes recently announced plans to construct a new production facit ity in River Woods. The new...

oday is the day I ve been working up to this moment all month long All year for that matte Today I sit with the jocks the most popular people at our school the highest of the Volleybal and Football...

need help determing the info, reports can be found ln sec.gov 1) Only enter answers in vellow cells 2) Calculate each financial statement item expressed as a percent change of the base period for...