Question: Data Management and Hypothesis Testing

Data Management and Hypothesis Testing
Assignment Overview: This assignment, due at the end of Week 6, accounts for 10% of your
final grade and requires you to apply the concepts of data cleaning, hypothesis testing, and
analysis. You will work individually to gather, clean, and prepare data from at least two different
datasets, use AI tools to assist you in generating hypotheses, and then test those hypotheses using
Excel.
Assignment Objectives:
Utilize AI to generate research hypotheses based on multiple business datasets.
Apply data management techniques in Excel for multiple datasets.
Conduct hypothesis testing in Excel to evaluate the AI-assisted hypotheses.
Interpret and report the findings clearly and concisely.
Assignment Components:
1. Dataset Selection and Preparation
o Dataset Requirement: Choose a minimum of two datasets relevant to a business
problem. These datasets can be provided by the instructor or self-selected. The
datasets should include multiple variables that can be analyzed for potential
relationships.
o Data Cleaning:
Missing Data: Identify and address any missing data within each dataset
using appropriate techniques (e.g., imputation, removal).
Outliers: Detect outliers in each dataset and decide how to handle them.
Data Consistency: Ensure consistency in data formats, units, and coding
across the datasets (e.g., text vs. numeric).
Documenting: Document your data cleaning process, explaining the steps
taken and why, particularly how you merged the datasets.
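The cleaning steps above (missing data, outliers, consistency) can be illustrated with a minimal Python sketch. The assignment itself calls for Excel, so this is only a sketch of the underlying logic, not the required method. The column values, the helper names (`to_number`, `impute_missing`, `iqr_outliers`), the choice of median imputation, and the 1.5 x IQR rule are all assumptions made for illustration.

```python
from statistics import median, quantiles


def to_number(raw):
    """Coerce mixed text/numeric entries to float; blanks become None."""
    if raw is None:
        return None
    if isinstance(raw, (int, float)):
        return float(raw)
    text = raw.strip().replace(",", "")  # handles entries like " 1,200 "
    return float(text) if text else None


def impute_missing(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)  # default 'exclusive' quartile method
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical sales column mixing numbers, text, and blanks.
raw_sales = [120, "135", None, " 1,200 ", 128, "", 131, 126]

numeric = [to_number(v) for v in raw_sales]   # consistency: one numeric type
cleaned = impute_missing(numeric)             # missing data: median imputation
flagged = iqr_outliers(cleaned)               # outliers: flag, then decide

print(cleaned)
print(flagged)
```

Flagging outliers rather than deleting them automatically keeps the decision, and its justification, in your hands, which is what the documentation step asks you to record. Quartile conventions differ between tools, so the cut-offs may not match Excel's exactly.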
2. AI-assisted Hypotheses
o Hypothesis Generation:
Use an AI tool (e.g., ChatGPT or another AI platform) to analyze your
datasets and generate potential research hypotheses. These hypotheses
should relate to relationships or differences between variables across the
two datasets.
o Evaluation of Hypotheses:
Review the hypotheses, considering their feasibility and relevance to the
datasets and business context.
Select at least 2 hypotheses to test using statistical methods, ensuring that
the hypotheses involve comparisons or relationships that require data from
both datasets.
3. Hypothesis Testing in Excel
o Testing Process:
Use Excel to perform hypothesis testing, utilizing data from both datasets.
Depending on your hypothesis, you may conduct t-tests, z-tests, chi-square
tests, or ANOVA, ensuring you select the appropriate test for your data
type and hypothesis.
o Confidence Intervals:
Calculate and interpret confidence intervals to evaluate your hypotheses.
Explain what the intervals indicate about your data and hypotheses.
o Statistical Significance:
Determine the statistical significance of your results. Discuss whether your
findings support or refute the hypotheses, particularly focusing on the
comparison across datasets.
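To show how the test statistic, confidence interval, and significance decision in this component relate to one another, here is a minimal Python sketch of a two-sample comparison of means. It uses a large-sample normal (z) approximation, which is an assumption chosen to keep the example dependency-free. The function name `two_sample_z` and both samples are hypothetical, and the samples are far too small for a z-test in practice; with data like this, a t-test in Excel would be the appropriate choice.

```python
from math import sqrt
from statistics import NormalDist, mean, variance


def two_sample_z(a, b, confidence=0.95):
    """Compare two sample means using a normal approximation.

    Returns the difference in means, the z statistic, the two-sided
    p-value, and a confidence interval for the difference.
    """
    diff = mean(a) - mean(b)
    # Standard error of the difference, allowing unequal variances.
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    z = diff / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    crit = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return {
        "diff": diff,
        "z": z,
        "p_value": p_value,
        "ci": (diff - crit * se, diff + crit * se),
    }


# Hypothetical values: one variable drawn from each of the two datasets.
dataset_a = [52, 55, 49, 58, 61, 54, 57, 53]
dataset_b = [45, 47, 50, 44, 48, 46, 49, 43]

result = two_sample_z(dataset_a, dataset_b)
alpha = 0.05
reject_null = result["p_value"] < alpha

print(result)
print("Reject H0 at the 5% level:", reject_null)
```

The interval and the p-value tell the same story from two angles: a 95% interval for the difference that excludes zero corresponds to a two-sided p-value below 0.05.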
4. Report Findings
o Summary Report:
Create a concise report summarizing the following:
Data Gathering and Cleaning Process: Describe the datasets and
the steps taken to clean and prepare them, including any challenges
in merging or comparing the datasets.
Hypothesis Testing: Describe how you used AI to develop the
hypotheses and report the results of your hypothesis tests in Excel,
focusing on the analysis across the two datasets.
Interpretation: Interpret the results, explaining whether the
hypotheses were supported by the data from both datasets.
Conclusion: Discuss the implications of your findings for the business
problem and any potential limitations of your analysis, especially
considering the use of multiple datasets.
o Report Format:
Your report should include relevant tables or charts generated in Excel.
Use clear headings for each section.
5. Submission Guidelines
o Deadline: Submit your completed assignment by 10/6.
o Format: Submit your report as a Word document, and include the AI prompts
and output in a separate Word document, as well as the Excel file with your data
and analysis.
o File Naming: Use the following format for your file names:
LastName_FirstName_Week6Assignment
o Submission: Upload your files to the assignment folder in D2L.
6. Assessment Criteria
o Data Gathering and Cleaning (30%): Quality of data selection and
thoroughness of the cleaning process across multiple datasets.
o Hypotheses (20%): Relevance and feasibility of the hypotheses, especially in the
context of multiple datasets.
o Hypothesis Testing (30%): Correct application of statistical tests and
interpretation of results, with a focus on cross-dataset analysis.
o Report Quality (20%): Clarity, organization, and professionalism of the report,
including the effective use of Excel for analysis and presentation.
Final Note: This assignment is designed to challenge you to integrate multiple datasets in a
business context, using AI tools alongside traditional statistical analysis techniques with
Microsoft Excel. It will deepen your understanding of how to handle complex data scenarios and
apply hypothesis testing to real-world business problems.
