Question:
1) Create a tab in your Excel spreadsheet and call it "Notes". You can also use a Word document instead of an Excel spreadsheet for this assignment. Describe your dataset in detail, including the total number of observations and the total number of variables.
2) Give an overview of the data and discuss some of the relationships that you are interested in looking at and what you expect to find.
3) Clean the data, if it isn't already cleaned up. Keep only what you need, excluding data ranges or variables irrelevant to the questions you want to "ask" of the data. In your document, describe the steps you took to clean the data: what you chose to exclude and why, or what "impurities" came in the original file that you had to correct.
4) Using filters, get a casual sense of the data, noting apparent relationships or tendencies that seem to emerge even before you run any more precise methods. Note these observations in your document.
5) In your document, ask three different questions that can be answered by your dataset, whether or not you are prepared to answer them. Some of them may be answered by the subsequent steps (e.g., "What is the relationship between variable x and variable y?" or "Are the average values of variables x and y significantly different?").
6) Establish some basic descriptive statistics about the key variables of interest: their mean and median values, and the standard deviation. Note these in your document.
7) Create simple graphs for your variables of interest. These could be bar graphs or any other types of charts that you think appropriately display your data.
8) Perform any two hypothesis tests - it is your choice - using confidence intervals. Report the results in your document, and interpret using ordinary language.
9) Perform any two simple linear regression analyses. Remember that simple linear regression finds the relationship between a dependent variable and a single independent variable. You may use the same dependent variable twice, or two different dependent variables. Report the results in your document, and interpret using ordinary language.
10) Conclude your document with your general observations: Were your results what you expected, or other than you expected? Are there certain other questions you now have that could be answered by the dataset? Are there questions you have for which you need additional data not present in the dataset?
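
The calculations in steps 6, 8, and 9 can be cross-checked outside Excel. The following is a minimal sketch in Python, not a required method for the assignment. The lists `x` and `y` are made-up placeholder values, and the function names are hypothetical. The confidence interval uses a normal critical value, which assumes a reasonably large sample; for a small sample a t critical value would be more appropriate.

```python
import math
import statistics as st

# Hypothetical placeholder data; replace with two columns from your own dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.2]


def describe(values):
    """Step 6: mean, median, and sample standard deviation."""
    return {
        "mean": st.mean(values),
        "median": st.median(values),
        "stdev": st.stdev(values),
    }


def mean_confidence_interval(values, confidence=0.95):
    """Step 8: approximate confidence interval for the mean.

    Assumption: uses a normal (z) critical value, not a t value.
    """
    z = st.NormalDist().inv_cdf(0.5 + confidence / 2)
    half_width = z * st.stdev(values) / math.sqrt(len(values))
    center = st.mean(values)
    return center - half_width, center + half_width


def rejects_null(values, hypothesized_mean, confidence=0.95):
    """Step 8: reject H0 if the hypothesized mean lies outside the interval."""
    low, high = mean_confidence_interval(values, confidence)
    return not (low <= hypothesized_mean <= high)


def simple_linear_regression(xs, ys):
    """Step 9: least-squares slope, intercept, and R-squared for y on x."""
    mean_x, mean_y = st.mean(xs), st.mean(ys)
    sxx = sum((a - mean_x) ** 2 for a in xs)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_total = sum((b - mean_y) ** 2 for b in ys)
    ss_resid = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(xs, ys))
    r_squared = 1 - ss_resid / ss_total
    return slope, intercept, r_squared


if __name__ == "__main__":
    print(describe(x))
    print(mean_confidence_interval(y))
    print(simple_linear_regression(x, y))
```

In ordinary language: if `rejects_null` returns `True`, the hypothesized mean is not a plausible value at the chosen confidence level. The slope from `simple_linear_regression` estimates how much the dependent variable changes for a one-unit change in the independent variable.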