Question: you will use your data sample from see the attachment ''data simple.'' to describe your response variable. You'll need to provide some numerical summary statistics,
you will use your data sample from see the attachment ''data simple.'' to describe your response variable. You'll need to provide some numerical summary statistics, such as mean and median, as well as visual summary in the form of a boxplot or histogram. Required content: Descriptive Statistics for response variable (mean, median, 5 number summary, std deviation) Histogram of response variable (don't forget to include axis and chart titles - be sure to specify the units of measurement) Side-by-side boxplot of response variable broken out by categorical variable. Written summary of your response variable, including interpretations of the numerical and graphical representations as well as a description of the shape of the distribution (skewed, symmetric, etc).
Data Simple: There are 14 columns for the data set which are labeled the following: age, sex, cprestecg, thalach, exang, oldpeak, trestbps, chol, fbs, slope, thal, num, and ca.
These column titles mean nothing without a description of their meaning and what information each of the columns hold.
The following are what they mean: age- age in years;
sex- sex (1 = male, 0 = female); cp- chest pain type -- Value 1: typical angina -- Value 2: atypical angina -- Value 3: non-anginal pain -- Value 4: asymptomatic; trestbps- resting blood pressure (in mm Hg on admission to the hospital); chol- serum cholestoral in mg/dl; fbs- (fasting blood sugar > 120 mg/dl) (1 = true, 0 = false); restecg- resting electrocardiographic results -- Value 0: normal -- Value 1: having ST-T wave abnormality (T wave inversions and/or ST elevation or depression of > 0.05 mV) -- Value 2: showing probable or definite left ventricular hypertrophy by Estes' criteria; thalach- maximum heart rate achieved; exang- exercise induced angina (1 = yes, 0 = no); oldpeak- ST depression induced by exercise relative to rest; slopethe slope of the peak exercise ST segment -- Value 1: upsloping -- Value 2: flat -- Value 3: downsloping; ca- number of major vessels (0-3) colored by flourosopy, thal- 3 = normal, 6 = fixed defect, 7 = reversable defect; num (the predicted attribute)- diagnosis of heart disease (angiographic disease status) -- Value 0: < 50% diameter narrowing -- Value 1: > 50% diameter narrowing in any major vessel.
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
