Question: Classification 3 0 Points Using dataset Red _ wine _ quality . csv answer following questions. Either use Python coding or Weka. Provide your work

Classification
30 Points
Using dataset Red_wine_quality .csv answer following questions. Either use Python coding or Weka. Provide your work in form of code and its output, if using Python. Screenshots, parameters used (with explanation) for Weka.
1. Is the dataset balanced? Which technique did you use to resolve the issue (Evaluation Method, Sampling technique, Threshold moving).-5Points
2. Does dataset have Scaling issue? Did you normalize your data? Show your normalization of dataset. -5 Points
Use following code for normalization-
from sklearn.preprocessing import MinMaxScaler
scaler = MinMaxScaler(feature_range=(0,1))
normal_df = scaler.fit_transform(df)
normal_df = pd.DataFrame(normal_df, columns = df.columns)
3. What is dependent variable and what is its data type? Is it in right format? If not convert it to nominal datatype. -5 Points
4. Create baseline model using following code--5 Points
from sklearn.dummy import DummyClassifier
dummy_classifier = DummyClassifier(strategy='most_frequent',random_state=0)
dummy_classifier.fit(X_train,y_train)
acc_baseline = dummy_classifier.score(X_test,y_test)
print("Baseline Accuracy =", acc_baseline)
5. Choose one Classifier (Decision tree, Nave Bayes, or SVM) to train your model. What is its accuracy, CV Score and F1 Score. -5 Points
6. Is your model better than baseline model? How do you evaluate your model performance? 5 Points

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!