Question: Machine Learning Classification and Clustering Models Multiclass strategies to evaluate and compare the performance of combined over - sampling and under - sampling methods at

Machine Learning Classification and Clustering Models
Multiclass strategies to evaluate and compare the performance of combined over-sampling and under-sampling methods at different sampling fractions when building a machine learning model for a synthetic dataset.
Instructions:
1. Generate the synthetic dataset as follows: make_classification(n_samples=1000, n_features=8, n_informative=3, n_classes=3, flip_y=0.2, weights=[0.55,0.35,0.1], random_state=42)
2. Balance the dataset (choose any oversampling method)
3. Considering the previous dataset, compare the accuracy performance of at least 3 classification algorithms when using the one-vs-one and one-vs-all strategies.
4. In terms of F1 measure, calculate the macro and weighted average performances.
5. Repeat steps 3 and 4, but do not implement any resampling strategy this time. Do you notice any performance change in terms of accuracy and F1 metrics?
6. Visualize and analyze your results
Note: Kindly include the complete code.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!