Question: Objectives: Apply and evaluate three different machine learning methods on image recognition dataset. Compare the effectiveness of these methods using various performance metrics. Understand the
Objectives:
Apply and evaluate three different machine learning methods on image recognition dataset.
Compare the effectiveness of these methods using various performance metrics.
Understand the challenges involved in applying machine learning to digit recognition.
Dataset: Each line consists of the digit id followed by the grayscale values. There are training observations and test observations, distributed as follows:
: Total
Train: :
Test: :
The test set is notoriously "difficult", and a error rate is excellent. These data were kindly made available by the neural network group at AT&T research labs.
Instructions
A Feature Selection: Utilize any appropriate feature selection method to select relevant features before training the model.
B Machine Learning Models Implementation:
Implement any three machine learning models we covered in our classes like: KNNANN, SVM LDA, QDA, Linear Regression, etc. You may also try implementing other methods such as XGBoost, Random Forest, etc.
You may use Rapid Miner, WEKA, Google CoLab, Python Notebook, Python libraries such as Scikitlearn for implementing these models.
C Model Evaluation:
Evaluate each model using fold Cross Validation on the following performance metrics:
Accuracy
Precision
Recall
F Score
ROCAUC Score
Use crossvalidation to ensure the reliability of your results.
Test the performance of your model on the provided test dataset.
D Comparison and Analysis:
Compare the models based on the performance metrics.
Discuss the strengths and weaknesses of each model in the context of digit
recognition.
Provide insights into the challenges of using machine learning for digit recognition, if any.
Deliverables
A Report: Submit a detailed report that includes:
An overview of your data preprocessing steps.
A brief explanation of the chosen machine learning models.
Show the performance of your models before and after feature selection based on the metrices specified in the model evaluation section. You must provide a table that shows fold cross validation performance of the methods you selected to implement.
On separate tables, you must provide the performance comparison of the models on training and test dataset. If features selection results in better performance, you can use selected features to train and test the models.
Discuss the results you obtained and presented in each of the tables above.
B Code: Submit any code used for data preprocessing, model implementation, and evaluation. Ensure your code is wellcommented and organized.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
