Question: Practical data Science with Python Assignment Assessment : Recommender Systems This assignment focuses on recommender systems, a data science application widely used in the real
Practical data Science with Python Assignment Assessment : Recommender Systems This assignment focuses on recommender systems, a data science application widely used in the real world. You will need to develop and implement appropriate solutions to complete the corresponding tasks and present the results (virtually). These tasks must be completed individually. Introduction This assignment focuses on recommender systems, a data science application widely used in the real world. You will need to develop and implement appropriate solutions to complete the corresponding tasks, and present the results (virtually). These tasks must be completed individually. Academic Integrity All assignments will be checked with plagiarism-detection software; any student found to have plagiarised will be subject to disciplinary actions. Plagiarism includes, e.g., submitting code that is not your own or submitting text that is not your own. Allowing others to copy your work is also considered as plagiarism. All plagiarisms will be penalised; there are no exceptions and no excuses. The Dataset This assignment deals with movie recommendation. The dataset to be used throughout the assignment is the MovieLens 1M Dataset. Task 1: User-based Collaborative Filtering (6 marks) In this task you need to develop, implement, and evaluate user-based (i.e., user-user) collaborative filtering that uses KNN (k-nearest neighbour), i.e., KNN-based Collaborative Filtering. Randomly choose one user (as the active user) and predict this users ratings on movies. Note that in the given dataset, the user might have only rated some of the movies. Specific requirements include:
- Choose an appropriate similarity metric (and other parameters).
- Implement the approach in Python. Add detailed comments for the code (to explain
- your implementation).
- Study the impact of the parameter K (of KNN), with at least 5 different values.
- Use RMSE (root-mean-square error) as the metric for evaluation.
- Summarise the results/findings concisely in slides (and presentation).
- Choose an appropriate value for the parameter K (of KNN) (and other parameters).
- Implement the approach in Python. Add detailed comments for the code (to explain
- your implementation).
- Compare the performance of at least 2 similarity metrics.
- Use RMSE (root-mean-square error) as the metric for evaluation.
- Summarise the results/findings concisely in slides (and presentation).
- Describe the idea (in your own words) clearly and precisely in slides (and presentation). Cite references wherever necessary.
- In slides, explicitly give the source of the solution (if you choose Option 1) or provide
- Name the solution as Option1RecSys if you choose Option 1 or Option2RecSys for
- Implement the approach in Python. Add detailed comments for the code (to explain
- Movie Average: recommends items with the highest average ratings (see also Week
- KNN-based Collaborative Filtering: the approach developed in the above Task 1.
- The solution developed in the above Task 3.1, i.e., Option1RecSys or Option2RecSys.
- a cover page/slide containing e.g., your name and student ID, in addition to Assignment info,
- a concise outline, key results, and findings of Task 1,
- a concise outline, key results, and findings of Task 2,
- clear and complete description of the new solution developed in Task 3.1 (with
- proper citations),
- literature review (with proper citations), if applicable,
- necessary/key details of the algorithm of Option1RecSys or Option2RecSys,
- 4
- key results, visualisation, and findings of Task 3.2,
- a list of references.
- The slides should be no more than 10 pages in total (that is, no more than 10 slides)
- Save your clean slides as a PDF file and name it A3Slides.pdf. This version should
- not contain any recordings therein.
- The presentation (recording) should be no more than 5 minutes.
- The recording video file must be in MP4 format. Name it A3Presentation.mp4.
- The recording video file must be less than 30MB in size.
| Assignment 3 (Sem 2, 2023) | |||||||
|---|---|---|---|---|---|---|---|
| Criteria | Ratings | Pts | |||||
| This criterion is linked to a learning outcomeTask 1 User-based Collaborative Filtering |
| 6pts | |||||
| This criterion is linked to a learning outcomeTask 2 Item-based Filtering |
| 6pts | |||||
| This criterion is linked to a learning outcomeTask 3 A Better Recommender System |
| 12pts | |||||
| This criterion is linked to a learning outcomeTask 4 Presentation |
| 6pts | |||||
| Total points:30 |
Attachments:
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
