Question: CS 3 2 5 : Artificial Intelligence Homework On Matrix factorization In this homework, you are going to find the optimum K value for Matrix
CS : Artificial Intelligence
Homework On Matrix factorization
In this homework, you are going to find the optimum K value
for Matrix Factorization based prediction model on a reallife
dataset. K value can range from to You can start with
and increment it by as your next K value.
Dataset: You will use the MovieLens k dataset, which is a
stable benchmark dataset with ratings given by
users for movies, with each user having rated at least
movies. This dataset consists of many files that contain
information about the movies, the users, and the ratings given by
users to the movies they have watched.
uitem: Information about the items movies; this is a tab
separated list of
movie id movie title release date video release date
I
IMDb URL unknown Action Adventure
Animation
Children's Comedy Crime Documentary Drama
Fantasy
FilmNoir Horror Musical Mystery Romance
SciFi
Thriller War Western
The last fields are the genres, a indicates the movie is of
that genre, a indicates it is not; movies can be in several genres
at once. The movie ids are the ones used in the udata data set.
udata: The full rating data set, ratings by users on
items. Each user has rated at least movies. Users and
items are numbered consecutively from The data is randomly
ordered. This is a tab separated list of user id item id rating
timestamp. The time stamps are Unix seconds since
UTC
uuser: Demographic information about the users; this is a tab
separated list of user id age gender occupation zip code. The
user ids are the ones used in the udata data set. Also, read the
'read.txt for more information on the dataset.
Go through the udata file to build the rating matrix user
moverating data Now, from the rating matrix build the
training matrix. As each user has rated at least movies,
select movies randomly for each user which the user rated, and
put them in the test data file. The training matrix will contain all
the movie ratings except those that are in the test data file. The
movies in the test file, you have to make them unrated. For
example: if user rated movie to in the rating matrix, then
if test data file contains movie rating of to by user then
training matrix file will contain movie rating of to by user
as rated and to as unrated. In summary, the test data file will
contain movie ratings for each user, and the training matrix
will contain all the movie rating information except those in the
test data for all users.
Now, use the matrix factorization algorithm to predict the
missing values in the training matrix. And test with different
K values and choose the best one with minimum error value.
What to submit: Submit a zip file of all your code that can
be executed and excel file with different value and the
corresponding error value.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
