Question: ASSESSMENT TASK 2 (PROBLEM SOLVING) in 2023 T3, Using aggregation functions for data analysis

ASSESSMENT TASK 2 (PROBLEM SOLVING), 2023 T3

Using aggregation functions for data analysis

The provided zip file contains the data file [ENB_2023.txt] and the R code [AggWaFit718.R] to use with the following tasks; include these in your R working directory.

Total Marks: 100, Weighting: 20%

Energy Appliances Dataset

The Dataset for this assignment is a modified version of a subset of the data used in Candanedo et al., 2017. The experimental data have been used to create models of energy use of appliances in a low-energy house. The modified Dataset provides the energy use of Appliances (denoted as Y). The Dataset comprises 5 features (variables), which are denoted as X1, X2, X3, X4 and X5. The details about these variables are given below:

X1: Temperature in living room area (Celsius degrees)
X2: Humidity in living room area (percentage)
X3: Temperature in office room (Celsius degrees)
X4: Humidity in office room (percentage)
X5: Pressure (millimeter of mercury)
Y: Appliances energy consumption (Wh)

For more information about the variables see Candanedo et al., 2017.

Assignment tasks

T1. Understand the data

(i) Download the txt file (ENB_2023.txt) from CloudDeakin and save it to your R working directory.

(ii) Assign the data to a matrix, e.g. using

the.data <- as.matrix(read.table("ENB_2023.txt"))

(iii) The variable of interest is Y. To investigate Y, generate a subset of num_row = 400 rows (use the same setting for the following tasks as well) with numerical data, e.g. using:

my.data <- the.data[sample(1:num_samples, num_row), c(1:num_col)]

This would give you a new dataset with num_row rows and num_col columns. Values of num_samples and num_col have to be determined from the data provided.

(iv) Use scatter plots and histograms to understand the relationship between each of the variables X1, X2, X3, X4, X5, and your variable of interest Y, i.e., scatter plots of (X1, Y), (X2, Y), ..., (X5, Y), and histograms of X1, X2, X3, X4, X5, Y.
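As a rough illustration of steps (ii) to (iv), the sketch below runs the sampling line on a synthetic matrix. The synthetic values, the column order (X1 to X5 followed by Y) and the helper name plot.overview are assumptions made so the example runs without ENB_2023.txt; with the real file, the.data would come from read.table as shown above.

```r
# Synthetic stand-in for ENB_2023.txt (an assumption, not the real data):
# six numeric columns playing the roles of X1..X5 and Y.
set.seed(718)
n <- 1000
the.data <- cbind(
  X1 = rnorm(n, 21, 1.5),
  X2 = rnorm(n, 40, 4),
  X3 = rnorm(n, 20, 1.5),
  X4 = rnorm(n, 40, 4),
  X5 = rnorm(n, 755, 7),
  Y  = rexp(n, 1 / 90) + 10
)

# Read both sizes off the data instead of hard-coding them.
num_samples <- nrow(the.data)
num_col     <- ncol(the.data)
num_row     <- 400

# Draw 400 distinct rows at random and keep every column.
my.data <- the.data[sample(1:num_samples, num_row), c(1:num_col)]

# Hypothetical helper: scatter plots of each Xi against Y (assumed to be the
# last column), followed by histograms of every column.
plot.overview <- function(d) {
  y <- d[, ncol(d)]
  op <- par(mfrow = c(2, 3))
  on.exit(par(op))
  for (j in 1:(ncol(d) - 1)) {
    plot(d[, j], y, xlab = colnames(d)[j], ylab = "Y",
         main = paste(colnames(d)[j], "vs Y"))
  }
  for (j in 1:ncol(d)) {
    hist(d[, j], xlab = colnames(d)[j],
         main = paste("Histogram of", colnames(d)[j]))
  }
}
```

Calling plot.overview(my.data) then draws the five scatter plots and six histograms on a 2 x 3 grid. Fixing the seed with set.seed keeps the 400-row sample reproducible across the later tasks.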

T2. Transform the data

Choose any FOUR variables from X1, X2, X3, X4, X5.

Make appropriate transformations so that the values can be aggregated in order to predict the variable of interest Y.

Assign your transformed data along with your transformed variable of interest to an array (it should be num_row rows and 5 columns). Save it to a txt file titled "name-transformed.txt".

write.table(your.data, "name-transformed.txt")

The following tasks are based on the saved transformed data.
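One possible shape for this step is sketched below. The choice of variables, the min-max scaling, the flipped direction for X5 and the log applied to Y are all hypothetical choices for illustration; the right transformations depend on what the scatter plots and histograms of the real sample show. The data are synthetic and the file is written to a temporary directory so the example is self-contained.

```r
# Min-max scaling to [0, 1]; flip = TRUE reverses the direction so that a
# negatively related input moves the same way as Y.
minmax <- function(x, flip = FALSE) {
  z <- (x - min(x)) / (max(x) - min(x))
  if (flip) 1 - z else z
}

set.seed(718)
# Synthetic stand-in for the 400-row sample (assumption, not the real data).
my.data <- cbind(X1 = rnorm(400, 21, 1.5), X2 = rnorm(400, 40, 4),
                 X3 = rnorm(400, 20, 1.5), X5 = rnorm(400, 755, 7),
                 Y  = rexp(400, 1 / 90) + 10)

# Hypothetical choices: scale every input, flip X5, and take log(Y) before
# scaling on the assumption that Y is right-skewed.
your.data <- cbind(
  minmax(my.data[, "X1"]),
  minmax(my.data[, "X2"]),
  minmax(my.data[, "X3"]),
  minmax(my.data[, "X5"], flip = TRUE),
  minmax(log(my.data[, "Y"]))
)

# Keep the raw ranges: the same constants are needed later to transform new
# inputs and to map predictions back to Wh.
ranges <- apply(my.data, 2, range)

# Written to a temporary directory here; the task asks for
# "name-transformed.txt" in the working directory.
out.file <- file.path(tempdir(), "name-transformed.txt")
write.table(your.data, out.file)
```

Scaling every column to the same [0, 1] interval is what makes the values comparable enough to be averaged by the aggregation functions in the next task.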

T3. Build models and investigate the importance of each variable.

(i) Download the AggWaFit.R file to your working directory and load it into the R workspace using

source("AggWaFit718.R")

(ii) Use the fitting functions to learn the parameters for

a. A weighted arithmetic mean (WAM),
b. Weighted power means (WPM) with p = 0.5,
c. Weighted power means (WPM) with p = 2,
d. An ordered weighted averaging function (OWA).
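For orientation, the four aggregation functions named in (ii) can be written in a few lines of base R. This is only a sketch of the definitions with hand-picked weights; it is not the fitting code in AggWaFit718.R, and the function names wam, wpm and owa are made up for this example.

```r
# Weighted arithmetic mean: sum_i w_i * x_i, with weights >= 0 summing to 1.
wam <- function(x, w) sum(w * x)

# Weighted power mean: (sum_i w_i * x_i^p)^(1/p), for p != 0 and x >= 0.
wpm <- function(x, w, p) sum(w * x^p)^(1 / p)

# Ordered weighted average: weights attach to rank positions, not variables.
# Here w[1] multiplies the smallest input (some texts sort descending instead).
owa <- function(x, w) sum(w * sort(x))

x <- c(0.2, 0.9, 0.5, 0.4)   # one transformed row, values in [0, 1]
w <- c(0.1, 0.4, 0.3, 0.2)   # illustrative weights, not fitted ones

wam(x, w)        # 0.61
wpm(x, w, 0.5)   # never above the WAM for the same weights
wpm(x, w, 2)     # never below the WAM for the same weights
owa(x, w)        # 0.51
```

In the WAM and WPM a large weight marks an influential variable, which is what "importance of each variable" refers to. In the OWA the weights say whether the model leans on the smaller or the larger inputs of each row, whichever variable they come from.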

T4. Use your model for prediction.

Using your best fitting model from T3, i.e., WAM, WPM (0.5), WPM (2), or OWA, predict Y (Appliances) for the following inputs:

X1 = 19.1, X2 = 43.29, X3 = 19.7, X4 = 43.4, X5 = 743.6

You should use the same pre-processing as in Task 2. Compare your prediction with the measured Y = 60.
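The prediction step amounts to three moves: transform the new inputs with the constants from Task 2, aggregate, then undo the transformation of Y. The sketch below assumes min-max scaling, a WAM as the best model, and X1 to X4 as the chosen variables; the ranges and weights are invented for illustration and are not results from the real data.

```r
# Hypothetical training ranges and weights (assumptions for illustration only;
# real values come from your own sample and fitted model).
x.min <- c(X1 = 17.0, X2 = 30.0, X3 = 17.0, X4 = 30.0)
x.max <- c(X1 = 25.0, X2 = 55.0, X3 = 25.0, X4 = 55.0)
y.min <- 10
y.max <- 300
w <- c(0.4, 0.1, 0.3, 0.2)

# The inputs given in the task, restricted to the four chosen variables.
new.x <- c(X1 = 19.1, X2 = 43.29, X3 = 19.7, X4 = 43.4)

# Same min-max scaling as used on the training data.
new.t <- (new.x - x.min) / (x.max - x.min)

# Weighted arithmetic mean in the transformed space.
y.t <- sum(w * new.t)

# Undo the scaling of Y to get a prediction in Wh.
y.pred <- y.t * (y.max - y.min) + y.min

# Absolute error against the measured Y = 60.
abs.err <- abs(y.pred - 60)
```

If Y was also logged or otherwise transformed before scaling, the inverse of that step has to be applied as well, in reverse order. A new input that falls outside the training range scales to a value outside [0, 1], which is worth flagging when judging whether the prediction is reasonable.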

T5. Summarise your data analysis in up to 20 slides for a 5-minute presentation

The slides should include the following content:

- Correlations between the variables;
- What kinds of data distributions you have identified in the raw data, using the histograms you have produced;
- List and explain the transformations applied for the selected four variables and the variable of interest;
- Explain the importance of the variables you have selected;
- The best fitting model on your selected data; include two tables: one with the error measures and correlation coefficients, and one summarizing the weights/parameters and any other useful information learned for your data;
- Your prediction result and a comment on whether you think it is reasonable;
- Discuss the best conditions (in terms of your chosen variables) under which a low energy use of appliances will occur;
- Comment on the implications and limitations of the fitting model you used for prediction.
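The error measures and correlation coefficients asked for in the first table can be computed by hand as a cross-check on whatever the fitting routine reports. The sketch below assumes the usual quartet of RMSE, average absolute error, Pearson and Spearman correlation; the helper name fit.stats and the five observed/predicted values are invented for the example.

```r
# Hypothetical helper: goodness-of-fit summary for one model.
fit.stats <- function(observed, predicted) {
  c(RMSE     = sqrt(mean((observed - predicted)^2)),
    AvAbsErr = mean(abs(observed - predicted)),
    Pearson  = cor(observed, predicted, method = "pearson"),
    Spearman = cor(observed, predicted, method = "spearman"))
}

# Made-up transformed Y values and model outputs, for illustration only.
observed  <- c(0.10, 0.35, 0.40, 0.80, 0.55)
predicted <- c(0.15, 0.30, 0.50, 0.70, 0.60)

stats <- fit.stats(observed, predicted)
round(stats, 3)
```

Running fit.stats once per model and binding the results with rbind gives the comparison table directly; lower errors and higher correlations point to the better fit.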

The slides should contain all necessary information to prove your findings. All the bold terms above must appear in slide titles. For the 5-minute presentation, you may provide a link to YouTube or upload an mp4 video. Any content beyond 5 minutes will not be graded.

SUBMISSION:

Submit to the SIT718 CloudDeakin Dropbox.

Your final submission must include the following TWO files:

1. The presentation slides with video, "name-slides" (pdf), covering all of the items above (where name is replaced with your name; you can use your surname or first name) (a link to YouTube or an uploaded mp4 file).

2. The R code file (that you have written to produce your results) named "name-code.R" (where name is replaced with your surname or first name; an RMD file is not allowed).

Your assignment will not be assessed if the code is missing, or the outputs of the code are
