Question: Create an R script or R Markdown file.

# read 'HousePrice.csv' and load it into a data frame
house2 <- read.csv('HousePrice.csv')

# convert continuous house price values to labels
# (prices below 1,000,000 become 'low', all others 'high')
house2$Price <- factor(ifelse(house2$Price < 1000000, 'low', 'high'))
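The labelling step is worth checking on a few values before running it on the whole file, because the order of the factor levels decides which class glm() models. With family = binomial and a factor response, R treats the first level as "failure" and the remaining level as "success". A minimal sketch, using made-up prices rather than rows from HousePrice.csv:

```r
# Toy prices (made-up values, NOT taken from HousePrice.csv)
price <- c(450000, 1200000, 999999, 1000000, 2500000)

# Same rule as in the script: below 1,000,000 is 'low', otherwise 'high'
label <- factor(ifelse(price < 1000000, 'low', 'high'))

# Levels are sorted alphabetically, so 'high' comes first and 'low' second
print(levels(label))

# Number of samples in each category
print(table(label))
```

Since 'low' is the second level, the probabilities returned later by predict(..., type = "response") are probabilities of 'low', which is why the script maps values above 0.5 to 'low'.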

# check number of samples in each category
table(house2$Price)

# divide into training and testing sets (80% train, 20% test)
# and get the training data size
sample_size <- floor(0.8 * nrow(house2))

# check the training data size
sample_size

# get train data index
train_ind <- sample(seq_len(nrow(house2)), size = sample_size)

# generate training and test dataset
train <- house2[train_ind, ]
test <- house2[-train_ind, ]
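Because sample() draws rows at random, the split changes on every run unless a seed is set first. The sketch below repeats the same 80/20 split on a small made-up data frame; the names toy, toy_train and toy_test and the seed value 42 are assumptions for illustration only and do not come from the question.

```r
# Fix the random number generator so the split is reproducible
# (42 is an arbitrary choice)
set.seed(42)

# Small made-up data frame standing in for house2
toy <- data.frame(id = 1:10, value = rnorm(10))

# 80% of the rows go to training
n_train <- floor(0.8 * nrow(toy))
idx <- sample(seq_len(nrow(toy)), size = n_train)

# Rows in idx form the training set, all remaining rows the test set
toy_train <- toy[idx, ]
toy_test <- toy[-idx, ]

print(nrow(toy_train))
print(nrow(toy_test))
```

Negative indexing with -idx guarantees that no row appears in both sets.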

# use glm to build logistic model
glm.fit <- glm(Price ~ Sqft_Area + Lot_Area + Age + Crime, data = train, family = binomial)
summary(glm.fit)

... data = train)
Coefficients:
...
Null deviance: 3597.9 on 5835 degrees of freedom
Residual deviance: 2388.4 on 5831 degrees of freedom
(70 observations deleted due to missingness)
AIC: 2318.4
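Coefficients of a logistic model are on the log-odds scale, so they are often easier to read after exponentiating them into odds ratios. The following is a minimal sketch on simulated data, not on HousePrice.csv: the data frame sim, the single predictor and the simulation parameters are all assumptions chosen only to make the example runnable.

```r
set.seed(1)  # arbitrary seed so the simulation is repeatable

# Simulated data: smaller houses are more likely to be labelled 'low'
n <- 200
sim <- data.frame(Sqft_Area = runif(n, 500, 4000))
p_low <- plogis(3 - 0.002 * sim$Sqft_Area)
sim$Price <- factor(ifelse(runif(n) < p_low, 'low', 'high'))

# Logistic regression; 'low' is the second factor level, so it is the modelled class
fit <- glm(Price ~ Sqft_Area, data = sim, family = binomial)

# Odds ratios: multiplicative change in the odds of 'low' per unit of the predictor
odds_ratios <- exp(coef(fit))
print(odds_ratios)
```

An odds ratio below 1 for Sqft_Area means that each additional square foot lowers the odds of the house being in the 'low' class.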

# predict on test dataset
predictedprob <- predict(glm.fit, newdata = test, type = "response")
head(predictedprob)

# check the probability
newdata <- data.frame(test$Sqft_Area, test$Price, predictedprob)
head(newdata)

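# plot predicted probability against square footage
# (data.frame() names the first column test.Sqft_Area)
library(ggplot2)
ggplot(newdata, aes(x = test.Sqft_Area, y = predictedprob)) + geom_point()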

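# evaluate the prediction results
# (confusionMatrix() comes from the caret package; it expects the
# predictions first and the true labels as the reference)
library(caret)
glm.pred <- factor(ifelse(predictedprob > 0.5, 'low', 'high'), levels = levels(test$Price))
confusionMatrix(data = glm.pred, reference = test$Price)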
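The main numbers reported by a confusion matrix can also be reproduced with base R, which helps when checking what the output means. A minimal sketch with six made-up observations; the vectors actual and prob_low are hypothetical and are not results from the model above.

```r
# Made-up true labels and predicted probabilities of 'low'
actual <- factor(c('low', 'low', 'high', 'high', 'low', 'high'),
                 levels = c('high', 'low'))
prob_low <- c(0.9, 0.4, 0.2, 0.7, 0.8, 0.1)

# Same 0.5 cut-off as in the script; reuse the levels of the true labels
predicted <- factor(ifelse(prob_low > 0.5, 'low', 'high'),
                    levels = levels(actual))

# Cross-tabulate predictions against the truth
cm <- table(Predicted = predicted, Actual = actual)
print(cm)

# Accuracy is the share of observations on the diagonal
accuracy <- sum(diag(cm)) / sum(cm)
print(accuracy)
```

Fixing the factor levels on both vectors keeps the table square even if one class is never predicted.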
