Question: I need help with Project 2 for my data science class. Below are my research question and background on the topic. I need help with the R Markdown in RStudio. I will also upload the instructions.

Research Project: Internet Use and Life Satisfaction

Overall Summary:

This project examines the relationship between internet use in various countries and overall happiness. The dataset, sourced from Our World in Data, contains information on the percentage of the population using the internet in a particular country and the average life satisfaction score in each country. To test this relationship formally, countries are classified into two categories: those with high internet use and those with low internet use. The project then examines the question of whether the average life satisfaction is statistically different between these two groups of countries.

Step 1: Research Question

Is there a difference in average life satisfaction between countries with high internet use and countries with low internet use?

Step 2: Dataset Used

Source: Our World in Data: https://ourworldindata.org/happiness-and-life-satisfaction?

  • Description: The dataset contains global data on life satisfaction and internet usage for many countries and years. For this analysis, I will use only the latest year of available data so that the analysis is a cross-sectional snapshot. Each row represents a different country.
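
The "latest year" filter described above could look something like the following in an R Markdown chunk. This is only a sketch: the data frame `owid` and the column names `country`, `year`, `internet_pct`, and `life_satisfaction` are assumptions (the real Our World in Data export will use different names), and the values are made up for illustration. It also takes one reading of "latest year", namely the most recent year in which both variables were observed.

```r
# Toy stand-in for the downloaded data; all values are made up for illustration.
# Column names are assumptions, not the real export's names.
owid <- data.frame(
  country           = c("A", "A", "B", "B", "C", "C"),
  year              = c(2021, 2022, 2021, 2022, 2021, 2022),
  internet_pct      = c(55, 60, 80, 85, 30, NA),
  life_satisfaction = c(5.1, 5.3, 6.8, 7.0, 4.2, 4.3)
)

# Keep only rows where both variables were observed.
complete <- owid[complete.cases(owid[, c("internet_pct", "life_satisfaction")]), ]

# One reading of "latest year": the most recent year left in the complete data.
latest_year <- max(complete$year)
snapshot <- complete[complete$year == latest_year, ]

# Each country should now appear at most once (one row per country).
stopifnot(!any(duplicated(snapshot$country)))
```

If many countries are missing in the single latest year, keeping each country's own most recent complete row is another option, at the cost of mixing years.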

Step 3: Hypotheses

We are testing whether there is a statistically significant difference in mean life satisfaction between countries with high internet usage (above 70%) and countries with low internet usage (70% or below).

  • Categorical Variable: "Internet Use Group" (two levels: "High" and "Low")
  • Quantitative Variable: "Life satisfaction"
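
As a sketch of how the grouping variable could be built in R from the 70% cutoff stated above: the vector name `internet_pct` and its values are hypothetical and only there to show how the boundary case (exactly 70%) is handled.

```r
# Hypothetical internet-use percentages, made up for illustration.
internet_pct <- c(35.2, 70.0, 70.1, 92.4)

# "High" means strictly above 70%; exactly 70% or below is "Low".
internet_group <- factor(ifelse(internet_pct > 70, "High", "Low"),
                         levels = c("Low", "High"))

# Count how many countries fall in each group.
table(internet_group)
```

Checking the group counts with `table()` before testing is worthwhile, since a very small group would make the comparison unreliable.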

Null Hypothesis (H0): There is no difference in the mean life satisfaction of countries with high internet usage and countries with low internet usage.

  • H0: $\mu_{\text{high}} = \mu_{\text{low}}$ (where $\mu_{\text{high}}$ is the population mean life satisfaction in countries with high internet use and $\mu_{\text{low}}$ is the corresponding mean for countries with low internet use)

Alternative Hypothesis (Ha): There is a difference in the mean life satisfaction of the two groups.

  • Ha: $\mu_{\text{high}} \neq \mu_{\text{low}}$

This will be tested using an independent two-sample t-test.
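
A minimal sketch of that test in R, using the base `t.test()` function. The data frame `snapshot`, its column names, and its scores are made up for illustration and should be replaced with the real data. The choice of `var.equal = FALSE` (Welch's version, R's default) is an assumption; the project description does not say whether equal variances are assumed.

```r
# Made-up scores for illustration only; replace with the real one-row-per-country data.
snapshot <- data.frame(
  internet_group    = factor(rep(c("High", "Low"), each = 5)),
  life_satisfaction = c(6.9, 7.2, 6.5, 7.4, 6.8, 4.9, 5.3, 4.4, 5.8, 5.1)
)

# Two-sided independent two-sample t-test.
# var.equal = FALSE gives Welch's test, which does not assume
# that the two groups share the same variance.
result <- t.test(life_satisfaction ~ internet_group, data = snapshot,
                 alternative = "two.sided", var.equal = FALSE)

result$estimate  # sample mean of each group
result$p.value   # compare with the chosen significance level, e.g. 0.05
```

Printing `result` on its own shows the t statistic, degrees of freedom, and a confidence interval for the difference in means, which is usually what the write-up needs.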

Step 4: Variables and Their Types:

| Variable Name | Description | Type | Measurement Level |
| --- | --- | --- | --- |
| Internet Use Group | The categorical variable selected for this hypothesis test. It was created from the quantitative variable Internet users (% of population). Countries with internet usage above 70% were labeled "High"; countries at 70% or below were labeled "Low". | Categorical | Nominal |
| Life satisfaction | Average self-reported happiness or satisfaction score (0-10 scale). | Quantitative | Continuous (Interval Scale) |
| Internet users (% of population) | Original quantitative variable used to create the Internet Use Group. | Quantitative | Continuous (Ratio Scale) |
| Country | Name of the country. | | |
