Question: I need help with my project 2 for my data science class. This is my research question and information on the topic. I need help
I need help with my project 2 for my data science class. This is my research question and information on the topic. I need help with the R Markdown in RStudio. I will also upload the instructions.
Research Project: Internet Use and Life Satisfaction
Overall Summary:
This project examines the relationship between internet use in various countries and overall happiness. The dataset, sourced from Our World in Data, contains information on the percentage of the population using the internet in a particular country and the average life satisfaction score in each country. To test this relationship formally, countries are classified into two categories: those with high internet use and those with low internet use. The project then examines the question of whether the average life satisfaction is statistically different between these two groups of countries.
Step 1: Research Question
Is there a difference in the average life satisfaction between countries where people use the internet much and countries where people use the internet a little?
Step 2: Dataset Used
Source: Our World in Data: https://ourworldindata.org/happiness-and-life-satisfaction?
- Description: The dataset contains global data on life satisfaction and on the internet usage of several countries and years. For this analysis, I will use the latest year of available data to be certain that I am getting a snapshot of the cross-section. Each row represents a different country.
Step 3: Hypotheses
We are testing if there is a statistically significant difference between the mean score of life satisfaction between countries with high internet usage (above 70%) and countries with low internet usage 70% or below.
- Categorical Variable: "Internet Use Group (has two codes: "High" and "Low")
- Quantitative Variable: "Life satisfaction"
Null Hypothesis (H0): There is no difference in the mean life satisfaction of countries with high internet usage and countries with low internet usage.
- H: _high = _low (where mu high is the population mean of life satisfaction in countries with high internet use and mu low is for low internet use countries)
Alternative Hypothesis (H): There is a difference in the mean life satisfaction of the two groups.
- H: _high _low
This will be tested using an independent two-sample t-test.
Step 4: Variables and Their Types:
Variable Name | Description | Type | Measurement Level |
Internet Use Group | The categorical variable I selected for this hypothesis test. It was created from the quantative variable Internet Users (% of population). Countries with the internet usage above the 70% or below were labeled "Low". | Categorical | Nominal |
Life satisfaction | Average self-reported happiness or satisfaction score (0-10 scale). | Quantitative | Continuous (Interval Scale) |
Internet users (% of population) | Original quantitative variable used to create the internet Use Group. | Quantitative | Continuous (Ratio Scale) |
Country | Name of the country. |
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
