Question
This dataset (linked below) is a subset of the 2013-2014 individual sample file provided by the ATO. It has been edited to include only a subset of the cases and variables.
The data dictionary for this dataset is given in the following table.
Please use the link at the end of the question to get the raw data (an Excel file) for the graphs and other information.
Variable Name     Description
Lodgment_method   Lodgement method (A = Tax Agent, S = Self Preparer)
Sw_amt            Salary/wage amount (AUD)
Tot_inc_amt       Total income (AUD)
Tot_ded_amt       Total deductions (AUD)
All data processing should be performed in Excel or StatKey.
01
What are the variables and what are their types?
02
Did around 25% of taxpayers in 2013-2014 self-prepare their tax return? Using Dataset 1 (linked below), provide the frequency and the proportion (either as a decimal or a percentage) for each lodgement method. You also need to provide a graphical display that clearly shows the proportion of each lodgement method. Finally, comment on your findings and answer the question.
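The task asks for Excel or StatKey, so the following Python is only a minimal sketch for cross-checking the frequency table by hand. The values are hypothetical toy data, not rows from the ATO file, and the helper name frequency_table is made up for illustration.

```python
from collections import Counter

# Hypothetical toy values for Lodgment_method, NOT taken from the ATO dataset.
lodgment_method = ["A", "A", "S", "A", "S", "A", "A", "A"]


def frequency_table(values):
    """Return {category: (count, proportion)} for a categorical column."""
    counts = Counter(values)
    total = len(values)
    return {category: (n, n / total) for category, n in counts.items()}


table = frequency_table(lodgment_method)
for category, (count, proportion) in sorted(table.items()):
    print(f"{category}: count={count}, proportion={proportion:.3f}")
```

The proportion for S is the figure to compare against the 25% claim; a pie chart or bar chart of the same counts gives the graphical display.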
03
Is the average salary of taxpayers in 2013-2014 less than $45,000? Using Dataset 1 (linked below), describe the distribution of the salary amount of Australian taxpayers in 2013-2014. You need to provide a numerical summary (sample size, mean, standard deviation and median) as well as a graphical display that shows the outliers, if any. Finally, comment on your findings and answer the question.
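As a rough cross-check outside Excel or StatKey, the numerical summary and the usual boxplot outlier rule (1.5 times the IQR beyond the quartiles) could be sketched as follows. The salaries are hypothetical toy values, not data from the ATO file, and quartile conventions differ between tools, so the fences may shift slightly compared with a spreadsheet.

```python
import statistics

# Hypothetical toy salaries in AUD (Sw_amt), NOT taken from the ATO dataset.
sw_amt = [0, 32000, 41000, 45000, 52000, 58000, 250000]

summary = {
    "n": len(sw_amt),
    "mean": statistics.mean(sw_amt),
    "stdev": statistics.stdev(sw_amt),  # sample standard deviation (n - 1)
    "median": statistics.median(sw_amt),
}

# Boxplot-style fences: points beyond 1.5 * IQR from the quartiles are flagged.
q1, _, q3 = statistics.quantiles(sw_amt, n=4, method="inclusive")
iqr = q3 - q1
low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = sorted(x for x in sw_amt if x < low_fence or x > high_fence)

print(summary)
print("outliers:", outliers)
```

With a strongly right-skewed variable such as salary, the mean can sit well above the median, which is worth keeping in mind when answering the question about the average.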
04
For total incomes between $75,000 and $80,000, is there a difference in the total deduction between the lodgement methods? Using Dataset 1 (linked below), first filter the total income to include only incomes from $75,000 to $80,000 (inclusive). Then provide a numerical summary of the total deduction grouped by lodgement method. You also need to provide a graphical display that shows any outliers. Finally, comment on your findings and answer the question.
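The filter-then-group step can be sketched in Python as a sanity check on the Excel filter. The rows are hypothetical toy tuples of (Lodgment_method, Tot_inc_amt, Tot_ded_amt), not data from the ATO file; note that both ends of the income band are included.

```python
import statistics
from collections import defaultdict

# Hypothetical toy rows: (Lodgment_method, Tot_inc_amt, Tot_ded_amt).
# NOT taken from the ATO dataset.
rows = [
    ("A", 75000, 1200), ("A", 78000, 3400), ("A", 80000, 2600), ("A", 90000, 5000),
    ("S", 76000, 800), ("S", 79500, 1500), ("S", 74999, 300), ("S", 80000, 1000),
]

# Keep only incomes from $75,000 to $80,000, inclusive at both ends.
deductions_by_method = defaultdict(list)
for method, income, deduction in rows:
    if 75_000 <= income <= 80_000:
        deductions_by_method[method].append(deduction)

deduction_summary = {
    method: {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "median": statistics.median(values),
    }
    for method, values in deductions_by_method.items()
}

for method, stats in sorted(deduction_summary.items()):
    print(method, stats)
```

Side-by-side boxplots of the two groups are the natural graphical counterpart, since they show outliers in each group on a common scale.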
05
Is there any relationship between total income amount and total deduction amount for self-preparers? Using Dataset 1 (linked below), first filter the data to include only the self-preparer lodgement method, then describe the relationship between total income amount and total deduction amount. You need to provide both a numerical summary and a graphical display. Finally, comment on your findings and answer the question.
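One common numerical summary for a relationship between two quantitative variables is the Pearson correlation together with a least-squares line; the question does not name a specific statistic, so treating it that way is an assumption. The sketch below uses hypothetical toy rows, not data from the ATO file, and the helper name pearson_r_and_line is made up for illustration.

```python
import math

# Hypothetical toy rows: (Lodgment_method, Tot_inc_amt, Tot_ded_amt).
# NOT taken from the ATO dataset.
rows = [
    ("S", 30000, 500), ("A", 52000, 3000), ("S", 45000, 900),
    ("S", 60000, 1200), ("A", 91000, 4100), ("S", 80000, 2100),
]

# Keep only self-preparers (Lodgment_method == "S").
income = [inc for method, inc, _ in rows if method == "S"]
deduction = [ded for method, _, ded in rows if method == "S"]


def pearson_r_and_line(x, y):
    """Return (r, slope, intercept) for the least-squares line y = intercept + slope * x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    slope = sxy / sxx
    return r, slope, mean_y - slope * mean_x


r, slope, intercept = pearson_r_and_line(income, deduction)
print(f"r={r:.3f}, slope={slope:.5f}, intercept={intercept:.1f}")
```

A scatterplot of total deduction against total income is the matching graphical display; the correlation only measures linear association, so the plot should be checked for curvature and outliers before interpreting r.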
The link to the file is given below. The file is an Excel workbook containing all the raw data; StatKey or Excel can be used to generate the graphs.
Download the file and work on it. Thanks.
https://www.dropbox.com/scl/fi/65n8eds6bduyr24w2kizj/Datasets-and-all-tabs.xlsx?dl=0&rlkey=j01dv3feyq9u85dxgfqtc8zoh
