Statistical Assigment 2 This statistical assignment gets your hands dirty with R. This statistical assignment is...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Statistical Assigment 2 This statistical assignment gets your hands "dirty" with R. This statistical assignment is due at noon on Monday May 1. You may work in teams of 2 or 3 (or by yourself...) and hand in one assignment for the team. Make sure that all names as they appear on Albert of the team are on the assignment. 20% will be docked for each day late the assignment is handed in. 1 Dataset Details The dataset for this assignment "data_assign2.csv" comes from the US Census. The sample in your dataset covers women with at least 2 children. The dataset consists of 254,645 observations from the 1990 census. There are variables in the dataset are: . thirdkid: indicator equal to one if mother has 3 or more kids (i.e., =1 if kids 3, =0 if kids <3) boy1st: indicator equal to one if first child is a boy (i.e., =1 if First child is boy, =0 if First child is girl) boy2nd: indicator equal to one if second child is a boy (i.e., =1 if Second child is boy, =0 if Second child is girl) ageml: age of the mother in 1990 agefstm: age of the mother at first birth black: indicator equal to one if mother is African American (and = 0 otherwise) hisp: indicator equal to one if mother is Hispanic (and =0 otherwise) othrace: indicator equal to one if mother is other non-white race (and =0 otherwise) (e.g., Filipino, Pacific Islander, Multi-racial) workedm: indicator equal to one if mother is working (=0 if not working) incomem: weekly labour market income of the mother (not used in this assignment, but there if you are interested). General Instructions Please hand in assignment as one document. Please answer the questions succinctly. For this assignment, we are interested in estimating the following relationship: Yi Bo + Bithirdkid; + where: y: Indicator for mother i working (i.e., the 'workedm' variable in your dataset) thirdkid: Indicator for mother i having three children Question 1 What is the probability of having a third child for parents whose first two children are the same gender? What about for parents whose first two children are of different genders? Describe why this provides a 'first-stage' for the instrument. Question 2 Estimate the 'first-stage' and 'second-stage' of the instrumental variable regression using "same gender for first two kids" as the instrument without any additional controls. Interpret the point estimate for both the 'first-stage' and 'second-stage'. Question 3 Run the instrumental variable regression without any controls. Interpret the resulting point estimate and clearly describe which type of individuals is identifying the 'LATE'. Question 4 Discuss the (internal) validity of the instrument. In addition, state who exactly a defier would be in this example and whether the no defier (or monotonicity) assumption is likely to hold. Question 5 Now run the IV regression with the following controls: race, age of the mother, age of the mother squared, age of the mother at first birth, age of the mother at first birth squared. Does the point estimate change relative to question 3? Is this reassuring in terms of (internal) validity? Describe why or why not. Question 6 We could imagine doing a different instrumental variable, but with a similar spirit. Specif- ically, there is a large amount of research that many cultures have a "boy" preference for children (i.e., parents prefer having boys to girls). In that spirit, we could define the in- strumental variable instead as having two girls" relative to "having two boys" (for first two children). Intuitively, if prior research is correct having two girls should induce some individuals to have another child relative to having two boys. Create that instrument and run the IV regression (with controls). Why do you think the estimate differs substantially from the IV estimate in Q5? [hint: think of the LATE!] Statistical Assigment 2 This statistical assignment gets your hands "dirty" with R. This statistical assignment is due at noon on Monday May 1. You may work in teams of 2 or 3 (or by yourself...) and hand in one assignment for the team. Make sure that all names as they appear on Albert of the team are on the assignment. 20% will be docked for each day late the assignment is handed in. 1 Dataset Details The dataset for this assignment "data_assign2.csv" comes from the US Census. The sample in your dataset covers women with at least 2 children. The dataset consists of 254,645 observations from the 1990 census. There are variables in the dataset are: . thirdkid: indicator equal to one if mother has 3 or more kids (i.e., =1 if kids 3, =0 if kids <3) boy1st: indicator equal to one if first child is a boy (i.e., =1 if First child is boy, =0 if First child is girl) boy2nd: indicator equal to one if second child is a boy (i.e., =1 if Second child is boy, =0 if Second child is girl) ageml: age of the mother in 1990 agefstm: age of the mother at first birth black: indicator equal to one if mother is African American (and = 0 otherwise) hisp: indicator equal to one if mother is Hispanic (and =0 otherwise) othrace: indicator equal to one if mother is other non-white race (and =0 otherwise) (e.g., Filipino, Pacific Islander, Multi-racial) workedm: indicator equal to one if mother is working (=0 if not working) incomem: weekly labour market income of the mother (not used in this assignment, but there if you are interested). General Instructions Please hand in assignment as one document. Please answer the questions succinctly. For this assignment, we are interested in estimating the following relationship: Yi Bo + Bithirdkid; + where: y: Indicator for mother i working (i.e., the 'workedm' variable in your dataset) thirdkid: Indicator for mother i having three children Question 1 What is the probability of having a third child for parents whose first two children are the same gender? What about for parents whose first two children are of different genders? Describe why this provides a 'first-stage' for the instrument. Question 2 Estimate the 'first-stage' and 'second-stage' of the instrumental variable regression using "same gender for first two kids" as the instrument without any additional controls. Interpret the point estimate for both the 'first-stage' and 'second-stage'. Question 3 Run the instrumental variable regression without any controls. Interpret the resulting point estimate and clearly describe which type of individuals is identifying the 'LATE'. Question 4 Discuss the (internal) validity of the instrument. In addition, state who exactly a defier would be in this example and whether the no defier (or monotonicity) assumption is likely to hold. Question 5 Now run the IV regression with the following controls: race, age of the mother, age of the mother squared, age of the mother at first birth, age of the mother at first birth squared. Does the point estimate change relative to question 3? Is this reassuring in terms of (internal) validity? Describe why or why not. Question 6 We could imagine doing a different instrumental variable, but with a similar spirit. Specif- ically, there is a large amount of research that many cultures have a "boy" preference for children (i.e., parents prefer having boys to girls). In that spirit, we could define the in- strumental variable instead as having two girls" relative to "having two boys" (for first two children). Intuitively, if prior research is correct having two girls should induce some individuals to have another child relative to having two boys. Create that instrument and run the IV regression (with controls). Why do you think the estimate differs substantially from the IV estimate in Q5? [hint: think of the LATE!]
Expert Answer:
Related Book For
Posted Date:
Students also viewed these programming questions
-
Planning is one of the most important management functions in any business. A front office managers first step in planning should involve determine the departments goals. Planning also includes...
-
List three specific parts of the Case Guide, Objectives and Strategy Section (See below) that you had the most difficulty understanding. Describe your current understanding of these parts. Provide...
-
Write a program MooresLaw that takes a command-line argument \(n\) and outputs the increase in processor speed over a decade if microprocessors double every \(n\) months. How much will processor...
-
A 1200-kg car moving on a horizontal surface has speed u = 85 km/h when it strikes a horizontal coiled spring and is brought to rest in a distance of 2.2 m. What is the spring stiffness constant of...
-
What is an equity swap? What is a commodity swap?
-
Suppose \(V=V(S)\). Find the most general solution of the Black-Scholes equation.
-
Automotive Products (AP) designs, manufactures, and sells automotive parts. Actual variable manufacturing overhead costs for 2012 were $308,600. AP's simple costing system allocates variable...
-
The design, layout, and content of a Web site are critical to it being easily found by search engines. It is important to recognize these concepts from your own use and experience of the Web. You...
-
I farm on a small scale. Last year, I started off by planting soybeans and corn. This year, the price of soybeans increased, and the price of corn remained the same. Naturally, I planted more corn...
-
The average price of a random sample of 12 bottles of diet salad dressing taken from different stores is $1.43. The standard deviation is 0.09. The average price of a random sample of 16 low calorie...
-
Do you consider your own culture to be high-contact or low-contact? How does it feel to interact with people whose cultural norms for touch and proximity are highly different from your own?
-
Which barriers to effective listening are the most common for you?
-
What are some ways to reinforce the core values of your organization at the start of a meeting?
-
Which social customs are important for your own culture? Why would failing to observe social customs in another culture be problematic?
-
How are power distance, uncertainty avoidance, and cultural masculinity/femininity relevant for business communication?
-
Explain why temporary differences result in deferred tax assets or deferred tax liabilities while permanent differences do not, and describe the difference in the formation of deferred tax assets and...
-
The Strahler Stream Order System ranks streams based on the number of tributaries that have merged. It is a top-down system where rivers of the first order are the headwaters (aka outermost...
-
1. Ryder, a police officer in Philadelphia, was alone in his patrol car when he received a radio call of a man with a gun. Ryder was the first officer to arrive and was confronted by a suspect...
-
Working in teams of three or four, interview personnel at your local bank to check liability for lost or stolen bank-issued ATM, credit, and debit cards.
-
Match each of the numbered definitions with the correct term in the following list. Write the letter of your choice in the answer column. a. Beneficiary b. Bequest c. Codicil d. Decedent e. Devise f....
-
Define trust region fidelity.
-
Is the result of any Trust Region method depends on the initial guess value \(X_{0} \in \mathbb{R}^{n}\) ? Justify your answer.
-
Discuss the convergence criteria of Trust Region method, show how the other parameters \(\left(\tau_{1}, \tau_{2}, \tau_{3} ight.\) and \(\left.\tau_{4} ight)\) affect the convergence.
Study smarter with the SolutionInn App