Problem Set # 2
Master in Business and Quantitative Methods
Contents
Chapter 3. Exercises on limited dependent variables
Chapter 4. Exercises on panel data
1
0.1 Exercises on limited dependent variables
Note: the problems (1,2 and 3) of this section use the setup of Problem 1.
Problem 1 The following table reports observations that are drawn from a censored normal distribution: The underlying model is:
3.8396 7.2040 0 0 4.4132 8.0230 5.7971 7.0828 0 0.80260 13.0670 4.3211
0 8.6801 5.4571 0 8.1021 0
1.2526 5.6016
yi∗ =µ+εi,
yi =yi∗ if µ+εi >0, 0 otherwise εi ∼N(0, σ2).
(1)
The OLS estimator of µ in the context of this tobit model is simply the sample mean.
1. Compute the mean of all 20 observations. Would you expect this esti- mator to overestimate or underestimate µ?
2. If we consider only the nonzero observations, then the truncated re- gression model applies. The sample mean of the nonlimit observations is the least squares estimator in this context. Compute it and then comment on whether this sample mean should be an overestimate or underestimate of the true mean.
Problem 2 We now consider the tobit model that applies to the full data
Problem 4 A top university requires all students to write an entry exam.
Students that obtain a score less than 100 are not admitted. Students with score above 100, the scores are registered. Then, the university selects stu- dents, from this group, for admittance. For each student, they observe the result of the exam:
reject, if the score is less than 100 or the score, if it is 100 or more.
Imagine that the dean is interested in the relationship between the students’
background characteristics and the score of the entry exam. He specifies the following model:
y∗i =β0+xi0β1+εi, εi∼N(0, σ2) yi =y∗i if yi∗≥100
yi =rejected if yi∗ <100,
where yi is the observed score of student i and xi the vector of background characteristics (excluding an intercept).
1. The dean does a regression ofyi onxi and a constant, using the scores of 100 and more. Show that this approach does not lead to consistent estimators of β1.
0.2 Exercises on panel data
Problem 5 The file panel.xls constains data on investment (y) and profit (x) forn= 3 firms over T = 10periods.
1. Pool the data and compute the least squares regression coefficients of the model yit=α+βxit+εit.
2. Estimate the fixed effects model and test the hypothesis that the con- stant term is the same for all three firms.
3. Carry out Hausman’s specification test for the random versus fixed effect model.
Problem 6 Suppose the fixed effects model yi=Xiβ+iαi+εi
is formulated with an overall constant term and n−1 dummy variables instead of n. Investigate the effect that this supposition has on the set of dummy variable coefficients and on the least squares estimates of the slopes.
Problem 7 Consider the following simple panel data model yit=xitβ+α∗i +εit, i= 1, ..., n t= 1, ..., T where β is one dimensional and where it is assumed that
α∗i = ¯xiλ+αi, with αi ∼N ID(0, σα2) εit∼N ID(0, σε2) mutually independent and independent of allxits, where x¯i =PT
t=1xit. The parameterβ can be estimated by the fixed effects (or within) estimator given by
βˆF E = Pn
i=1
PT
t=1(xit−x¯i)(yit−y¯i) Pn
i=1
PT
t=1(xit−x¯i)2
As an alternative, the correlation between the error term αi∗+εit and xit can be handled by instrumental variables.
1. Give an expression for theβˆIV usingxit−x¯i as an instrument for xit. Show that βˆIV and βˆF E are identical.
2. Another way to eliminate the individual effects α∗i from the model is doing the following transformation:
yit−y¯i = (xit−x¯i)β+ (εit−ε¯i)
Which is the OLS estimator (β) based on this model? In which condi-ˆ tions is (βˆ) a consistent estimator of β?
3. Consider the between estimator βˆB for β. Give an expression for βˆB
1. Give an expression for the probability that rit = 1 given zit and ψi. 2. Compute the likelihood for this model.
3. Explain why it is not possible to treatψs as fixed unknown parameters and estimateδconsistently (for fixed T) from this fixed effects probit?