STATISTICS AND ECONOMETRICS
Problem Set IV
Fall 2011
Jose G. Montalvo Universitat Pompeu Fabra
PART A
1. The le wages.dta contains the data of Blackburn and Newmark (1992)
"Unobserved ability, e ciency wages and interindustry wage", Quarterly Journal of Economics, 107, 1421-1436. The data are organized as follows (obs=935):
wage: monthly earnings hours: average weekly hours IQ: IQ score
KWW: knowledge of world work score educ: years of education
exper: years of work experience tenure: years with current employer age: age in years
married: =1 if married black: =1 if black
south: =1 if live in south urban: =1 if live in SMSA sibs: number of siblings brthord: birth order
meduc: mother's education feduc: father's education
a) Consider the following speci cation
ln(wagei) = 1+ 2EDU Ci+ 3EXP ERi+ 4EDU Ci EXP ERi+othersi+ui where others include tenure, married, south, black and urban. What is the interpretation of 2? Why is this coe cient so important? Estimate the model and discuss the results. Does the estimated coe cient of 3 makes sense?
b) Write down the return for another year of experience for someone with 12 years of education and 10 years of experience using simple calculus. What is your estimate for another year of experience?
c) Reparametrize the model so that the coe cient on EXPER is the return to another year of experience starting at EXPER=10 and educ=12.
Call the coe cient of the new variable : Estimate the model and obtain a 90% coe cient interval for :
d) Drop the cross product term (experience by education) and add the variables IQ. Why do you want to include this variable?. What happen with the return to education? Explain.
e) Include KWW in the regression in d). Comment on the individual statistical signi cance of educ and exper. Are these three varaibles jointly signi cant? Explain.
f) Run the regression in e) but without IQ and KWW. Is the return to education higher or lower than before? Explain.
g) Estimate the return to education, using the model in e), at exper=10 and obtain a 95% con dence interval.
2*. (Only for advance students. Alternative to 1.) Hausman, Hall and Griliches (Econometrica, 1984) suggest that the Poisson distribution approx- imates relatively well the distribution of patents granted to a rm in any given year (nit). The speci cation used is
P r(nit) =
nit it e it
nit! (1)
where is the conditional expectation of the distribution. The speci ca- tion of this conditional distribution is
E(nitj Xit) = it; nit = exp( +
j=5X
j=0
jln(R&Dt j) + 6T + uit) (2) where T represents a time trend.
(a) Derive the log-likelihood function of a sample of N rms over T periods for this Poisson speci cation. Derive a estimator for the information matrix.
Is the likelihood function globally concave? Why?
(b) The le PATENTS.PRN contains the data used by Hall, Griliches and Hausman ("Patents and R&D: Is there a lag," International Economic
Review, 1986. You can nd this paper in the website of the course) and extend the original sample of 128 rms used by Hall, Hausman and Griliches (1984). The structure of the data is as follows: rm id, industry number, scienti c sector dummy, ln(capital in 1972), number, ln(R&D) (10 numbers, from 1970 up to 1979), number of patents (10 numbers, from 1970 up to 1979). There are 346 rms in this le. Using this data set estimate a Poisson model with the speci cation presented above.
(c) Use the likelihood ratio principle to test the joint hypothesis that
1 = 2 = 3 = 4 = 5 = 0.
PART B
3. The Poisson distribution is a good description of the probability of a rare disease when we are random sampling from a large number of inhabitants or the probability of getting a patent in a large random sample of rms. The discrete function is given by
f (X; ) = e X
X! ; X = 0; 1; 2; 3; :::
For a random sample from this Poisson distribution a) Obtain the MLE estimator for :
b) Show that your estimator is unbiased and consistent.
c) Derive the Cramer-Rao lower bound for any unbiased estimator of : Show that your ML estimator attains that bound.
c) Obtain the likelihood ratio test for testing H0 : = 2 versus H1 : 6= 2:
Derive also the Wald and the Lagrange Multiplier test statitistic for the same null hypothesis.
4. Consider the random variables x1 and x2, which are bivariate normal with x1 N (0; 12); x2 N (0; 22); and correlation : Show that the ex- pectation of x1 conditional on x2 is linear on x2: Calculate the variance of x1 conditional on x2: How are these results modi ed if the means of x1 and x2 are 1 and 2? What is the importance of these result? (Hint: notice that we have justi ed linear regression as an approximation to a potentially non-linear conditional expectation).
5*. (Only for advance students. Alternative to 4) Let's assume that X is a random vector normally distributed,
X =
"
x1 x2
#
E(X) =
"
1 2
#
V CV (X) =
"
11 12
21 22
#
(3)
If you feel more confortable working with scalars then convert the matrices in scalars and solve the questions below.
(a) Show that there is a non-singular transformation, Z = CX, such that Z can be partioned into two subvectors, one corresponding to x2and the other distributed independently of x2. Hint: Use the following transformation
z1 = x1 x2 (4)
z2 = x2 (5)
Calculate E(Z) and V CV (Z).
(b) Calculate the joint density function of (x1; x2) and the conditional density of x1 given x2, f (x1 j x2). Interpret the expectation and the variance- covariance matrix of this conditional density in terms of a regression.
(c) Explain the importance of the previous results in the context of sto- chastic regressors and normality. Explain the relevance of the linearity of the conditional expectation under this conditions in terms of the interpretation of linear regression.
6. Consider the following speci cation for heteroskedasticity
b
u2i = 1+ 2Z2i+ 3Z3i+ ::: + sZsi+ vi
a) Describe a F tets for the null hypothesis that all the parameters except the constant are equal to 0 (H0 : 2 = 3 = ::: = s = 0;no heteroskedastic- ity) using the sum of squares residual approach. Explain what is the form of the sum of squares residuals without constrains and with constrains.
b) When we discuss in class the issue of asymptotic properties one of the results implied that the asympotitc 2 test could be written as the product of the degrees of freedom of the numerator of the corresponding F by the value of the F (rF). Use that result to obtain the asymptotic distribution of the F test in section a.
c) Explain why var(ud2i) =var(vdi): Explain why the var(vdi) =
Pbv2i
N s
d) Show that var(ud2i) = N1 P(ub2i ub2i): Use the method of moments to explain the appropriatness of the calculation.
e) In the expresion of the chi-square of b) you got in the denominator var(vdi): Use the equality in c) and the de nition of the estimated variance of u2i to show that the chi-square distribution in b) can be written also a N R2:
f) Under what conditions the result in e) can be called White's test?
7. Consider the following SURE system
Y1 = X1 1+ u1 (6)
Y2 = X2 2+ u2 (7)
where X1 and X2 are non-stochastic matrices. Assuming that Eu1 = 0 and Eu2 = 0. The variance-covariance matrix of the perturbations is characterized by the following conditions: E(u1u01) = 11I; E(u2u02) =
22I; E(u1u02) = 12I where the values of ij are known. Besides, the X's are orthogonal, X10X2 = 0.
a) Obtain the SURE estimators and show their relationship with the least squares estimator. Interpret.
b) Compare the variance-covariance of the LS estimator with the variance- covariance of the SURE estimator.