ANSWER ALL QUESTIONS ON THE EXAM PAPER AND SHOW FULL WORKING Question 1 Heteroskedasticity[10 marks] A researcher on the economics of innovation is interested in the country-level factors that...

1 answer below »


ANSWER ALL QUESTIONS ON THE EXAM PAPER AND SHOW FULL WORKING






Question 1
Heteroskedasticity[10 marks]





A researcher on the economics of innovation is interested in the country-level factors that predict R&D (Research and Development) investment.To study this, they take data on 27 OECD countries and regress R&D investment (as a percentage of GDP) against economic output (measured as the log of GDP), and educational attainments (the fraction of young people that have some tertiary education).


The following regression equation is used to model the relationship. Hereydenotes the fraction of expenditure on R&D andandare the GDP and educational variables respectively.



The economist is worried that the error term from this model might be ‘heteroskedastic’.Give a brief definition/description of heteroskedasticity and explain why it is a problem for regression models such as this.
















(2 marks)


A plot of the residual terms against the log of real GDP is given below in Figure 1.On the basis of this graph, what would you conclude about the error variance in your model?What problems would it raise for your estimation? Discuss.




Figure 1. Residuals Log GDP per Capita – R&D Model




























(2 marks)


Suppose you feel that the error term of the stated model is heteroskedastic and has the structure



Perform a GLS transformation of the modelsuch that the errors will be homoskedastic. State the transformed model.











(3 marks)



Prove theoretically that the errors from the transformed model will have a constant variance.












(3 marks)






Question 2
Autocorrelation[8 marks]





A researcher studying energy markets is trying to produce a model for daily electricity prices () based upon daily temperatures (). A data set containing 201 observations is analyzed where prices are measured in cents per kilowatt-hour while temperatures are in degrees Celsius. The model below is estimated



where both variables are stationary.


The output from Eviews is given in Table 1.





Table 1. Eviews Output for Electricity Model

























































































































































Dependent Variable: PRICE







Method: Least Squares













Sample: 1 201









Included observations: 201



























Variable



Coefficient



Std. Error



t-Statistic



Prob.























C



10.67810



1.115281



9.574360



0.0000



TEMP



0.571329



0.055512



10.29205



0.0000























R-squared



0.347383



Mean dependent var



22.05066



Adjusted R-squared



0.344104



S.D. dependent var



2.646967



S.E. of regression



2.143710



Akaike info criterion



4.372854



Sum squared resid



914.5033



Schwarz criterion



4.405722



Log likelihood



-437.4718



Hannan-Quinn criter.



4.386154



F-statistic



105.9263



Durbin-Watson stat



0.386975



Prob(F-statistic)



0.000000






























































To visualise the model the researcher also produces the following plot:




Figure 2. Residual, Actual and Fitted Series for Electricity Price Model






Note: Observation numbers ordered by time are given on the horizontal axis. The right vertical axis gives price and the left vertical axis gives the residual.



Based on Figure 2 comment briefly on the performance of the model. Are the standard errors reported in Table 1 likely to be correct? Why or why not?














(2 marks)


A Lagrange Multiplier (LM) test may be used to check for autocorrelation in the residuals of the model given in Table 1. The output of the test is given below in Table 2.




Table 2. Lagrange Multiplier Test for Autocorrelation – Electricity Price Model



































































































































































































































Breusch-Godfrey Serial Correlation LM Test:

























F-statistic



167.8504



Prob. F(2,197)



0.0000



Obs*R-squared



126.6675



Prob. Chi-Square(2)



0.0000

































Test Equation:









Dependent Variable: RESID







Method: Least Squares













Sample: 1 201









Included observations: 201







Presample missing value lagged residuals set to zero.























Variable



Coefficient



Std. Error



t-Statistic



Prob.























C



-0.107613



0.681878



-0.157818



0.8748



TEMP



0.005326



0.033939



0.156932



0.8755



RESID(-1)



0.800328



0.071299



11.22498



0.0000



RESID(-2)



-0.008061



0.071390



-0.112916



0.9102























R-squared



0.630186



Mean dependent var



3.92E-15



Adjusted R-squared



0.624555



S.D. dependent var



2.138344



S.E. of regression



1.310240



Akaike info criterion



3.397998



Sum squared resid



338.1957



Schwarz criterion



3.463735



Log likelihood



-337.4988



Hannan-Quinn criter.



3.424598



F-statistic



111.9003



Durbin-Watson stat



1.939251



Prob(F-statistic)



0.000000






























Give the test equation, the hypotheses, the test statistic and the appropriate p-value.What order of autocorrelation appears to be present in the residuals? Discuss.











(2 marks)


Another test for autocorrelation incomes from the correlogram.The output is reported in Table 3.



Table 3. Correlogram Q-Statistics of Residuals – Electricity Price Model



Are the results of the correlogram consistent with the results from the LM test? Why or why not? Explain your answer.







(2 marks)


Given the results of the LM test and the correlogram, how could the original equation



be modified to model the autocorrelation more effectively? Give equations for two alternative regression models that could potentially account for any autocorrelation that you have observed.




















(2 marks)
















Question 3
Dummy Variables[8 marks]


A political scientist is interested in the factors that influence voter preferences. To model the effect of various characteristics of US political candidates upon polling performance, the following equation can be estimated



whereis the candidate’s vote share,is the candidate’s age,is the budget of the campaign andis the number of endorsements the candidate received.


The political scientist feels that there may be structural differences in the regression equations for candidates that do and do not have advanced degrees. LetDdenote a dummy variable that is equal to zero if the candidatedoes nothave an advanced degree; and one if the candidatedoeshave an advanced degree.


Help the political scientist by showing the procedure used to conduct the Chow test. Give the unrestricted and restricted models and the null and alternative hypotheses. What conclusion should the political scientist draw if the null hypothesis is rejected?




















(4 marks)


Explain what is meant by the term ‘Dummy Variable Trap’.
















(2 marks)



In a famous research paper, David Card and Alan Krueger used a difference in difference estimator to evaluate the effect of minimum wages on employment in the US. A minimum wage was implemented in New Jersey (NJ) but not in Pennsylvania (PA) and the authors took employment data in both states before and after this occurred. Let FTE denote the level of full time equivalent employment,D denote a dummy variable indicating the time period after the wages were introduced, and NJ denote a dummy variable that indicates New Jersey.


Card and Krueger estimated the model



and the output is in Table 4.





















Table 4. Difference-in-Difference Equation for FTE Employment






































































































































































Dependent Variable: FTE







Method: Least Squares













Sample: 1 820









Included observations: 794



























Variable



Coefficient



Std. Error



t-Statistic



Prob.























C



23.33117



1.071870



21.76679



0.0000



D



-2.165584



1.515853



-1.428625



0.1535



NJ



-2.891761



1.193524



-2.422877



0.0156



D*NJ



2.753606



1.688409



1.630888



0.1033























R-squared



0.007401



Mean dependent var



21.02651



Adjusted R-squared



0.003632



S.D. dependent var



9.422746



S.E. of regression



9.405619



Akaike info criterion



7.325517



Sum squared resid



69887.88



Schwarz criterion



7.349079



Log likelihood



-2904.230



Hannan-Quinn criter.



7.334571



F-statistic



1.963536



Durbin-Watson stat



1.860208



Prob(F-statistic)



0.117983






























What effect did the minimum wage have on FTE employment in New Jersey? Is this result consistent to what is predicted by economic theory?Why or why not?









(2 marks)











Question 4
Instrumental Variables[9 marks]


Since the early 2000s, the United States has seen a dramatic increase in the abuse of prescription opioids (a strong painkiller chemically derived from heroin). Approximately 500,000 deaths have been attributed to the drug, and its usage has been described as an ongoing emergency for public health.


Consider a health economist who is interested in determining the health effects of opioid use.She takes numerical summary data on the health (y) of individuals (a number from 0-100 where higher values indicate better health), and regresses this against their age (), body mass index (), education (),and a gender dummy (). She also includes the key variable measuring opioid use (). The model is the following linear specification:



Explain to the health economist why variablemay be endogenous withy, and hence whycannot be interpreted as the causal impact of opioid use on health.



















(3 marks)



To estimate a causal effect in this context, at least one valid instrument is required. List three statistical properties required of a variable in order to act as a valid instrument.











(3 marks)



Suppose you determine two valid instrumentsandfor estimating the casual impact of opioid use upon health. Explain how the Hausman test employing instrumentsandcan be used to determine if the variableis endogenous. Outline the two-stage testing procedure, including the test equation and hypotheses.














(3 marks)





Question 5
Non-Stationary Time Series[15 marks]


Daily prices for three stock market indices (from Hong Kong - Hang Seng, Japan - Nikkei and the US – S&P500) are shown below.There are approximately 250 observations from each series sourced from 2020-2021.




Figure 3. Value of Hang Seng Index – 2020-2021





Figure 4. Value of Nikkei 225 - 2020-2021







Figure 5. Value of S&P500 – 2020-2021




A financial analyst examines the time-series properties of each variable by estimating the following Dicky-Fuller equations:



(1)


(2)


(3)



To test for stationarity the following hypotheses are used





and tau (τ) statistics are obtained for each variable using the three test equations. The results for the three indices are given in Table 5.




Table 5. τ Statistics for Dickey Fuller Tests – Hang Seng, Nikkei, S&P500






























Model



Hang Seng (τ)



Nikkei (τ)



S&P 500 (τ)





0.79



1.36



2.06





-1.34



-0.81



-0.86





-2.15



-1.83



-3.77




By identifying the appropriate test statistic and critical value for each variable, determine whether the prices of the indices are stationary at the 5% significance level.













(3 marks)



Briefly explain (i.e. one sentence each) how the appropriate test statistics and critical values are determined in each case.













(3 marks)


The analyst feels that due to some regional similarities, the Hang Seng and Nikkei indices might be cointegrated. To test for cointegration they run the following regression:



A plot of the residual seriesis presented below.




Figure 6. Residual Plot – Cointegrating Equation




On the basis of the plot, do you think the Hang Seng and Nikkei indices are cointegrated?What features of the plot would you look for in determining whether the series are cointegrated or not? Explain.










(2 marks)



To test for cointegration the analyst performs a unit root test upon these residuals. If a value ofis obtained, determine if the series are cointegrated. Show your working.









(3 marks)






Suppose the analyst decides that the Nikkei and Hang Seng indices are I(1) and proceeds to model them in first differences. They specify the following ARDL model



whereis the change in the Nikkei in timetandis the change in the Hang Seng at timet. The results are reported below in Table 6.



Table 6. Autoregressive Distributed Lag Model – Nikkei and Hang Seng










































































































































































Dependent Variable: DNIKKEI







Method: Least Squares













Sample (adjusted): -





Included observations: 241 after adjustments

























Variable



Coefficient



Std. Error



t-Statistic



Prob.























C



26.01895



16.73171



1.555068



0.1213



DNIKKEI(-1)



-0.077421



0.064901



-1.192903



0.2341



DNIKKEI(-2)



-0.001613



0.064269



-0.025103



0.9800



DHANGSENG(-1)



0.124815



0.053114



2.349933



0.0196



DHANGSENG(-2)



0.093489



0.053312



1.753631



0.0808























R-squared



0.030175



Mean dependent var



28.29643



Adjusted R-squared



0.013737



S.D. dependent var



258.4867



S.E. of regression



256.7052



Akaike info criterion



13.95426



Sum squared resid



15551818



Schwarz criterion



14.02656



Log likelihood



-1676.489



Hannan-Quinn criter.



13.98339



F-statistic



1.835710



Durbin-Watson stat



2.016709



Prob(F-statistic)



0.122702





























Interpret the model by briefly discussing its dynamics.Does the Nikkei index exhibit momentum effects?Do changes in the Hang Seng index appear to drive changes in the Nikkei?If so, how long does it take for effects to spill over from the Hong Kong market to the Japanese market? Use a significance level of 10% to answer this question.













(4 marks)


Answered Same DayJun 14, 2021

Answer To: ANSWER ALL QUESTIONS ON THE EXAM PAPER AND SHOW FULL WORKING Question 1 Heteroskedasticity[10...

Subhanbasha answered on Jun 14 2021
139 Votes
Question 1:
Ans:
· Definition and problem of heteroskedasticity: Heteroskedasticity will happen when there is residual or normal distribution. The variance of the error term is not a constant then it will
be called as error term in the regression model behaves like random. That means error term does not follows the Heteroskedasticity. In the plot of predicted versus residuals we can observe the pattern of residual terms. Then the predicted values of expenditure on R&D will vary differently than the original values.
The problem of the heteroskedasticity is that the model will not be predicting the dependent variable correctly. This will also effect on the p values, t and F values so the model can be identified as significant but it actually not significant. So it leads to wrong conclusions of the model. The predicted value of expenditure on R&D also very far from the original value.
· The plot above is showing that the residuals are not in normal form they are following random pattern. So, the variance is not constant and it will vary. Here the Heteroscedasticity is present and it will lead to the wrong predictions and decisions about the predicted values. The predicted value of Real GDP will very far from the actual value.
· Here we use log transformation of the model. By applying the transformation the model will be as follows
Ln(y) = β0 + β1ln (X1) + β2 ln (X2) + e
Y* = β0*+ β1X1*+ β2X2*+e*
The above equation is the linear log model of the original model.
Here Y* = ln(Y), X1= ln (X1)……..
After build a model we need to be take exponent of the dependent variable to get the original value.
· The transformed model is Y* = β0*+ β1X1*+ β2X2*+e* which will satisfy the OLS assumptions of the model that is the mean value of the residuals or error term is zero.
e*= Y* - β0*- β1X1*- β2X2*
That means that while we use the log transformation the residual values will decrease then the residuals will scatter equally among mean value.
So the variance will be constant that is
That is the model will give the constant variance after applying the...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here