Panel Data Econometrics with R - Yves Croissant - E-Book

Panel Data Econometrics with R E-Book

Yves Croissant

0,0
84,99 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.
Beschreibung

Panel Data Econometrics with R provides a tutorial for using R in the field of panel data econometrics. Illustrated throughout with examples in econometrics, political science, agriculture and epidemiology, this book presents classic methodology and applications as well as more advanced topics and recent developments in this field including error component models, spatial panels and dynamic models. They have developed the software programming in R and host replicable material on the book’s accompanying website.

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 546

Veröffentlichungsjahr: 2018

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents

Cover

Dedication

Preface

Acknowledgments

About the Companion Website

Chapter 1: Introduction

1.1 Panel Data Econometrics: A Gentle Introduction

1.2 R for Econometric Computing

1.3 plm for the Casual R User

1.4 plm for the Proficient R User

1.5 plm for the R Developer

1.6 Notations

Chapter 2: The Error Component Model

2.1 Notations and Hypotheses

2.2 Ordinary Least Squares Estimators

2.3 The Generalized Least Squares Estimator

2.4 Comparison of the Estimators

2.5 The Two‐ways Error Components Model

2.6 Estimation of a Wage Equation

Chapter 3: Advanced Error Components Models

3.1 Unbalanced Panels

3.2 Seemingly Unrelated Regression

3.3 The Maximum Likelihood Estimator

3.4 The Nested Error Components Model

Chapter 4: Tests on Error Component Models

4.1 Tests on Individual and/or Time Effects

4.2 Tests for Correlated Effects

4.3 Tests for Serial Correlation

4.4 Tests for Cross‐sectional Dependence

Chapter 5: Robust Inference and Estimation for Non‐spherical Errors

5.1 Robust Inference

5.2 Unrestricted Generalized Least Squares

Chapter 6: Endogeneity

6.1 Introduction

6.2 The Instrumental Variables Estimator

6.3 Error Components Instrumental Variables Estimator

6.4 Estimation of a System of Equations

6.5 More Empirical Examples

Chapter 7: Estimation of a Dynamic Model

7.1 Dynamic Model and Endogeneity

7.2 GMM Estimation of the Differenced Model

7.3 Generalized Method of Moments Estimator in Differences and Levels

7.4 Inference

7.5 More Empirical Examples

Chapter 8: Panel Time Series

8.1 Introduction

8.2 Heterogeneous Coefficients

8.3 Cross‐sectional Dependence and Common Factors

8.4 Nonstationarity and Cointegration

Chapter 9: Count Data and Limited Dependent Variables

9.1 Binomial and Ordinal Models

9.2 Censored or Truncated Dependent Variable

9.3 Count Data

9.4 More Empirical Examples

Chapter 10: Spatial Panels

10.1 Spatial Correlation

10.2 Spatial Lags

10.3 Individual Heterogeneity in Spatial Panels

10.4 Serial and Spatial Correlation

Bibliography

Index

End User License Agreement

List of Tables

Chapter 02

Table 2.1 Wage Equation.

Chapter 05

Table 5.1 Covariance structures as combinations of the basic building blocks.

Chapter 06

Table 6.1 Estimations of the gravity model.

List of Illustrations

Chapter 02

Figure 2.1 Imports in terms of the national product for the ForeignTrade data.

Figure 2.2 Cost in terms of output for the TurkishBanks data.

Figure 2.3 Cost and output for the TexasElectr data set.

Figure 2.4 Democracy and lagged income for the data DemocracyIncome25.

Chapter 03

Figure 3.1 Estimators of the variance components for unbalanced panels.

Figure 3.2 Error components estimators for the nested error component model.

Chapter 07

Figure 7.1 Relationship between income and democracy.

Figure 7.2 First step coefficient as a function of

.

Figure 7.3 The supplementary condition of th system‐

GMM

estimator.

Chapter 08

Figure 8.1 Individual coefficients, HousePriceUS.

Figure 8.2 Autoregressive processes with different

parameters.

Figure 8.3 Histogram of the Student statistic in case of a unit root.

Chapter 09

Limited dependent variable.

Figure 9.2 Distribution of

and

.

Figure 9.3

OLS

bias for the censored and the truncated samples.

Figure 9.4 Distribution of

and of

.

Figure 9.5 Symmetry of the distribution of

.

Figure 9.6 Average annual citations by age, BRC versus control articles.

Chapter 10

Figure 10.1 Growth of house prices indexes in the USA between 1980 and 2000.

Guide

Cover

Table of Contents

Begin Reading

Pages

C1

xiii

xiv

xv

xvii

xix

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

74

75

76

77

78

79

80

81

83

84

85

86

87

88

89

90

91

92

93

94

95

96

97

98

99

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

211

212

213

214

215

216

217

218

219

220

221

222

223

224

225

226

227

228

229

230

231

232

233

234

235

236

237

238

239

240

241

242

243

244

245

246

247

248

249

250

251

252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

288

289

290

291

292

293

294

295

297

298

299

300

301

E1

Panel Data Econometrics with R

Edited by

Yves Croissant

Professor of Economics CEMOI Faculté de Droit et d'Economie Université de La Réunion France

 

Giovanni Millo

Senior Economist Group Insurance Research, Assicurazioni Generali S. p. A. Trieste, Italy

Copyright

This edition first published 2019

© 2019 John Wiley & Sons Ltd

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by law. Advice on how to obtain permission to reuse material from this title is available at http://www.wiley.com/go/permissions.

The right of Yves Croissant and Giovanni Millo to be identified as the authors of this work has been asserted in accordance with law.

Registered Offices

John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, USA

John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK

Editorial Office

9600 Garsington Road, Oxford, OX4 2DQ, UK

For details of our global editorial offices, customer services, and more information about Wiley products visit us at www.wiley.com.

Wiley also publishes its books in a variety of electronic formats and by print‐on‐demand. Some content that appears in standard print versions of this book may not be available in other formats.

Limit of Liability/Disclaimer of Warranty

While the publisher and authors have used their best efforts in preparing this work, they make no representations or warranties with respect to the accuracy or completeness of the contents of this work and specifically disclaim all warranties, including without limitation any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives, written sales materials or promotional statements for this work. The fact that an organization, website, or product is referred to in this work as a citation and/or potential source of further information does not mean that the publisher and authors endorse the information or services the organization, website, or product may provide or recommendations it may make. This work is sold with the understanding that the publisher is not engaged in rendering professional services. The advice and strategies contained herein may not be suitable for your situation. You should consult with a specialist where appropriate. Further, readers should be aware that websites listed in this work may have changed or disappeared between when this work was written and when it is read. Neither the publisher nor authors shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.

Library of Congress Cataloging‐in‐Publication Data

Names: Croissant, Yves, 1969- author. | Millo, Giovanni, 1970- author.

Title: Panel data econometrics with R / Yves Croissant, Giovanni Millo.

Description: First edition. | Hoboken, NJ : John Wiley & Sons, 2018. |

Includes index. |

Identifiers: LCCN 2018006240 (print) | LCCN 2018014738 (ebook) | ISBN

9781118949177 (pdf) | ISBN 9781118949184 (epub) | ISBN 9781118949160

(cloth)

Subjects: LCSH: Econometrics. | Panel analysis. | R (Computer program

language)

Classification: LCC HB139 (ebook) | LCC HB139 .C765 2018 (print) | DDC

330.0285/5133-dc23

LC record available at https://lccn.loc.gov/2018006240

Cover Design: Wiley

Cover Image: ©Zffoto/Getty Images

Dedication

To Agnès, Fanny and Marion, to my parents - Yves

To the memory of my uncles, Giovanni and Mario - Giovanni

Preface

While R is the software of choice and the undisputed leader in many fields of statistics, this is not so in econometrics; yet, its popularity is rising both among researchers and in university classes and among practitioners. From user feedback and from citation information, we gather that the adoption rate of panel‐specific packages is even higher in other research fields outside economics where econometric methods are used: finance, political science, regional science, ecology, epidemiology, forestry, agriculture, and fishing.

This is the first book entirely dedicated to the subject of doing panel data econometrics in R, written by the very people who wrote most of the software considered, so it should be naturally adopted by R users wanting to do panel data analysis within their preferred software environment. According to the best practices of the R community, every example is meant to be replicable (in the style of package vignettes); all code is available from the standard online sources, as are all datasets. Most of the latter are contained in a dedicated companion package, pder. The book is supposed to be both a reasonably comprehensive reference on R functionality in the field of panel data econometrics, illustrated by way of examples, and a primer on econometric methods for panel data in general.

While we have tried to cover the vast majority of basic methods and much of the more advanced ones (corresponding roughly to graduate and doctoral level university courses), the book is still less exhaustive than main reference textbooks (one for all, Baltagi, 2013) the a priori being that the reader should be able to apply all the methods presented in the book through available R code from plm and related, more specialized packages.

One should note from the beginning that, from a computational viewpoint, the average R user tends to be more advanced than users of commercial statistical packages. R users will generally be interested in interactive statistical programming whereby they can be in full control of the procedures they use and eventually be looking forward to write their own code or adapt the existing one to their own purposes. All that said, despite its reputation, R lends itself nicely to standard statistical practice: issuing a command, reading output. Hence the potential readership spans an unusually broad spectrum and will be best identified by subject rather than by level of technical difficulty.

Examples are usually written without employing advanced features but still using a fair amount of syntax beyond what would be the plain vanilla “estimate, print summary” procedure sketched above; the reader replicating them will therefore be exposed to a number of simple but useful constructs—ranging from general purpose visualization to compact presentation of results—stemming from the fact that she is using a full‐featured programming language rather than a canned package.

The general level is introductory and aimed at both students and practitioners. Chapters 1–2, and to some extent 4–5, cover the basics of panel data econometrics as taught in undergraduate econometrics classes, if at all. With some overlapping, the main body of the book (Ch. 3–6) covers the typical subjects of an advanced panel data econometrics course at graduate level. Nevertheless, the coverage of the later chapters (especially 7–10) spans fields typical of current applied research; therefore it should appeal particularly to graduate students and researchers. For all this, the book might play two main roles: companion to advanced textbooks for graduate students taking a panel data course, with Chapters 1–7 covering the course syllabus and 8–10 providing more cutting‐edge material for extensions; and reference text for practitioners or applied researchers in the field, covering most of the methods they are ever likely to use, with applied examples from recent literature. Nevertheless, its first half can be used in an undergraduate course as well, especially considering the wealth of examples and the possibility to replicate all material. Symmetrically, the last chapters can appeal to researchers wanting to employ cutting‐edge methods—for which there is usually around only quite unfriendly code written in matrix language by methodologists—with the relative user‐friendliness of R. As an example, Ch. 10 is based on the R tutorials one of the authors gives at the Spatial Econometrics Advanced Institute in Rome, the world‐leading graduate school in applied spatial econometrics.

Econometrics is a late comer to the world of R, although of course much of basic econometrics employs standard statistical tools, which were present in base R. Typical functionality, addressing the emphasis on model assumptions and testing, which is characteristic of the discipline, started to appear with the lmtest package and the accompanying paper of Zeileis & Hothorn (2002); a review paper on the use of R in econometrics, focused on teaching, was published at about the same time (Racine & Hyndman, 2002). This was followed by further dedicated packages extending the scope of specialized methods to structural equation modeling, time series, stability testing, and robust covariance estimation, to name a few; while despite the availability of some online tutorials, no dedicated book would appear in print until Kleiber & Zeileis (2008).

In the wake of any organized and comprehensive R package for panel data econometrics, Yves Croissant started developing plm in 2006, presenting one early version of the software at the 2006 useR! Conference in Vienna. Giovanni Millo joined the project as coauthor shortly thereafter. Two years later, an accompanying paper to plm (Croissant & Millo, 2008) featured prominently in the econometrics special issue of the Journal of Statistical Software testifying the improved availability of econometric methods in R and the increased relevance of the R project for the profession.

More recently, Kevin Tappe has become the third author. Liviu Andronic, Arne Henningsen, Christian Kleiber, Ott Toomet, and Achim Zeileis importantly contributed to the package at various times. Countless users provided feedback, smart questions, bug reports, and, often, solutions.

Estimating the user base is no simple task, but the available evidence points at large and growing numbers. The 2008 paper describing an earlier version of the package has since been downloaded almost 100,000 times and peaked on Goggle Scholar's list as the 25th most cited paper in the Journal of Statistical Software, the leading outlet in the field, before hitting the five‐year reporting limit. At the time of writing, it counts over 400 citations on Google Scholar, despite the widespread bad habit of not citing software papers. The monthly number of package downloads from a leading mirror site has been recently estimated at 6,000.

Chapters 2, 3, 6, 7, and 8 have been written by Yves Croissant; 1, 5, 9 (except the first generation unit root testing section), and 10 by Giovanni Millo, chapter 4 being co‐written.

The book has been produced through Emacs+ESS (Rossini et al., 2004) and typeset in LaTeX using Sweave (Leisch, 2002) and later knitr (Xie, 2015). Plots have been made using ggplot2 (Wickham, 2009) and tikz (Tantau, 2013).

The companion package to this book is pder (Croissant & Millo, 2017); the methods described are mainly in the plm package (Croissant & Millo, 2008) but also in pglm (Croissant, 2017) and splm (Millo & Piras, 2012). General purpose tests and diagnostics tools of packages car (Fox & Weisberg, 2011), lmtest (Zeileis & Hothorn, 2002), sandwich (Zeileis, 2006b), and AER (Kleiber & Zeileis, 2008) have been used in the code, as have some more specialized tools available in MASS (Venables & Ripley, 2002), censReg (Henningsen, 2017), nlme (Pinheiro et al., 2017), survival (Therneau & Grambsch, 2000), truncreg (Croissant & Zeileis, 2016), pcse (Bailey & Katz, 2011), and msm (Jackson, 2011). dplyr (Wickham & Francois, 2016) has been used to work with data.frames and Formula with general formulas. stargazer (Hlavac, 2013) and texreg (Leifeld, 2013) were used to produce fancy tables, the fiftystater package (Murphy, 2016) to plot a United States map. The packages presented and the example code are entirely cross‐platform as being part of the R project.

Acknowledgments

We thank Kevin Tappe, now a coauthor of “plm,” for his invaluable help in improving, checking and extending the functionality of the package. It is difficult to overstate the importance of his contribution.

Achim Zeileis, Christian Kleiber, Ott Toomet, Liviu Andronic, and Nina Schoenfelder have contributed code, fixes, ideas, and interesting discussions at different stages of development. Too many users to list here have provided feedback, good words of encouragement, and bug reports. Often those reporting a bug have also provided, or helped in working out, a solution.

We thank the authors of all the papers that are replicated or simply cited here, for their inspiring research and for making their datasets available. Barbara Rossi (editor) and James MacKinnon (maintainer of the data archive) of the Journal of Applied Econometrics (JAE) are thanked together with the original authors for kindly sharing the JAE data archive datasets.

Personal thanks

Yves Croissant

The first drafts of several chapters of the book have been written while giving a panel data course in the applied economics master of the University of La Reunion. I thank the students of this course for their useful feedback, which helped improving the text. I've been working with Fabrizio Carlevaro on several projects for about 20 years. During this collaboration, he shared with me his deep knowledge of econometrics, and the endless discussions we had were an invaluable source of inspiration for me.

Giovanni Millo

I thank my parents, Luciano and Lalla, for lifelong support and inspiration; Roberta, for her love and patience; my uncle Marjan, for giving me my first electronic calculator—a TI30—when I was a child, sparking a lasting interest for automatic computing; my mentors Attilio Wedlin, Gaetano Carmeci, and Giorgio Calzolari, for teaching me econometrics; and Davide Fiaschi, Angela Parenti, Riccardo “Jack” Lucchetti, Eduardo Rossi, Giuseppe Arbia, Gianfranco Piras, Elisa Tosetti, Giacomo Pasini, and other friends from the “small world” of Italian econometrics—again, too many to list exhaustively here—for so many interesting discussions about econometrics, computing with R, or both.

About the Companion Website

This book is accompanied by a companion website:

www.wiley.com/go/croissant/data-econometrics-with-R

The website includes code for reproducing all examples in the book, which can be found below:

Examples Ch.1

Examples Ch.2

Examples Ch.3

Examples Ch.4

Examples Ch.5

Examples Ch.6

Examples Ch.7

Examples Ch.8

Examples Ch.9

Examples Ch.10

The datasets are to be found in the pder package in the below link:

https://cran.r-project.org/web/packages/pder/index.html

Scan this QR code to visit the companion website.