An Introduction to Statistical Computing - Jochen Voss - E-Book

An Introduction to Statistical Computing E-Book

Jochen Voss

0,0
67,99 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.
Beschreibung

A comprehensive introduction to sampling-based methods in statistical computing

The use of computers in mathematics and statistics has opened up a wide range of techniques for studying otherwise intractable problems. Sampling-based simulation techniques are now an invaluable tool for exploring statistical models. This book gives a comprehensive introduction to the exciting area of sampling-based methods.

An Introduction to Statistical Computing introduces the classical topics of random number generation and Monte Carlo methods. It also includes some advanced methods such as the reversible jump Markov chain Monte Carlo algorithm and modern methods such as approximate Bayesian computation and multilevel Monte Carlo techniques

An Introduction to Statistical Computing:

  • Fully covers the traditional topics of statistical computing.
  • Discusses both practical aspects and the theoretical background.
  • Includes a chapter about continuous-time models.
  • Illustrates all methods using examples and exercises.
  • Provides answers to the exercises (using the statistical computing environment R); the corresponding source code is available online.
  • Includes an introduction to programming in R.

This book is mostly self-contained; the only prerequisites are basic knowledge of probability up to the law of large numbers. Careful presentation and examples make this book accessible to a wide range of students and suitable for self-study or as the basis of a taught course.

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 492

Veröffentlichungsjahr: 2013

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Contents

Cover

Series Page

Title Page

Copyright Page

List of Algorithms

Preface

Nomenclature

1: Random number generation

1.1 Pseudo random number generators

1.2 Discrete distributions

1.3 The inverse transform method

1.4 Rejection sampling

1.5 Transformation of random variables

1.6 Special-purpose methods

1.7 Summary and further reading

Exercises

2: Simulating statistical models

2.1 Multivariate normal distributions

2.2 Hierarchical models

2.3 Markov chains

2.4 Poisson processes

2.5 Summary and further reading

Exercises

3: Monte Carlo methods

3.1 Studying models via simulation

3.2 Monte Carlo estimates

3.3 Variance reduction methods

3.4 Applications to statistical inference

3.5 Summary and further reading

Exercises

4: Markov Chain Monte Carlo methods

4.1 The Metropolis–Hastings method

4.2 Convergence of Markov Chain Monte Carlo methods

4.3 Applications to Bayesian inference

4.4 The Gibbs sampler

4.5 Reversible Jump Markov Chain Monte Carlo

4.6 Summary and further reading

Exercises

5: Beyond Monte Carlo

5.1 Approximate Bayesian Computation

5.2 Resampling methods

5.3 Summary and further reading

Exercises

6: Continuous-time models

6.1 Time discretisation

6.2 Brownian motion

6.3 Geometric Brownian motion

6.4 Stochastic differential equations

6.5 Monte Carlo estimates

6.6 Application to option pricing

6.7 Summary and further reading

Exercises

Appendix A: Probability reminders

A.1 Events and probability

A.2 Conditional probability

A.3 Expectation

A.4 Limit theorems

A.5 Further reading

Appendix B: Programming in R

B.1 General advice

B.2 R as a Calculator

B.3 Programming principles

B.4 Random number generation

B.5 Summary and further reading

Exercises

Appendix C: Answers to the exercises

C.1 Answers for Chapter 1

C.2 Answers for Chapter 2

C.3 Answers for Chapter 3

C.4 Answers for Chapter 4

C.5 Answers for Chapter 5

C.6 Answers for Chapter 6

C.7 Answers for Appendix B

References

Index

WILEY SERIES IN COMPUTATIONAL STATISTICS

Consulting Editors:

Paolo GiudiciUniversity of Pavia, Italy

Geof H. GivensColorado State University, USA

Bani K. MallickTexas A & M University, USA

Wiley Series in Computational Statistics is comprised of practical guides and cutting edge research books on new developments in computational statistics. It features quality authors with a strong applications focus. The texts in the series provide detailed coverage of statistical concepts, methods and case studies in areas at the interface of statistics, computing, and numerics.

With sound motivation and a wealth of practical examples, the books show in concrete terms how to select and to use appropriate ranges of statistical computing techniques in particular fields of study. Readers are assumed to have a basic understanding of introductory terminology.

The series concentrates on applications of computational methods in statistics to fields of bioinformatics, genomics, epidemiology, business, engineering, finance and applied statistics.

Titles in the Series

Biegler, Biros, Ghattas, Heinkenschloss, Keyes, Mallick, Marzouk, Tenorio, Waanders, Willcox – Large-Scale Inverse Problems and Quantification of Uncertainty

Billard and Diday – Symbolic Data Analysis: Conceptual Statistics and Data Mining

Bolstad – Understanding Computational Bayesian Statistics

Borgelt, Steinbrecher and Kruse – Graphical Models, 2e

Dunne – A Statistical Approach to Neutral Networks for Pattern Recognition

Liang, Liu and Carroll – Advanced Markov Chain Monte Carlo Methods

Ntzoufras – Bayesian Modeling Using WinBUGS

Tufféry – Data Mining and Statistics for Decision Making

This edition first published 2014 © 2014 John Wiley & Sons, Ltd

Registered office John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom

For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book please see our website at www.wiley.com.

The right of the author to be identified as the author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988.

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.

Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books.

Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book.

Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. It is sold on the understanding that the publisher is not engaged in rendering professional services and neither the publisher nor the author shall be liable for damages arising herefrom. If professional advice or other expert assistance is required, the services of a competent professional should be sought.

Library of Congress Cataloging-in-Publication Data

Voss, Jochen.  An introduction to statistical computing : a simulation-based approach / Jochen Voss. − First edition.    pages cm. – (Wiley series in computational statistics)  Includes bibliographical references and index.  ISBN 978-1-118-35772-9 (hardback) 1. Mathematical statistics-Data processing.  I. Title.  QA276.4.V66 2013  519.501′13-dc23

2013019321

A catalogue record for this book is available from the British Library.

ISBN: 978-1-118-35772-9

List of algorithms

Random number generation

alg. 1.2 linear congruential generator
alg. 1.13 inverse transform method
alg. 1.19 basic rejection sampling
alg. 1.22 envelope rejection sampling
alg. 1.25 rejection sampling for conditional distributions

Simulating statistical models

alg. 2.9 mixture distributions
alg. 2.11 componentwise simulation
alg. 2.22 Markov chains with discrete state space
alg. 2.31 Markov chains with continuous state space
alg. 2.36 Poisson process
alg. 2.41 thinning method for Poisson processes

Monte Carlo methods

alg. 3.8 Monte Carlo estimate
alg. 3.22 importance sampling
alg. 3.26 antithetic variables
alg. 3.31 control variates

Markov Chain Monte Carlo methods

alg. 4.2 Metropolis-Hastings method for continuous state space
alg. 4.4 Metropolis-Hastings method for discrete state space
alg. 4.9 random walk Metropolis
alg. 4.11 independence sampler
alg. 4.12 Metropolis-Hastings method with different move types
alg. 4.27 Gibbs sampler
alg. 4.31 Gibbs sampler for the Ising model
alg. 4.32 Gibbs sampler in image processing
alg. 4.36 reversible jump Markov Chain Monte Carlo

Beyond Monte Carlo

alg. 5.1 basic Approximate Bayesian Computation
alg. 5.6 Approximate Bayesian Computation with regression
alg. 5.11 general bootstrap estimate
alg. 5.15 bootstrap estimate of the bias
alg. 5.18 bootstrap estimate of the standard error
alg. 5.20 simple bootstrap confidence interval
alg. 5.21 BCa bootstrap confidence interval

Continuous-time models

alg. 6.6 Brownian motion
alg. 6.12 Euler-Maruyama scheme
alg. 6.15 Milstein scheme
alg. 6.26 multilevel Monte Carlo estimates
alg. 6.29 Euler-Maruyama scheme for the Heston model

Preface

This is a book about exploring random systems using computer simulation and thus, this book combines two different topic areas which have always fascinated me: the mathematical theory of probability and the art of programming computers. The method of using computer simulations to study a system is very different from the more traditional, purely mathematical approach. On the one hand, computer experiments normally can only provide approximate answers to quantitative questions, but on the other hand, results can be obtained for a much wider class of systems, including large and complex systems where a purely theoretical approach becomes difficult.

In this text we will focus on three different types of questions. The first, easiest question is about the normal behaviour of the system: what is a typical state of the system? Such questions can be easily answered using computer experiments: simulating a few random samples of the system gives examples of typical behaviour. The second kind of question is about variability: how large are the random fluctuations? This type of question can be answered statistically by analysing large samples, generated using repeated computer simulations. A final, more complicated class of questions is about exceptional behaviour: how small is the probability of the system behaving in a specified untypical way? Often, advanced methods are required to answer this third type of question. The purpose of this book is to explain how such questions can be answered. My hope is that, after reading this book, the reader will not only be able to confidently use methods from statistical computing for answering such questions, but also to adjust existing methods to the requirements of a given problem and, for use in more complex situations, to develop new specialised variants of the existing methods.

This text originated as a set of handwritten notes which I used for teaching the ‘Statistical Computing’ module at the University of Leeds, but now is greatly extended by the addition of many examples and more advanced topics. The material we managed to cover in the ‘Statistical Computing’ course during one semester is less than half of what is now the contents of the book! This book is aimed at postgraduate students and their lecturers; it can be used both for self-study and as the basis of taught courses. With the inclusion of many examples and exercises, the text should also be accessible to interested undergraduate students and to mathematically inclined researchers from areas outside mathematics.

Only very few prerequisites are required for this book. On the mathematical side, the text assumes that the reader is familiar with basic probability, up to and including the law of large numbers; Appendix A summarises the required results. As a consequence of the decision to require so little mathematical background, some of the finer mathematical subtleties are not discussed in this book. Results are presented in a way which makes them easily accessible to readers with limited mathematical background, but the statements are given in a form which allows the mathematically more knowledgeable reader to easily add the required detail on his/her own. (For example, I often use phrases such as ‘every set where full mathematical rigour would require us to write ‘every measurable set .) On the computational side, basic programming skills are required to make use of the numerical methods introduced in this book. While the text is written independent of any specific programming language, the reader will need to choose a language when implementing methods from this book on a computer. Possible choices of programming language include Python, Matlab and C/C++. For my own implementations, provided as part of the solutions to the exercises in Appendix C, I used the R programming language; a short introduction to programming with R is provided in Appendix B.

Writing this book has been a big adventure for me. When I started this project, more than a year ago, my aim was to cover enough material so that I could discuss the topics of multilevel Monte Carlo and reversible jump Markov Chain Monte Carlo methods. I estimated that 350 pages would be enough to cover this material but it quickly transpired that I had been much too optimistic: my estimates for the final page count kept rising and even after several rounds of throwing out side-topics and generally tightening the text, the book is still stretching this limit! Nevertheless, the text now covers most of the originally planned topics, including multilevel Monte Carlo methods near the very end of the book. Due to my travel during the last year, parts of this book have been written on a laptop in exciting places. For example, the initial draft of section 1.5 was written on a coach travelling through the beautiful island of Kyushu, halfway around the world from where I live! All in all, I greatly enjoyed writing this book and I hope that the result is useful to the reader.

This book contains an accompanying website. Please visit www.wiley.com/go/statistical_computing

Jochen VossLeeds, March 2013