Choosing and Using Statistics - Calvin Dytham - E-Book

Choosing and Using Statistics E-Book

Calvin Dytham

4,6
30,99 €

oder
-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.
Mehr erfahren.
Beschreibung

Choosing and Using Statistics remains an invaluable guide for students using a computer package to analyse data from research projects and practical class work.  The text takes a pragmatic approach to statistics with a strong focus on what is actually needed.  There are chapters giving useful advice on the basics of statistics and guidance on the presentation of data. The book is built around a key to selecting the correct statistical test and then gives clear guidance on how to carry out the test and interpret the output from four commonly used computer packages: SPSS, Minitab, Excel, and (new to this edition) the free program, R. Only the basics of formal statistics are described and the emphasis is on jargon-free English but any unfamiliar words can be looked up in the extensive glossary. This new 3rd edition of Choosing and Using Statistics is a must for all students who use a computer package to apply statistics in practical and project work.

Features new to this edition:

  • Now features information on using the popular free program, R
  • Uses a simple key and flow chart to help you choose the right statistical test
  • Aimed at students using statistics for projects and in practical classes
  • Includes an extensive glossary and key to symbols to explain any statistical jargon
  • No previous knowledge of statistics is assumed

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 528

Veröffentlichungsjahr: 2011

Bewertungen
4,6 (16 Bewertungen)
11
3
2
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Contents

Preface

The third edition

How to use this book

Acknowledgements for the first edition

1 Eight steps to successful data analysis

2 The basics

Observations

Hypothesis testing

P-values

Sampling

Experiments

Statistics

3 Choosing a test: a key

Remember: eight steps to successful data analysis

The art of choosing a test

A key to assist in your choice of statistical test

4 Hypothesis testing sampling and experimental design

Hypothesis testing

Acceptable errors

P-values

Sampling

Experimental design

5 Statistics, variables and distributions

What are statistics?

Types of statistics

What is a variable?

Types of variables or scales of measurement

Types of distribution

Discrete distributions

Continuous distributions

Non-parametric ‘distributions’

6 Descriptive and presentational techniques

General advice

Displaying data: summarizing a single variable

Displaying data: showing the distribution of a single variable

Descriptive statistics

Using the computer packages

Displaying data: summarizing two or more variables

Displaying data: comparing two variables

Displaying data: comparing more than two variables

7 The tests 1: tests to look at differences

Do frequency distributions differ?

Do the observations from two groups differ?

Do the observations from more than two groups differ?

There are two independent ways of classifying the data

More than one observation for each factor combination (with replication)

There are more than two independent ways to classify the data

Not all classifications are independent

Nested or hierarchical designs

8 The tests 2: tests to look at relationships

Is there a correlation or association between two variables?

Is there a cause-and-effect relationship between two variables?

Tests for more than two variables

9 The tests 3: tests for data exploration

Types of data

Observation, inspection and plotting

Symbols and letters used in statistics

Greek letters

Symbols

Upper-case letters

Lower-case letters

Glossary

Assumptions of the tests

What if the assumptions are violated?

Hints and tips

Using a computer

Sampling

Statistics

Displaying the data

Chapter 17

Index

This edition first published 2011, © 1999, 2003 by Blackwell Science, 2011 by Calvin Dytham

Blackwell Publishing was acquired by John Wiley & Sons in February 2007. Blackwell’s publishing program has been merged with Wiley’s global Scientific, Technical and Medical business to form Wiley-Blackwell.

Registered Office:

John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK

Editorial Offices:

9600 Garsington Road, Oxford, OX4 2DQ, UK

The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK 111 River Street, Hoboken, NJ 07030–5774, USA

For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book please see our website at www.wiley.com/wiley-blackwell.

The right of the author to be identified as the author of this work has been asserted in accordance with the UK Copyright, Designs and Patents Act 1988.

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.

Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book. This publication is designed to provide accurate and authoritative information in regard to the subject matter covered. It is sold on the understanding that the publisher is not engaged in rendering professional services. If professional advice or other expert assistance is required, the services of a competent professional should be sought.

Library of Congress Cataloging-in-Publication Data

Dytham, Calvin.

Choosing and using statistics: a biologist’s guide/by Calvin Dytham. - 3rd ed.

p. cm.

Includes bibliographical references and index.

ISBN 978-1-4051-9838-7 (hardback) - ISBN

1. Biometry. I. Title.

978-1-4051-9839-4 (pbk.)

QH323.5.D98 2011 001.4'22-dc22

2010030975

A catalogue record for this book is available from the British Library.

This book is published in the following electronic format: ePDF 978-1-4443-2843-1

Set in 9.5/12pt Berling by SPi Publisher Services, Pondicherry, India

1 2011

Preface

My aim was to produce a statistics book with two characteristics: to assume that the reader is using a computer to analyse data and to contain absolutely no equations.

This is a handbook for biologists who want to process their data through a statistical package on the computer, to select the most appropriate methods and extract the important information from the, often confusing, output that is produced. It is aimed, primarily, at undergraduates and masters students in the biological sciences who have to use statistics in practical classes and projects. Such users of statistics don’t have to understand exactly how the test works or how to do the actual calculations. These things are not covered in this book as there are more than enough books providing such information already. What is important is that the right statistical test is used and the right inferences made from the output of the test. An extensive key to statistical tests is included for the former and the bulk of the book is made up of descriptions of how to carry out the tests to address the latter.

In several years of teaching statistics to biology students it is clear to me that most students don’t really care how or why the test works. They do care a great deal that they are using an appropriate test and interpreting the results properly. I think that this is a fair aim to have for occasional users of statistics. Of course, anyone going on to use statistics frequently should become familiar with the way that calculations manipulate the data to produce the output as this will give a better understanding of the test.

If this book has a message it is this: think about the statistics before you collect the data! So many times I have seen rather distraught students unable to analyse their precious data because the experimental design they used was inappropriate. On such occasions I try to find a compromise test that will make the best of a bad job but this often leads to a weaker conclusion than might have been possible if more forethought had been applied from the outset. There is no doubt that if experiments or sampling strategies are designed with the statistics in mind better science will result.

Statistics are often seen by students as the ‘thing you must do to data at the end’. Please try to avoid falling into this trap yourself. Thought experiments producing dummy data are a good way to try out experimental designs and are much less labour-intensive than real ones!

Although there are almost no equations in this book I’m afraid there was no way to totally avoid statistical jargon. To ease the pain somewhat, an extensive Glossary and key to symbols are included. So when you are navigating your way through the key to choosing a test you should look up any words you don’t understand.

In this book I have given extensive instructions for the use of four commonly encountered software packages: SPSS, R, Excel and MINITAB. However, the key to choosing a statistical test is not at all package-specific, so if you use a software package other than the four I focus on or if you are using a calculator you will still be able to get a good deal out of this book.

If every sample gave the same result there would be no need for statistics. However, all aspects of biology are filled with variation. It is statistics that can be used to penetrate the haze of experimental error and the inherent variability of the natural world to reach the underlying causes and processes at work. So, try not to hate statistics, they are merely a tool that, when used wisely and properly, can make the life of a biologist much simpler and give conclusions a sound basis.

The third edition

In the 8 years since I wrote the second edition of this book there have, of course, been several new versions of the software produced. I have received many comments about the previous editions and I am grateful for the many suggestions on how to improve the text and coverage. Requests to add further statistical packages have been the most common suggestion for change. There was surprisingly little consensus on the packages to add for the second edition, but since 2000 the freely available, and very powerful, package R has become extremely widely used so I have added that to the mix this time.

How to use this book

This is definitely not a book that should be read from cover to cover. It is a book to refer to when you need assistance with statistical analysis, either when choosing an appropriate test or when carrying it out. The basics of statistical analysis and experimental design are covered briefly but those sections are intended mostly as a revision aid, or to outline of some of the more important concepts. The reviews of other statistics books may help you choose those that are most appropriate for you if you want or need more details.

The heart of the book is the key. The rest of the book hinges on the key, explaining how to carry out the tests, giving assistance with the statistical terms in the Glossary or giving tips on the use of computers and packages.

Packages used

MINITAB® version 15, MINITAB Inc.

SPSS® versions 16 and 17, SPSS Inc.

Excel™ version 2007 and 2008 for Mac, Microsoft Corporation

Running on:

Windows® versions XP, 2000, 7 and Vista, Microsoft Corporation

Mac OS 10, Apple Inc.

Example data

In the spirit of dummy data collection, all example data used throughout this book have been fabricated. Any similarity to data alive or dead is purely coincidental.

Acknowledgements for the first edition

Thanks to Sheena McNamee for support during the writing process, to Andrea Gillmeister and two anonymous reviewers for commenting on an early version of the manuscript and to Terry Crawford, Jo Dunn, David Murrell and Josephine Pithon for recommending and lending various books. Thanks also to Ian Sherman and Susan Sternberg at Blackwell and to many of my colleagues who told me that the general idea of a book like this was a sound one. Finally, I would especially like to thank the students at the University of York, UK, who brought me the problems that provided the inspiration for this book.

Acknowledgements for the second edition

Thanks to all the many people who contacted me with suggestions and comments about the first edition. I hope you can see that many of the corrections and improvements have come directly from you. Five anonymous reviewers provided many useful comments about the proposal for a second edition. Thanks to Sarah Shannon, Cee Brandston, Katrina McCallum and many others at Blackwell for seeing this book through and especially for producing a second superb and striking cover. S’Albufera Natural Parc and Nick Riddiford provided a very convenient bolt-hole for writing. Once again, I give special thanks to Sheena and to my colleagues, PhD students and undergraduate students at the University of York. Finally, thanks to everyone on the MRes EEM course over the last 4 years.

Acknowledgements for the third edition

It’s been thanks to the pushing of Ward Cooper at Wiley-Blackwell and Sheena McNamee that this third edition has seen the light of day. Thanks to Emma Rand, Olivier Missa and Frank Schurr for encouraging me to enter the brave new world of R. Thanks to Nik Prowse for guiding me through the final editing.

Calvin Dytham,

York 1998, 2002 and 2010

1

Eight steps to successful data analysis

This is a very simple sequence that, if you follow it, will integrate the statistics you use into the process of scientific investigation. As I make clear here, statistical tests should be considered very early in the process and not left until the end.

1 Decide what you are interested in.

2 Formulate a hypothesis or several hypotheses (see Chapters 2 and 3 for guidance).

3 Design the experiment, manipulation or sampling routine that will allow you to test the hypotheses (see Chapters 2 and 4 for some hints on how to go about this).

4Collect dummy data (i.e. make up approximate values based on what you expect to obtain). The collection of ‘dummy data’ may seem strange but it will convert the proposed experimental design or sampling routine into something more tangible. The process can often expose flaws or weaknesses in the datacollection routine that will save a huge amount of time and effort.

5 Use the key presented in Chapter 3 to guide you towards the appropriate test or tests.

6 Carry out the test(s) using the dummy data. (Chapters 6–9 will show you how to input the data, use the statistical packages and interpret the output.)

7 If there are problems go back to step 3 (or 2); otherwise, proceed to the collection of real data.

8 Carry out the test(s) using the real data. Report the findings and/or return to step 2.

I implore you to use this sequence. I have seen countless students who have spent a long time and a lot of effort collecting data only to find that the experimental or sampling design was not quite right. The test they are forced to use is much less powerful than one they could have used with only a slight change in the experimental design. This sort experience tends to turn people away from statistics and become ‘scared’ of them. This is a great shame as statistics are a hugely useful and vital tool in science.

The rest of the book follows this eight-step process but you should use it for guidance and advice when you become unsure of what to do.

3

Choosing a test: a key

I hope that you are not reading this chapter with your data already collected and the experiment or sampling programme ‘finished’. If you have finished collecting your data I strongly advise you to approach your next experiment or survey in a different way. As you will see below, I hope that you will be using this key before you start collecting real data.

Remember: eight steps to successful data analysis

1 Decide what you are interested in.

2 Formulate a hypothesis or hypotheses.

3 Design the experiment or sampling routine.

4 Collect dummy data. Make up approximate values based on what you expect.

5Use the key here to decide on the appropriate test or tests.

6 Carry out the test(s) using the dummy data.

7 If there are problems go back to step 3 (or 2), otherwise collect the real data.

8 Carry out the test(s) using the real data.

The art of choosing a test

It may be a surprising revelation, but choosing a statistical test is not an exact science. There is nearly always scope for considerable choice and many decisions will be made based on personal judgements, experience with similar problems or just a simple hunch. There are many circumstances under which there are several ways that the data could be analysed and yet each of the possible tests could be justified.

A common tendency is to force the data from your experiment into a test you are familiar with even if it is not the best method. Look around for different tests that may be more appropriate to the hypothesis you are testing. In this way you will expand your statistical repertoire and add power to your future experiments.

A key to assist in your choice of statistical test

Starting at step 1 in the list above move through the key following the path that best describes your data. If you are unsure about any of the terms used then consult the glossary or the relevant sections of the next two chapters. This is not a true dichotomous key and at several points there are more than two routes or end points.

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!

Lesen Sie weiter in der vollständigen Ausgabe!