The Informed Company - Dave Fowler - E-Book

The Informed Company E-Book

Dave Fowler

0,0
17,99 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.
Beschreibung

Learn how to manage a modern data stack and get the most out of data in your organization! Thanks to the emergence of new technologies and the explosion of data in recent years, we need new practices for managing and getting value out of data. In the modern, data driven competitive landscape the "best guess" approach--reading blog posts here and there and patching together data practices without any real visibility--is no longer going to hack it. The Informed Company provides definitive direction on how best to leverage the modern data stack, including cloud computing, columnar storage, cloud ETL tools, and cloud BI tools. You'll learn how to work with Agile methods and set up processes that's right for your company to use your data as a key weapon for your success . . . You'll discover best practices for every stage, from querying production databases at a small startup all the way to setting up data marts for different business lines of an enterprise. In their work at Chartio, authors Fowler and David have learned that most businesspeople are almost completely self-taught when it comes to data. If they are using resources, those resources are outdated, so they're missing out on the latest cloud technologies and advances in data analytics. This book will firm up your understanding of data and bring you into the present with knowledge around what works and what doesn't. * Discover the data stack strategies that are working for today's successful small, medium, and enterprise companies * Learn the different Agile stages of data organization, and the right one for your team * Learn how to maintain Data Lakes and Data Warehouses for effective, accessible data storage * Gain the knowledge you need to architect Data Warehouses and Data Marts * Understand your business's level of data sophistication and the steps you can take to get to "level up" your data The Informed Company is the definitive data book for anyone who wants to work faster and more nimbly, armed with actionable decision-making data.

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 234

Veröffentlichungsjahr: 2021

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents

Cover

Title Page

Copyright

Dedication

About This Book

Why Write This Book

Who This Book Is For

Who This Book Is Not For

Who Wrote the Book

Who Edited the Book

Influences

How This Book Was Written

How to Read This Book

Foreword

Introduction

Merging Business Context with Data Information

The Four Stages of Agile Data Organization

STAGE 1: SOURCE aka Siloed Data

Chapter One: Starting with Source Data

Common Options for Analyzing Source Data

Chapter Two: The Need to Replicate Source Data

Chapter Three: Source Data Best Practices

Keep a Complexity Wiki Page

Snippet Dictionary

Use a BI Product

Double Check Results

Keep Short Dashboards

Design Before Building

STAGE 2: DATA LAKE aka Data Combined

Chapter Four: Why Build a Data Lake?

What Is a Data Lake?

Reasons to Build a Data Lake Summarized

Chapter Five: Choosing an Engine for the Data Lake

Modern Columnar Warehouse Engines

Modern Warehouse Engine Products

Database Engines

Recommendation

Chapter Six: Extract and Load (EL) Data

ETL versus ELT

EL/ETL Vendors

Extract Options

Load Options

Multiple Schemas

Other Extract and Load Routes

Chapter Seven: Data Lake Security

Access in Central Place

Permission Tiers

Chapter Eight: Data Lake Maintenance

Why SQL?

Data Sources

Performance

Upgrade Snippets to Views

STAGE 3: DATA WAREHOUSE aka the Single Source of Truth

Chapter Nine: The Power of Layers and Views

Make Readable Views

Layer Views on Views

Start with a Single View

Chapter Ten: Staging Schemas

Orient to the Schemas

Pick a Table and Clean It

Other Staging Modeling Considerations

Building on Top of Staging Schemas

Chapter Eleven: Model Data with dbt

Version Control

Modularity and Reusability

Package Management

Organizing Files

Macros

Incremental Tables

Testing

Chapter Twelve: Deploy Modeling Code

Branch Using Version Control Software

Commit Message

Test Locally

Code Review

Schedule Runs

Chapter Thirteen: Implementing the Data Warehouse

Manage Dependencies

Combine Tables Within Schemas

Combine Tables Across Schemas

Keep the Grain Consistent

Create Business Metrics

Keeping Accurate History

Chapter Fourteen: Managing Data Access

How to Secure Sensitive Data in the Data Warehouse

How to Secure Sensitive Data in a BI Tool

Chapter Fifteen: Maintaining the Source of Truth

Track New Metrics

Deprecate Old Metrics

Deprecate Old Schemas

Resolve Conflicting Numbers

Handling Ongoing Requests and Ongoing Feedback

Updating Modeling Code

Manage Access

Tuning to Optimize

Code Review All Modeling

Maintenance Checklist

STAGE 4: DATA MARTS aka Data Democratized

Chapter Sixteen: Data Mart Implementation

Views on the Data Warehouse

Segment Tables

Access Update

Chapter Seventeen: Data Mart Maintenance

Educate Team

Identifies Issues

Identify New Needs

Help Track Success

Chapter Eighteen: Modern versus Traditional Data Stacks: What's Changed?

What's Changed?

Chapter Nineteen: Row‐ versus Column‐Oriented Database

Row‐Oriented Databases

Column‐Oriented Databases

Summary

Chapter Twenty: Style Guide Example

Simplify

Clean

Naming Conventions

Share It

Chapter Twenty-One: Building an SST Example

First Attempt—Same Tables with Prefixes

Second Attempt—Operational Schema (Source Agnostic)

Third Attempt—Application Separate, Other Sources Smashed

Less Planning, More Implementing

Acknowledgments and Contributions

Thank‐yous

Index

End User License Agreement

List of Tables

Chapter 5

TABLE 5.1 Selection Factors

Chapter 10

TABLE 10.1 Friends

TABLE 10.2

TABLE 10.3

TABLE 10.4

Chapter 13

TABLE 13.1 Orders

TABLE 13.2

TABLE 13.3

TABLE 13.4

TABLE 13.5

Chapter 19

TABLE 19.1 Facebook_Friends

TABLE 19.2

TABLE 19.3

TABLE 19.4

TABLE 19.5

TABLE 19.6

TABLE 19.7 Facebook_Friends

TABLE 19.8

TABLE 19.9

TABLE 19.10

TABLE 19.11

TABLE 19.12

TABLE 19.13

TABLE 19.13

Chapter 21

TABLE 21.1 Duplicate Records by Email

List of Illustrations

About This Book

Figure A.1 Data management is a collaborative process.

Figure A.2

Introduction

Figure I.1 Business context vs technical know how chart.

Figure I.2 The four stages of agile data organization represent a process th...

Chapter 1

Figure 1.1 Various methods for data analysis for data sources.

Figure 1.2 Example built in dashboard showing common metrics from Zendesk....

Figure 1.3 A basic export feature on a web services dashboard providing a CS...

Figure 1.4 pgAdmin dashboard is a popular IDE for PostgreSQL.

Figure 1.5 Geckoboard like dashboard displaying standard sales metrics.

Figure 1.6 Mixpanel cohort analysis.

Figure 1.7 Chartio Dashboard Executive Summary of Sales Metrics.

Chapter 2

Figure 2.1 A production system should be used with care.

Figure 2.2 A cloned data source with read‐only access.

Figure 2.3 It is dangerous to use a production system for non production pur...

Chapter 3

Figure 3.1 Using an SQL file in an editor to manage dashboard metrics.

Figure 3.2 A custom SQL building feature, "Visual SQL" from Chartio.

Figure 3.3 A very long dashboard.

Figure 3.4 A simple dashboard outline.

Figure 3.5 The book cover for How to Design a Dashboard, by Matt David.

Chapter 4

Figure 4.1 A data lake containing multiple data sources.

Figure 4.2

Figure 4.3 A dashboard using data from various data sources.

Figure 4.4 For dashboards that consist of large aggregations a transactional...

Chapter 5

Figure 5.1

Figure 5.2 A transactional database can read and write rows quickly, and an ...

Figure 5.3

Figure 5.4

Figure 5.5

Figure 5.6

Chapter 6

Figure 6.1 Data can be transformed while being moved to a data warehouse.

Figure 6.2 Data can first be loaded into a lake and transformed there. This ...

Figure 6.3

Figure 6.4

Figure 6.5

Figure 6.6

Chapter 7

Figure 7.1 Security moves from each data source to the data lake.

Figure 7.2 A table of data sources and with a key graphic indicating which c...

Chapter 8

Figure 8.1 Source data being loaded with SQL into a data lake.

Figure 8.2 More sources being added to the data lake.

Figure 8.3 Data sources having errors connecting to the data lake.

Figure 8.4 A warning about the connection to the data lake.

Figure 8.5 A database receiving a query, then fetching data from a cached st...

Figure 8.6 The “Workload Management Configuration” section on an AWS Redshif...

Figure 8.7 In Google Cloud's BigQuery, it's possible to set maximum bytes bi...

Figure 8.8 In Chartio, a dashboard setting for controlling the frequency of ...

Chapter 9

Figure 9.1 A view references a table’s data without changing how the table’s...

Figure 9.2 A view is an SQL abstraction on top of underlying data.

Figure 9.3 Views can reference other views allowing you to create layers wit...

Figure 9.4 It may not be obvious at first how to model data, start with maki...

Figure 9.5 A warehouse is a cleaned usable version of the data in the lake....

Chapter 10

Figure 10.1 The four stages of agile data organization with an intermediary ...

Figure 10.2

Figure 10.3

Figure 10.4

Figure 10.5

Figure 10.6 indicates all the types of cleaning we have done.

Figure 10.7

Figure 10.8

Figure 10.9 A typical SQL query joining tables de‐normalized to a new struct...

Chapter 11

Figure 11.1

Figure 11.2 Common modeling errors people make.

Figure 11.3 An illustration of table entities connected to a local machine s...

Chapter 12

Figure 12.1 Diagram showing the process lifecycle of model updates.

Figure 12.2

Chapter 13

Figure 13.1

Figure 13.2

Figure 13.3

Figure 13.4 A diagram of two tables joined to create a wider table.

Figure 13.5

Figure 13.6

Figure 13.7 Two tables, combined with a UNION SQL statement.

Figure 13.8 A screenshot from a dash showing a metric for new trials and dai...

Chapter 14

Figure 14.1 The stages of agile data organization showing how security for m...

Figure 14.2 Warehouse resources with security protocols applied.

Chapter 15

Figure 15.1

Figure 15.2 Data being backfilled.

Figure 15.3 A table with a deprecated metric and two strategies for renaming...

Figure 15.4

Figure 15.5

Figure 15.6 Screenshot of a Chartio Jira project board.

Figure 15.7

Figure 15.8 Any modeling code should be peer reviewed before it is incorpora...

Chapter 16

Figure 16.1

Chapter 17

Figure 17.1 An outlier identified in a sales‐per‐day line graph.

Figure 17.2 A date marked representing when a marketing campaign launched.

Figure 17.3 A point in time marked when a field name was updated.

Figure 17.4 The process of adding new columns to a data mart.

Chapter 21

Figure 21.1 Table survey in a spreadsheet.

Figure 21.2 Custom query for internal usage within a visual SQL UI.

Figure 21.3 Data Grip, an SQL editor.

Figure 21.4 An SQL editor with column insert feature selected.

Figure 21.5 An SQL editor with columns inserted.

Part 3

Figure P.1

Figure P.2

Figure P.3

Part 4

Figure P.1 Data marts are subsections of a data warehouse schema.

Figure P.2 An entire data warehouse schema.

Figure P.3 A data warehouse schema sectioned off for each department's ...

Figure P.4 Data warehouse inventory by department needs.

Guide

Cover Page

Title Page

Copyright

Dedication

Table of Contents

About This Book

Foreword

Introduction

Table of Contents

Begin Reading

Acknowledgments and Contributions

Index

WILEY END USER LICENSE AGREEMENT

Pages

iii

iv

v

xiii

xiv

xv

xvi

xvii

xviii

xix

xxi

xxii

xxiii

xxv

xxvi

xxvii

xxviii

xxix

xxx

xxxi

xxxii

xxxiii

xxxiv

xxxv

1

2

3

4

5

6

7

8

9

11

12

13

14

15

16

17

18

19

20

21

23

24

25

26

27

28

29

30

31

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

59

60

61

62

63

64

65

66

67

68

69

70

71

72

73

75

76

77

78

79

80

81

82

83

84

85

86

87

88

89

90

91

92

93

94

95

96

97

98

99

100

101

102

103

104

105

106

107

108

109

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

161

162

163

164

165

167

168

169

171

172

173

174

175

176

177

178

179

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

199

200

201

202

203

204

205

206

207

208

209

211

212

213

214

215

216

217

218

219

220

221

The Informed Company

How to Build Modern Agile Data Stacks that Drive Winning Insights

 

 

Dave FowlerMatt David

 

 

 

 

Copyright © 2022 by Dave Fowler and Matt David. All rights reserved.

Published by John Wiley & Sons, Inc., Hoboken, New Jersey.

Published simultaneously in Canada.

No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per‐copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, (978) 750‐8400, fax (978) 646‐8600, or on the Web at www.copyright.com. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748‐6011, fax (201) 748‐6008, or online at www.wiley.com/go/permissions.

Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives or written sales materials. The advice and strategies contained herein may not be suitable for your situation. You should consult with a professional where appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, incidental, consequential, or other damages.

For general information on our other products and services or for technical support, please contact our Customer Care Department within the United States at (800) 762‐2974, outside the United States at (317) 572‐3993, or fax (317) 572‐4002.

Wiley publishes in a variety of print and electronic formats and by print‐on‐demand. Some material included with standard print versions of this book may not be included in e‐books or in print‐on‐demand. If this book refers to media such as a CD or DVD that is not included in the version you purchased, you may download this material at http://booksupport.wiley.com. For more information about Wiley products, visit www.wiley.com.

Library of Congress Cataloging‐in‐Publication Data

Names: Fowler, Dave (Computer scientist), author. | Matt David, author.

Title: The informed company : how to build modern agile data stacks that drive winning insights / Dave Fowler, Matt David.

Description: Hoboken, New Jersey : Wiley, [2022] | Includes index.

Identifiers: LCCN 2021028324 (print) | LCCN 2021028325 (ebook) | ISBN 9781119748007 (paperback) | ISBN 9781119748021 (adobe pdf) | ISBN 9781119748014 (epub)

Subjects: LCSH: Data structures (Computer science) | Big data. | Cloud computing.

Classification: LCC QA76.9.D35 F69 2022 (print) | LCC QA76.9.D35 (ebook) | DDC 005.7/3—dc23

LC record available at https://lccn.loc.gov/2021028324

LC ebook record available at https://lccn.loc.gov/2021028325

Cover image: © Neo Geometric/Shutterstock

Cover design: Wiley

To my mother who continues to be my most supportive and patient teacher. As a software engineer you taught me to code for my sixth‐grade science project. Today as a Data Analyst you helped a 38‐year‐old me in discussions and edits of this book. Thank you for always supporting and encouraging my curiosities and for all your love.

— Dave Fowler

I dedicate this book to my Mom, an educator who is fueled by helping others learn. Thank you for always believing in me and being an example of how much you can affect other people’s lives.

— Matt David

About This Book

Why Write This Book

Most comprehensive books on analytics architecture that we've found are over a decade old, most of them pre‐cloud. Because there really isn't a modern equivalent to Kimball's seminal The Data Warehouse Toolkit, today's data teams have to reinvent the principles of building a data stack. Too often, they do this without guidance. To solve this problem, we have created a best‐practices guide for bootstrapping and nurturing a technologically current data warehouse.

Who This Book Is For

We wrote this book for whoever values data and believes that informed companies are competitive. It's a book for the working professional who is creating a practical, modern data stack. It's for the lone analyst or the professional embedded in a team. It's for anyone interested in what design practices underlie robust data architecture, the kind that equips entire companies with business intelligence insights. At its heart, this book is written with collaboration in mind (Figure A.1).

Figure A.1 Data management is a collaborative process.

Who This Book Is Not For

This book is not written for “big data” professionals. To be clear, even large corporations like Doordash, Discord, and the owners of The Financial Times and The New York Times (all previous customers of ours) do not qualify as big data companies. As a rule of thumb, the big data label applies to data architectures with raw input that exceeds 100 GB per day.

No doubt, many elements of this text map onto the big data workflow, especially since warehouses support all sorts of tables, not just, say, event streams. However, our aim is to focus on the central pillars of a modern data stack, so that the widest set of readers can readily benefit from the information herein. In this spirit, we forgo recommendations for mega‐scale architectures.

This book is not for AI‐enabled teams and does not cover AI workflows, machine learning models, or real‐time operational use cases. Instead, its goal is to provide best practices for building and maintaining a robust data analytics stack (i.e. the analytics foundation on which an AI workflow can be built).

If you are a small business that can run everything with Quickbooks and Excel, that ability is great. Data is important for all companies, but if these tools are already serving you well, the book may not offer helpful guidance. If you start exceeding the data capacity of Excel or bring in a data source that needs to be in a database to be analyzed, then keep reading.

Who Wrote the Book

This book was written by Dave Fowler and Matt David.

Dave Fowler has worked in BI for over a decade, and has always looked for ways to JOIN teams ON data . He wants to enable any working professional (not just data analysts) to explore and understand their data. As the founder and CEO of Chartio, Dave has spent the last 11 years leading the development of a self‐service BI product that aims to do just that. Chartio's suite of tools make it easy for anyone at a data‐driven business to browse their schemas, merge various data sources, and produce beautiful dashboards. In March 2021, Atlassian acquired Chartio and is integrating it into their platform.

Matt David has worked in product management and education for eight years. As data becomes a necessary skill for more and more jobs, he passionately advocates for data literacy among the workforce. As the current head of The Data School, he oversees the production of free, online resources focused on leveraging data within companies. Recent book topics include SQL optimization, data governance, and common analysis biases. Dave started The Data School, and together he and Matt have grown it into an important free resource for the data community. He previously worked at Udacity and General Assembly teaching analytics.

Dave and Matt decided to co‐write this book after seeing how many people struggle when constructing data stacks and then trying to use them. This book was created with the support of many employees at Chartio. They graciously provided insights into how customers model their data and collected frequently asked data‐infrastructure questions. Their contributions guided the production of this text.

Who Edited the Book

This book was reviewed and edited by Emilie Schario, Mila Page, and David Yerrington. Emilie is the head of data at Netlify and previously helped build Gitlab's entire data organization. She regularly writes and speaks on all things related to modern data. Mila is a developer relations advocate at dbt Labs, the makers of dbt (data build tool). She helps data professionals learn and apply modern analytics‐engineering practices, and is an organizer for Coalesce, the dbt Community’s annual conference. David is a Data Science Consultant and was the Global Lead Data Science Instructor at General Assembly. He helps people around the world better leverage their data. Emilie, Mila, and David have shaped the narrative and content of this book. Their (sometimes) line‐by‐line feedback has ensured that we can proudly stand behind our recommendations.

Influences

We've drawn on several sources of information and opinion when writing this text. While at Chartio, we worked with hundreds of modern cloud‐based customers. We've collected, implemented, and refined these practices ourselves, and through writing this book, vetted them further with partners and customers. We've also learned from the data community through dataschool.com, blogs like Tristan Handy's, and data‐focused slack communities.

And lastly, it's worth noting and thanking some classic books that informed the previous generation of warehousing toolkits. We honor them by echoing their terminology and best practices wherever possible:

Agile Data Warehouse Design

by Lawrence Corr

The Data Warehouse Toolkit

by Ralph Kimball

Information Dashboard Design

by

Stephen Few

(

my review here

)

How This Book Was Written

This book originates in part from a project within The Data School (Figure A.2), a collection of free online books and interactive tutorials on managing and leveraging data (see dataschool.com). These resources are always expanding, much like the articles of Wikipedia: each round of updates sees our ebooks cover additional topics, go deeper on established ideas, share more real‐world examples, and better deliver that content. Our goal is to maintain and improve these resources and keep them modern.

Figure A.2

Source: The Data School

Few are complete “experts” in all of the areas of modern data governance, and the landscape is changing all of the time. If you have a story to share, or a chapter you think is missing, or a new idea, email us. Even if you don't know what specifically to share, but you don't mind sharing your story, please reach out as we are particularly interested in adding real‐world experiences and insights.

There is already too much jargon in the data world, often created by talented vendor marketing teams. We try to stick with the most common and straightforward words that are already in use. For any jargon we do find necessary, we include a definition.

There are many books for the old ways of working with data. We're highlighting current best practices here, so we ignore outdated terminology and techniques. In a few cases where it is beneficial to talk about industry evolution—like the change from ETL to ELT—we teach ELT and discuss the choice in a separate chapter.

Almost every part of this book could be contentious to someone, in some use case or to some vendor. In writing this book, it is tempting to bring up the caveats everywhere and write what would ultimately be a very defensive and overly explained book. We believe this type of book is way less useful for people seeking straightforward advice. Where we have a strong opinion, we don't argue it; we just go with it. Where we think the user has a legitimate choice to make, we pose those options.

This book aims to provide a broad overview and general guidelines on how to set up a data stack. We intentionally gloss over the details of launching a Redshift instance, writing SQL, or using various BI products. That would clutter the text, repeat what's already on the internet, and make the read quite stale.

How to Read This Book

The book starts with a quick overview and decision charts about what the stages are and what stage is appropriate for you. This book is structured with a section for each of the four stages, and if you'd like, you can jump ahead to the stage you're at.

Not every company needs the entirety of this book. As a growing company's data needs expand, more and more of the book becomes valuable. Note, though, many best practices presented at each stage appear when they start to be relevant. These practices assume they are useful from the point they appear in the book, onward, to avoid redundancy. So it may benefit you to at least skim those earlier stages even if you and your company are further ahead.

At the end of the book we have a section where we describe what has changed in the data world that makes this new architecture relevant and performant. We avoid explaining how our recommendations differ from previous practices like Kimball Dimensional modeling so as not to clutter the experience. Such discussions are necessary, however, and we've put them in this last section of the book.

Lastly, throughout the book you will see the following icons:

Definitions

They are related to a term found on the same page. For example, on this page, the term “data lake” is mentioned. A data lake is a staging area for several data sources.

Protips

Protips expand on an idea or provide additional information about a topic related to what you read within a given chapter.

Foreword

In 2015, I used a product called Amazon Redshift. At the time, I had spent the prior 15 years of my career in a variety of roles all centered around their use of data, from analytics to marketing to operations. And while I considered my data competency my biggest professional differentiator, I had also become deeply frustrated. For all of the supposed progress in the data ecosystem, it was still slow, hard, and expensive to get insights out of data.

But my first experience with Redshift is where that all changed for me. I have such a visceral memory of the first hour I spent with the product: queries I ran returned so fast that it seemed like absolute magic. I had spent years and years of my career writing queries and waiting for the MacOS “spinner” icon to stop spinning. Now, all the sudden, these same queries weren’t 20% faster…they were 10 to 100 to 1000x faster. I felt like I had superpowers.

I'll let Dave and Matt actually explain how the modern data warehouse can achieve these types of performance results, but for now, just trust me that it can and does. Given that, the fascinating question is actually: what does this mean for people like you and me?

What kind of “people” do I mean? You know—people who are involved in making decisions at companies and want to use data to help us. People who likely over the years have acquired a variety of data skills, whether that's Excel VLOOKUPs, Google Analytics