Mastering Tableau - David Baldwin - E-Book

Mastering Tableau E-Book

David Baldwin

0,0
41,99 €

-100%
Sammeln Sie Punkte in unserem Gutscheinprogramm und kaufen Sie E-Books und Hörbücher mit bis zu 100% Rabatt.

Mehr erfahren.
Beschreibung

Master the intricacies of Tableau to create effective data visualizations

About This Book

  • Arm yourself with an arsenal of advanced chart types and geocoding to efficiently and engagingly present information
  • Map a grid over a network node diagram and use that grid to demonstrate loads, processing time, and more in Tableau
  • Integrate R with Tableau by utilizing R functions, libraries, and saved models

Who This Book Is For

If you are a business analyst without developer-level programming skills, then this book is for you. You are expected to have at least a fundamental understanding of Tableau and basic knowledge of joins, however SQL knowledge is not assumed. You should have basic computer skills, including at least moderate Excel proficiency.

What You Will Learn

  • Create a worksheet that can display the current balance for any given period in time
  • Recreate a star schema from in a data warehouse in Tableau
  • Combine level of detail calculations with table calculations, sets, and parameters
  • Create custom polygons to build filled maps for area codes in the USA
  • Visualize data using a set of analytical and advanced charting techniques
  • Know when to use Tableau instead of PowerPoint
  • Build a dashboard and export it to PowerPoint

In Detail

Tableau has emerged as one of the most popular Business Intelligence solutions in recent times, thanks to its powerful and interactive data visualization capabilities. This book will empower you to become a master in Tableau by exploiting the many new features introduced in Tableau 10.0.

You will embark on this exciting journey by getting to know the valuable methods of utilizing advanced calculations to solve complex problems. These techniques include creative use of different types of calculations such as row-level, aggregate-level, and more. You will discover how almost any data visualization challenge can be met in Tableau by getting a proper understanding of the tool's inner workings and creatively exploring possibilities.

You'll be armed with an arsenal of advanced chart types and techniques to enable you to efficiently and engagingly present information to a variety of audiences through the use of clear, efficient, and engaging dashboards. Explanations and examples of efficient and inefficient visualization techniques, well-designed and poorly designed dashboards, and compromise options when Tableau consumers will not embrace data visualization will build on your understanding of Tableau and how to use it efficiently.

By the end of the book, you will be equipped with all the information you need to create effective dashboards and data visualization solutions using Tableau.

Style and approach

This book takes a direct approach, to systematically evolve to more involved functionalities such as advanced calculation, parameters & sets, data blending and R integration. This book will help you gain skill in building visualizations previously beyond your capacity.

Sie lesen das E-Book in den Legimi-Apps auf:

Android
iOS
von Legimi
zertifizierten E-Readern

Seitenzahl: 433

Veröffentlichungsjahr: 2016

Bewertungen
0,0
0
0
0
0
0
Mehr Informationen
Mehr Informationen
Legimi prüft nicht, ob Rezensionen von Nutzern stammen, die den betreffenden Titel tatsächlich gekauft oder gelesen/gehört haben. Wir entfernen aber gefälschte Rezensionen.



Table of Contents

Mastering Tableau
Credits
About the Author
www.Packtpub.com
Why subscribe?
Preface
What this book covers
What you need for this book
Who this book is for
Conventions
Reader feedback
Customer support
Downloading the example code
Downloading the color images of this book
Errata
Piracy
Questions
1. Getting Up to Speed – a Review of the Basics
The Tableau universe
Understanding the Tableau interface and basic terminology
Worksheet and dashboard creation
Worksheet creation
Exercise - fundamentals of visualizations
Beyond default behavior
Exercise - overriding defaults
Show Me
Dashboard creation
Exercise - building a dashboard
Exercise - adding interactivity to a dashboard
Connecting Tableau to your data
Excel and text files
Comparing and contrasting Native Tableau Driver and MS Jet Driver
Connecting to a Tableau Server
Exercise - observing metadata differences
Exercise steps
Exercise - connecting to Tableau Server
Connecting to saved data sources
Exercise - creating a local data connection
Measure Names and Measure Values
Measure Names and Measure Values - the basics
Exercise - Measure Names and Measure Values
Measure Names and Measure Values - shortcuts
Exercise - Measure Names and Measure Values shortcuts
Exercise commentary
Three essential Tableau concepts
Essential concept 1 - dimensions and measures
Exercise - dimensions and measures
Essential concept 2 - Row Level, Aggregate Level, and Table Level
Exercise - Row Level, Aggregate Level, and Table Level
Exercise commentary
Essential concept 3 - continuous and discrete
Summary
2. All about Data – Getting Your Data Ready
Understanding Tableau's data-handling engine
The Tableau Data-Handling Engine
Changing field attribution example
Table calculation example
DHE takeaways
Data-mining and knowledge-discovery process models
Survey of the process models
CRISP-DM
CRISP-DM phases
Focusing on data preparation
Surveying data
Exercise - surveying data
Exercise - extrapolating data
Exercise commentary
Cleaning the data
Exercise - cleaning the data
Exercise steps
Exercise commentary
Exercise - extracting data
Exercise steps
Exercise commentary
Summary
3. All about Data – Joins, Blends, and Data Structures
About joins
Complex joins
Exercise - observing join culling
Exercise steps
Exercise commentary
Data blending
Order of operations for aggregating and matching
Exercise - a data blend versus a left join
Exercise steps
Understanding the join
Understanding the data blend
No dimensions from a secondary source
Exercise - adding secondary dimensions
Scaffolding
Exercise - enhanced forecasting via scaffolding
Data structures
Exercise - adjusting data structure for different questions
Exercise steps - part 1
Exercise steps - part 2
Summary
4. All about Data – Data Densification, Cubes, and Big Data
About data densification
Domain completion
How domain completion can be deployed
Exercise - activating domain completion in a crosstab part I
Exercise - activating domain completion in a crosstab part II
Exercise – activating domain completion via View Types
How domain completion is useful
Exercise - labelling nulls
When domain completion is a problem
Exercise - unwanted domain completion
Domain padding
Deploying domain padding via Show Empty Rows/Columns
Exercise - activating domain padding via Show Empty Rows/Columns
How domain padding is useful
Exercise - domain padding - filling date gaps
When domain padding is a problem
Exercise - from a domain padded visualization to a crosstab
Working with cubes
Exercise - using a data blend for continuous months when accessing a cube
Exercise steps
Exercise commentary
Exercise - using a data blend for hierarchies, aliasing, and grouping when accessing a cube
The deprecation of cubes
Tableau and big data
Exercise - a strategy for addressing Excel's row limitation
Massively parallel processing
Exercise - building a visualization with Google BigQuery
Exercise steps
Summary
5. Table Calculations
A definition and two questions
What is the function?
Directional and non-directional
Exercise - exploring directional and non-directional table calculation functions
Exploring each unique table calculation function
Exercises - exploring each unique table calculation function
LOOKUP and TOTAL
PREVIOUS_VALUE
RUNNING
WINDOW
FIRST and LAST
INDEX
RANK
SIZE
How is the function applied?
Building a playground
Partitioning and addressing with one dimension
Partitioning and addressing with two dimensions
Partitioning and addressing with three dimensions
Summary
6. Level of Detail Calculations
Building playgrounds
Playground 1 - FIXED and EXCLUDE
Exercise - exploring FIXED and EXCLUDE - setting up the workbook
Inspecting the worksheet and initial setup
Exercise steps - exploring FIXED and EXCLUDE - understanding FIXED
Exercise steps - exploring FIXED and EXCLUED - understanding EXCLUDE
Exercise - exploring FIXED and EXCLUDE - understanding order of filtering
Exercise commentary
Playground #2 - INCLUDE
Inspecting the worksheet and initial setup
Exercise steps - exploring INCLUDE
Practical application
Exercise - practical FIXED
Exercise steps - practical FIXED - the problem
Exercise steps - practical FIXED - table calc solution
Exercise steps - practical FIXED - LOD solution
Exercise commentary
Exercise - practical INCLUDE
Exercise steps part 1 - solve by using an LOD calculation
Exercise steps part 2 - solve without an LOD calculation
Exercise commentary
Exercise - practical EXCLUDE
Exercise steps part 1 - solve by using an LOD calculation
Exercise steps part 2 - solve by using blending
Exercise commentary
Summary
7. Beyond the Basic Chart Types
Improving popular visualizations
Bullet graphs
Exercise steps for bullet graph – the basics
Exercise steps for bullet graph – beyond the basics
Making useful pies and donuts on maps
Exercise – pies and donuts on maps
Exercise steps for pies and donuts – the basics
Exercise steps for pies and donuts – beyond the basics
Pareto charts
Exercise steps for a Pareto chart – the basics
Exercise steps for a Pareto chart – beyond the basics
Custom background images
Exercise – creating a grid
Exercise – steps for creating a grid
Exercise – steps for using a grid to generate a dataset
Exercise – visualizing a chess game
Exercise – creating polygons on a background image
Exercise – steps for creating polygons on a background image
Summary
8. Mapping
Extending Tableau mapping capabilities without leaving Tableau
Exercise - displaying routes and calculating distances
Extending Tableau mapping with other technology
Exercise - connecting to a WMS server
Exploring the TMS file
The TMS file structure
Accessing popular map servers
ArcGIS
Stamen
Exploring Mapbox
Exercise - Mapbox classic
Exercise – Mapbox GL
Accessing different maps via a dashboard
Exercise – swapping maps
Creating custom polygons
Exercise - drawing a square around Null Island
Exercise steps
Converting shape files for tableau
Exercise – polygons for Texas
Exercise steps
Exercise commentary
Summary
9. Tableau for Presentations
Getting the best images out of Tableau
A brief survey of screen capture tools
Tableau's native export capabilities
The five export types
From Tableau to PowerPoint
Exercise – create a template
Exercise – create two dashboards
Exercise – create a PowerPoint presentation
Exercise – automating a weekly PowerPoint presentation
Embedding Tableau in PowerPoint
Exercise – create an interactive PowerPoint presentation
Animating Tableau
Exercise – creating an animation with Tableau
Exercise – using an animation to export many images
Exercise – using an animation in Tableau to create an animation in PowerPoint
Story Points dashboards for presentations
Presentation resources
Exercise – using Tableau dashboards to create a PowerPoint-like presentation
Summary
10. Visualization Best Practices and Dashboard Design
Visualization design theory
Formatting rules
Rule – keep the font choice simple
Rule - Trend line > Fever line > Reference line > Drop line > Zero line > Grid line
Rule – band in groups of three to five
Exercise  – banding
Color rules
Rule – keep colors simple and limited
Rule – respect the psychological implication of colors
Rule – be colorblind-friendly
Rule – use pure colors sparingly
Exercise – using pure colors
Rule – color variations over symbol variation
Visualization type rules
Rule – keep shapes simple
Exercise – shapes
Rule – use pie charts sparingly
Compromises
Make the dashboard simple versus make the dashboard robust
Presenting dense information versus presenting sparse information
Telling a story versus allowing a story to be discovered
Document, document, document! But don't use any space!
Exercise – tooltips for extensive help
Keeping visualizations simple versus exploring esoteric chart types
Dashboard design
Dashboard layout
Golden Rectangle layout
Quad layout
Small Multiple layout
Sheet selection
Exercise - sheet swapping pie charts and treemaps
Exercise – collapsible menu
Summary
11. Improving Performance
Understand the Performance Recording dashboard
Exercise - exploring Performance Recording in Tableau Desktop
Performance Recording dashboard events
Behind the scenes of the Performance Recording dashboard
Hardware and on-the-fly techniques
Hardware considerations
On-the-fly techniques
Exercise - pause/resume auto updates
Exercise - Run Update
Exercise - small extracts
Single Data Source > Joining > Blending
Three ways Tableau connects to data
Using referential integrity when joining
Exercise - referential integrity
When it's necessary or advantageous to blend
Exercise - necessary blending
Working efficiently with data sources
Tuning data sources
Primary and foreign keys
NOT NULL
What is an index?
What should be indexed?
Working efficiently with large data sources
Intelligent extracts
Understanding the Tableau Data Extract
Constructing a TDE for optimal performance
Exercise - summary aggregates for improved performance
Optimizing extracts
Exercise - materialized calculations
Using filters wisely
Extract filter performance
Data source filter performance
Context filters
Dimension and measure filters
Table calculation filters
Exercise - late filter
Efficient calculations
Boolean/Numbers > Date > String
Exercise - an efficient and an inefficient way to determine N figure salary
Exercise - date versus integer
Additional performance considerations
Avoid overcrowding a dashboard
Fix dashboard sizing
Set expectations
Summary
12. Interacting with Tableau Server
Tableau file types
Tableau Data Source (.tds)
Tableau Packaged Data Source (.tdsx)
Exercise - publish a data source to Tableau Server
Tableau Workbook (.twb)
Tableau Packaged Workbook (.twbx)
Other file types
Tableau Server architecture
Tableau Server architecture approaches to avoid
Tableau Server architecture - TWB Centric
Tableau Server architecture - TWBX Centric
Tableau Server architecture approaches to adopt
Tableau Server architecture - TDS Centric
Tableau Server architecture - TDSX Centric
Tableau Server revision history
Tableau Server web authoring environment
Basic web authoring instructions
Exercise - edit an existing workbook on Tableau Server
Exercise - create a new workbook on Tableau Server
Exploring the capabilities and limitations of the Tableau Server web authoring environment
Exercise - the Tableau Server web authoring environment
Comparing and contrasting Tableau Desktop with the Tableau Server web authoring environment
User filters
Exercise - deploying a view level user filter
Accessing the Tableau Server Performance Recording dashboard
Exercise - exploring performance recording on Tableau Server
Summary
13. R Integration
Architecture, installation, and integration
Architecture
Installation
Installing R
Integration - starting Rserve, connecting with Tableau, and installing RStudio Desktop
Using R functions
Exercise - reproducing native Tableau functionality in R
Correlation
Exercise - correlation
Regression analysis
Exercise - regression analysis
Clustering
Exercise - clustering
Quantiles
Exercise - quantiles
Troubleshooting
Give R what it expects
R Scripts are table calculations
Performance challenges
Summary

Mastering Tableau

Mastering Tableau

Copyright © 2016 Packt Publishing

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: November 2016

Production reference: 1231116

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham 

B3 2PB, UK.

ISBN 978-1-78439-769-2

www.packtpub.com

Credits

Author

David Baldwin 

Copy Editor

Vikrant Phadke

Reviewer

Tamas Foldi

Project Coordinator

Shweta H Birwatkar 

Commissioning Editor

Veena Pagare

Proofreader

Safis Editing

Acquisition Editor

Vinay Argekar

Indexer

Aishwarya Gangawane

Content Development Editor

Sumeet Sawant

Graphics

Disha Haria

Technical Editor

Akash Patel

Production Coordinator

Nilesh Mohite

About the Author

David Baldwin has provided consulting in the business intelligence sector for 17 years. His experience includes Tableau training and consulting, developing BI solutions, project management, technical writing, and the web and graphic design. His vertical experience includes financial, healthcare, human resource, aerospace, energy, education, government, and entertainment industries. As a Tableau trainer and consultant, David enjoys serving a variety of clients throughout the USA. Tableau provides David a platform that collates his broad experience into a skill set that can service a diverse client base.

Many people provided invaluable support in the writing of this book. Although I cannot name everyone, there are those to whom I would like to draw special attention: My wife, Kara, was an unfailing encourager, supporter, and cheerleader throughout the writing journey. My children, Brent and Brooke, were very understanding of their dad’s many long hours in front of a laptop at the dining room table. My mother, Bettye, was my first and best writing instructor and thus provided a foundation for clear communication. My father, Larry, taught me the importance of precise technical and mathematical thinking. My sister, Chelsea, modeled perseverance as she pursued and achieved advanced degrees. Also I’d like to thank my colleagues at Teknion for being ever willing to entertain questions, provide valuable feedback, and read rough drafts, particularly Bridget Cogley, Matthew Agee, Preston Howell, and especially Joshua Milligan.

www.Packtpub.com

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at [email protected] for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www.packtpub.com/mapt

Get the most in-demand software skills with Mapt. Mapt gives you full access to all Packt books and video courses, as well as industry-leading tools to help you plan your personal development and advance your career.

Why subscribe?

Fully searchable across every book published by PacktCopy and paste, print, and bookmark contentOn demand and accessible via a web browser

Preface

So what is this book about? The title certainly points in the right direction: Mastering Tableau. The word Mastering implies a journey to a level of competency beyond mere familiarity or superficial knowledge. The word Tableau, of course, limits the scope of a particular software package. Let’s extend the title by one word in order to hone the focus: Mastering Tableau Desktop. The word Desktop further narrows consideration by communicating that this book is not focused on Tableau Server, although there is a chapter dedicated to interacting with Server. Nor does this book dive deep into topics beyond the realm of Tableau, though other technologies such as R and SQL are discussed as they pertain to Tableau. Furthermore, this book is not focused on data visualization or architectural theory per se, though these topics are explored and every attempt is made to adhere to sound methodology as technical problems are discussed. Instead, this book attempts to build on a foundation of an already basic understanding of Tableau Desktop so as to provide a theoretical and practical basis for solving real-world challenges in an efficient and elegant manner. Along the way, many tips and tricks for use in everyday work are discussed and exercises with careful step-by-step instructions and commentary are provided.

What this book covers

Chapter 1, Getting Up to Speed - a Review of the Basics, provides a quick on-ramp for those new to Tableau and a useful review for those with experience. For a more thorough consideration of fundamental topics, see Learning Tableau, written by Joshua Milligan and published by Packt Publishing.

Chapter 2, All about Data - Getting Your Data Ready, commences a series of three "All about Data" chapters. The chapter begins with a theoretical discussion of the Tableau data paradigm and data mining topics and then moves on to practical ways to use Tableau to survey and cleanse data.

Chapter 3, All about Data - Joins, Blends, and Data Structures, explores complex joins, data blending, and pivoting.

Chapter 4, All about Data - Data Densification, Cubes, and Big Data, ends the series of "All about Data" chapters by surveying a variety of data topics, including the undocumented world of data densification, working with cubes and big data considerations.

Chapter 5, Table Calculations, focuses on two questions: “What is the function?” and “How is the function applied?” These questions provide a framework for discussing directional and non-directional table calculations as well as partitioning and addressing.

Chapter 6, Level of Detail Calculations, begins with two playground environments created in Tableau designed to provide a foundation for understanding level-of-detail calculations and then moves on to practical application.

Chapter 7, Beyond the Basic Chart Types, looks at improving some popular visualization types and then considers the largely underexplored topic of using background images in Tableau. The workbook provided with this chapter also provides many additional visualization types.

Chapter 8, Mapping, begins by considering how to expand Tableau’s native mapping capabilities without leaving the interface, and then explores extending Tableau mapping via other technologies, including connecting to WMS servers and MapBox. Lastly, the chapter demonstrates how to provide the end user options for choosing different maps and ends with a discussion on custom polygons.

Chapter 9, Tableau for Presentations, discusses techniques for integrating Tableau with PowerPoint as well as how to use Tableau as a standalone presentation tool via animation and story points.

Chapter 10, Visualization Best Practices and Dashboard Design, begins by considering design topics such as formatting, color, and visualization types and then addresses dashboard layout options. The chapter ends by exploring sheet swapping in some depth.

Chapter 11, Improving Performance, is the longest chapter of the book and attempts to systematically (though not exhaustively) cover options for optimizing Tableau performance.

Chapter 12, Interacting with Tableau Server, explores how to optimize Tableau Server architecture for best performance and easiest maintenance. The chapter also considers the web authoring environment, user filters, and accessing the Performance Recording dashboard via Tableau Server.

Chapter 13, R Integration, begins by considering how to install and integrate R with Tableau and then explores R and Tableau integration via a series of exercises. The chapter ends with a troubleshooting section.

What you need for this book

In order to make use of this book, an installation of Tableau 10 is required. The following technologies are mentioned and lightly utilized in this book but are not strictly required:

Chapter 2, All about Data - Getting Your Data Ready, makes light use of Adobe Acrobat.Access to SQL Server is helpful in a few places, especially in Chapter 4, All about Data - Data Densification, Cubes, and Big Data.Chapter 7, Beyond the Basic Chart Types, assumes access to Inkscape to complete two exercises (Inkscape is an open source vector drawing application that is similar to Adobe Illustrator).Access to Tableau Server is helpful in a few places, especially Chapter 12, Interacting with Tableau Server, where permissions for publishing data sources and editing in the web authoring environment are assumed.Access to Rserve is assumed for Chapter 13, R Integration. RGui and RStudio are also helpful but not strictly required.

Who this book is for

Mastering Tableau targets persons with 5+ months of experience using Tableau. Although not strictly required, a thorough reading of the predecessor to this book, Learning Tableau, is helpful. Alternatively, the Desktop I and II training provided by Tableau provides a helpful foundation. A basic knowledge of SQL is helpful in a few sections. A basic knowledge of Excel is assumed.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

To send us general feedback, simply send an e-mail to [email protected], and mention the book title via the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on https://www.packtpub.com/books/info/packt/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for all Packt books you have purchased from your account at http://www.packtpub.com. If you purchased this book elsewhere, you can visit http://www.packtpub.com/support and register to have the files e-mailed directly to you.

You can download the code files by following these steps:

Log in or register to our website using your e-mail address and password.Hover the mouse pointer on the SUPPORT tab at the top.Click on Code Downloads & Errata.Enter the name of the book in the Search box.Select the book for which you're looking to download the code files.Choose from the drop-down menu where you purchased this book from.Click on Code Download.

Once the file is downloaded, please make sure that you unzip or extract the folder using the latest version of:

WinRAR / 7-Zip for WindowsZipeg / iZip / UnRarX for Mac7-Zip / PeaZip for Linux

The code bundle for the book is also hosted on GitHub at https://github.com/PacktPublishing/Mastering-Tableau. We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Downloading the color images of this book

We also provide you a PDF file that has color images of the screenshots/diagrams used in this book. The color images will help you better understand the changes in the output. You can download this file from: https://www.packtpub.com/sites/default/files/downloads/MasteringTableau_ColorImages.pdf.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/submit-errata, selecting your book, clicking on the errata submission form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded on our website, or added to any list of existing errata, under the Errata section of that title. Any existing errata can be viewed by selecting your title from http://www.packtpub.com/support.

Piracy

Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet, please provide us with the location address or website name immediately so that we can pursue a remedy.

Please contact us at [email protected] with a link to the suspected pirated material.

We appreciate your help in protecting our authors, and our ability to bring you valuable content.

Questions

You can contact us at [email protected] if you are having a problem with any aspect of the book, and we will do our best to address it.

Chapter 1. Getting Up to Speed – a Review of the Basics

The goal of this book is to empower you to become a Tableau master; in Tableau-speak, the term is Jedi. Yes, that is official Tableau terminology. Attend the yearly Tableau conference and you can sit in on Jedi classes. Of course, simply attending a class will not automatically bestow you with Jedi powers - nor will simply reading this book. Diligent work on real-world problems is absolutely essential. Couple this diligent work with industrious study and you will make it. You will become a Tableau Jedi. My hope is that this book will prove useful to you on your journey to mastery.

If you are a seasoned Tableau author, you may find this initial chapter elementary. (A person who creates Tableau workbooks is referred to as an author, not a developer.) For such persons, I recommend a quick, inspectional read. If after a few minutes you are satisfied you already possess a solid understanding of the concepts discussed, feel free to proceed to subsequent chapters. If, however, you find some of the content unfamiliar, it may be wise to read with greater attention.

Those who are fairly new to Tableau should find this chapter helpful in getting up to speed quickly; however, since this book targets advanced topics, relatively little time is spent considering the basics. For a more thorough consideration of fundamental topics, consider Learning Tableau, written by Joshua Milligan and published by Packt Publishing.

In this chapter, we will discuss the following:

The Tableau universeUnderstanding the Tableau interface and basic terminologyWorksheet and dashboard creationConnecting Tableau to your dataMeasure Names and Measure ValuesThree essential Tableau concepts

The Tableau universe

Tableau Software has a focused vision resulting in a small product line. The main product (and hence the center of the Tableau universe) is Tableau Desktop. Assuming you are a Tableau author, that's where almost all your time will be spent when working with Tableau. But of course you must be able to connect to data and output the results. Thus, as shown in the following figure, the Tableau universe encompasses data sources, Tableau Desktop, and output channels, which include the Tableau Server family and Tableau Reader:

Data Sources

Tableau connects to many data sources. Those data sources will be discussed in more detail in the following section.

Tableau Desktop

Tableau Desktop is where visualizations are created. Although, as of Tableau 8.0, some authoring capabilities were introduced into the Tableau Server environment, that environment is limited. (See Chapter 12, Interacting with Tableau Server to learn more about authoring in the Tableau Server environment.) Thus, the heavy lifting is still done in Tableau Desktop.

Tableau Server Family

Once completed in Tableau Desktop, a workbook can be uploaded to Tableau Server for end-user access. Tableau Server provides a secure, web-based environment where end users can access visualizations created in Desktop either through a browser or via the Tableau Mobile app for Android and iPhone.

Tableau Online is a cloud-based version of Tableau Server hosted by Tableau Software. It's an ideal solution for smaller organizations that need the security and flexibility of Server without the associated overheads.

Tableau Public is, in reality, split into two products: the Tableau Public client, and a cloud-based, public-facing version of Tableau Server. The client has the capabilities of Desktop, with a few major exceptions:

The data sources you can connect to are very limited (for example, Excel, Access, text file formats and web data connectors)You can only publish to Tableau PublicYou are limited to 15 million rows of data per workbook

Tableau Reader

The relationship between Tableau Desktop and Tableau Reader is synonymous to that between Adobe Acrobat and Adobe Reader. Desktop is used for authoring; Reader is used for viewing. Desktop has an associated cost; Reader is free.

A few brief notes regarding Reader:

Filters, dashboard actions, animation and parameters are all accessible on Reader, but no authoring is enabledReader is useful for presentations when Desktop is not availableReader can only be used with packaged workbooks that do not contain live connections to other data sources

Understanding the Tableau interface and basic terminology

To begin our survey of the basics, let's consider the terminology and associated definitions of assets that make up the Tableau workspace. Lingering a little on these terms should prove helpful, since each is used throughout the book:

Go to Start

Click to toggle between the start page and the workspace.

Data Pane

Provides access to the data source, all fields, sets, and parameters, and displays the underlying data when the View Data icon is clicked.

Data Source

Lists all data sources and provides access to edit data sources, create data extracts, publish data sources, and more.

Dimensions

Lists all fields classified as Dimensions.

Measures

Lists all fields classified as Measures.

Sets

Lists all sets created by the Tableau author.

Parameters

Lists all parameters created by the Tableau author.

Data Source Tab

Provides access to the Data Source page.

Shelves

Areas where fields are placed to create views.

Legend

Serves the dual purpose of communicating Color, Size, and Shape information that exists in the view, as well as providing sorting, filtering, and highlighting capabilities.

Status Bar

Displays information about the current view.

Fit

Determines how a view is sized on the screen.

View

That which the end user sees via Tableau Server or Reader. Includes the visualization, legends, displayed filters, parameters, captions, and so on.

Pills

Fields, sets, or parameters that have been placed on one or more shelves. So named because the shape resembles a pill.

Show Me

Tool used to automatically create visualizations based on selected fields and any fields already placed on shelves.

Displayed Filter

A field exposed to the end user providing the ability to display/hide a numeric or date range of the field or display/hide members of the field. In earlier versions of Tableau this was referred to as a Quick Filter.

Connecting Tableau to your data

At the time of writing, Tableau's Data Connection menu includes 50 different connection types, and that is somewhat of an understatement since some of those types contain multiple options. For example, the selection choice, Other Files, includes 21 options. Of course, we won't cover the details for every connection type, but we will cover the basics.

Upon opening a new instance of Tableau Desktop, you will note a link in the upper left-hand corner of the workspace. Clicking on that link will enable you to connect to data. Alternatively, you can simply click on the New Data Source icon on the toolbar:

Although subsequent chapters will consider connecting to other data sources, here we will limit the discussion to considerations when connecting to Excel and text files.

Excel and text files

Upon choosing to connect to an Excel or text file, the Tableau author is presented with two choices. Note that those choices are somewhat hidden. As shown in the following screenshot, you will need to click on the arrow next to the Open button to access them:

The Open option uses a native Tableau driver. The Open with Legacy Connection option accesses the Microsoft JET driver. Let's compare and contrast some of the differences between these two drivers.

Comparing and contrasting Native Tableau Driver and MS Jet Driver

Native Tableau Driver

MS Jet Driver

More set capabilities such as in/out and combined sets

Limited set capabilities

Count Distinct is allowed

Count Distinct is disallowed

Allows more than 255 columns

Columns are capped at 255

Special characters, such as brackets and quotation marks, are allowed in file and field names

Special characters are disallowed in file and field names

When connecting to Excel, the data type is determined by 95% of the first 10,000 rows

When connecting to Excel, the data type is determined by the first eight rows

Cannot connect to .xlsb files

Can connect to .xlsb files

File names can be any length

File names are limited to 64 characters

Custom SQL is not allowed

Custom SQL is allowed

Left and inner joins are allowed

Left, inner, and right joins are allowed

Pivot data from rows to columns

No pivoting feature

Improved header auto-detection

Note that the preceding table is not complete. There are many other differences between the functionality of Native Tableau Driver and MS Jet Driver. Most of those, however, are less consequential.

So, when should you use Native Tableau Driver versus MS JET Driver? In short, use the native Tableau driver! In almost every case it will provide better performance and more functionality. One exception is when custom SQL is required. Tableau Software does not recommend using custom SQL in most cases because Tableau-generated SQL will run more efficiently; however, in some cases it may be necessary.

Connecting to a Tableau Server

Connecting to Tableau Server is perhaps the single most important server connection type to consider, since it is frequently used to provide better performance than may otherwise be possible. Additionally, connecting to Tableau Server enables the author to receive not only data, but information regarding how that data is to be interpreted, for example, whether a given field should be considered a measure or a dimension. Let's explore this further via two exercises.

Exercise - observing metadata differences

As a precursor to connecting to Tableau Server, let's compare and contrast the instance of the Superstore data source represented in the workbook associated with this chapter (that is, the Chapter 1 workbook) with a new connection to the same data.

Exercise steps

In a new instance of Tableau, navigate to Data | New Data Source | Excel to connect to the Sample - Superstore dataset that installs with Tableau desktop (it should be located on your hard drive under My Tableau Repository | Datasources).Double-click on the Orders sheet.Click on the Sheet 1 tab.Place Discount on the Text shelf.Double-click on Profit and Sales.Compare the results of the new worksheet to that of the worksheet entitled Observing Metadata Differences in the Chapter 1 workbook:
A: The data source name has been altered in the Chapter 1 workbookB: In the Chapter 1 workbook, the default aggregation of Discount is AVG. In the unaltered instance the default is SUMC: Product Hierarchy exists only in the Chapter 1 workbookD: The format of Discount, Profit, and Sales differs between the two instancesE: Profit Ratio exists only in the Chapter 1 workbook

Exercise - connecting to Tableau Server

In order to complete this exercise, access to an instance of Tableau Server is necessary. If you do not have access to Tableau Server, consider installing a trial version on your local computer:

In the workbook associated with this chapter, navigate to the Connecting to Tableau Server worksheet.Right-click on the Superstore data source and select Publish to Server.Log in to Tableau Server and follow the prompts to complete the publication of the data source.After the data source has been published, open a new instance of Tableau Desktop and navigate to Data | New Data Source | Tableau Server to connect to the data source published in the previous step.Click on Sheet 1 in the new workbook and observe that the changes made in the Chapter 1 workbook have been preserved.Within the Data pane, right-click on Profit Ratio and note that it is not directly editable.

Having completed the previous two exercises, let's discuss the most germane point; that is, metadata. Metadata is often defined as data about the data. In the preceding case, the data source name, default aggregation, default number formatting, and hierarchy are all examples of Tableau remembering changes made to the metadata. This is important because publishing a data connection allows for consistency across multiple Tableau authors. For example, if your company has a policy regarding the use of decimal points when displaying currency, that policy will be easily adhered to if all Tableau authors start building workbooks by pointing to data sources where all formatting has been predefined.

In the last exercise, the fact that the Profit Ratio calculated field was not directly editable when accessed via connecting to Tableau Server as a data source has important implications. Imagine the problems that would ensue if different Tableau authors defined Profit Ratio differently. End users would have no way of understanding what Profit Ratio truly means. However, by creating a workbook based on a published data source, the issue is alleviated. One version of Profit Ratio is defined and it can only be altered by changing the data source. This functionality can greatly improve consistency across the enterprise.

Connecting to saved data sources

Connecting to a saved data source on a local machine is much like connecting to a data source published on Tableau Server. Metadata definitions associated with the local data source are preserved, just like they are on Tableau Server. Of course, since the data source is local instead of remote, the publication process is different. Let's explore this via an exercise.

Exercise - creating a local data connection

In the workbook associated with this chapter, navigate to the tab entitled Local Data Connection.In the Data pane, right-click on the Superstore data source and select Add to Saved Data Sources.Using the resulting dialog box, save the data source as Superstore in My Tableau Repository | Datasources, located on your hard drive.Click on the Go to Start icon located in the top-left corner of your screen and observe the new saved data source:

Tip

Note that you can save a local data source that points to a published data source on Tableau Server. First, connect to a published data source on Tableau Server. Next, right-click on the data source in your workspace and choose Add to Saved Data Sources. Now you can connect to Tableau Server directly from your Start page!

Measure Names and Measure Values

I've observed the following scenario frequently. A new Tableau author creates a worksheet and drags a measure to the Text shelf. They would like to create another row to display a second measure but do not know how. They drag the second measure to various places on the view and get results that seem entirely unpredictable. The experience is very frustrating for the author, since it's so easy to accomplish this in Excel! The good news is that it's also easy to accomplish in Tableau. It just requires a different approach. Let's explore the solution via an exercise.

Measure Names and Measure Values - the basics

Measure Names and Measure Values are generated fields in Tableau. They do not exist in the underlying data, but they are indispensable for creating many kinds of views. As may be guessed from its placement in the Data pane and its name, Measure Names is a dimension whose members are made up of the names of each measure in the underlying dataset. Measure Values contains the numbers or values of each measure in the dataset. Watch what happens below when measure names and measure values are used independently. Afterward observe how they work elegantly together to create a view.

Exercise - Measure Names and Measure Values

In the workbook associated with this chapter, navigate to the worksheet entitled Measure Names/Values.Drag Measure Values to the Text shelf and observe the results:Clear the worksheet by clicking on the Clear Sheet icon on the toolbar:Drag Measure Names to the Rows shelf and observe that the view merely displays No Measure Value.Drag Measure Values to the Text shelf. Note the list of measures and associated values.

Perhaps the interrelationship between Measure Names and Measure Values is best explained by an analogy. Consider several pairs of socks and a partitioned sock drawer. Step 2 is the equivalent of throwing the socks into a pile. The results are, well, disorganized. Step 4 is the equivalent of an empty sock drawer with partitions. The partitions are all in place but where are the socks? Step 5 is a partitioned drawer full of nicely organized socks. Measure Names is like the partitioned sock drawer. Measure Values is like the socks. Independent of one another they are not of much use. Used together, they can be applied in many different ways.

Measure Names and Measure Values - shortcuts

Tableau provides various shortcuts to quickly create a desired visualization. If you are new to the software, this shortcut behavior may not seem intuitive, but with a little practice and a few pointers, you will quickly gain understanding. Let's use the following exercise to explore how you can use a shortcut to rapidly deploy Measure Names and Measure Values.

Exercise - Measure Names and Measure Values shortcuts

In the workbook associated with this chapter, navigate to the worksheet entitled MeasureNames/Values ShrtCts.Drag Sales directly on top of the Profit number in the view:Observe the results, including the appearance of the Measure Values shelf, the deployment of Measure Names on the Rows and Filters shelves, and Measure Values on the Text shelf:

Exercise commentary

Several things happened in step 2. After placing Sales on top of the Profit number in the view, Tableau did the following:

Deployed Measure Names on the Filters shelf:
Open the Measure Names filter and observe that only Profit and Sales are selected. This limits the view to display only those two measures.