Live vs Extract Connection: A Road to Tableau Desktop Specialist Certification

Chapter 4: An extensive guide to Live & Extract data connection, Logical Table & Physical Table in Tableau

Daksh Trehan
Towards AI

--

Welcome to the fourth chapter, In this piece, we are going to learn about data connections in Tableau and which one should you go for.

If you want to navigate through other chapters, visit: Tableau: What it is? Why it is the best?; A road to Tableau Desktop Specialist Certification.

If you want to directly go on Tableau Desktop Specialist notes, access them here → https://dakshtrehan.notion.site/Tableau-Notes-c13fceda97b94bda940edbf6751cf30

Use the link to get access to free Tableau certification dumps (Valid till 6 July 2022):

https://www.udemy.com/course/tableau-desktop-specialist-certification-dumps-2022/?couponCode=E3F08189FA4BAFAE72BC

When we connect our data, Tableau let us define two types of connections: Live & Extract.

Table of Content

  • Live Connection
  • Extract Connection
  • Selecting a better connection type for your data
  • Sample Exam Questions from this topic

Live Connection

As the name suggests, a Live connection is made when we want our data to keep flowing in our Tableau Data source from a dynamic database.

This offers flexibility when we want real-time updates and instantaneous insights.

Live data process queries in the source database.

By default, when we create a connection, it is a Live connection.

Single Cylinder represents Live Connection →

With Live connection, the data in our worksheet would be updated every time we re-open a worksheet or manually refreshes the data.

To refresh the live data →

Extract Connection

“Extracts” are one of the most underrated features of Tableau.

An Extract file have .hyper extension(previously referred as .tde)

An extract connection is made over tde. Tableau Data Extract(.tde/also referred to as hyper file) is a compressed snapshot of live data that is stored locally and loaded into memory. To ease off our connection and minimize the load due to complex data, we can apply filters to our data, save it as a .hyper file and later use them for our extract connection. This tends to be much faster than Live Connections and hence provides an edge when used for complex visualizations.

An extract process queries using Tableau Data Engine because a hyper file is so much optimized and efficient that Tableau in-memory Data engine can process queries easily.

An extract is only an extract for that particular workbook, and when it will be shared with others it will act as live data.

Double Cylinder represents Extract connection →

Extracts can’t be updated automatically, but can only be refreshed either manually or periodically.

Extracting Data in Tableau

Method 1

Step1 → Chose “Extract” Connection

Step2 → When we chose “Extract” Data, Tableau ask for the location of .hyper file.

Method 2

Step1 → Go to the Data pane and right-click on Data Connection and click on “Extract Data”

Step2 → Add filters and modify the extract according to your needs.

Refreshing an Extract in Tableau

Once an extract is made and loaded in our Workbook, there are two ways we can refresh our data:

The first option will refresh the connection, but since the connection is static, there won’t be any changes reflected in your data. This option is also present in Live connection and is more specifically to be used when we are working with Live data.

The second option will create a new snapshot of the data i.e. new extract and hence the data in our workbook would be updated.

To periodically refresh the data automatically, we can publish our data to Tableau Server and can choose the time interval for refreshing.

Tableau offers to refresh the data in two ways, we can either go for a full refresh, which replaces all the content in the extract or can use an incremental refresh, which only adds new rows.

(Logical)Single Table vs Multiple(Physical) Table Extraction

Logical Tables are extracted by default.

Only use Physical tables, when the extract constitutes multiple tables with equality joins and meet the following conditions:

  • No RAWSQL.
  • No Incremental refreshes.
  • No Sampling.
  • No Extract Filters.
  • Data Types of Columns used for relationship/joins are identical.
  • All joins between physical tables are equality joins.

Physical Tables should only be used when the size of our extract is larger than expected while Logical Tables must be used when we want to limit the amount of data in our extract using Filters or aggregations.

Selecting a better connection type for your data

Benefits of Live Connection

  • Allows working with real-time data and provides instantaneous insights.
  • This connection is suggested if you’re working with a rapid database.

Benefits of Extract Connection

  • This can be used if you want a portable dataset.
  • It helps us to limit the amount of data we need.
  • It improves performance and efficiency.
  • These are fast to create and work with.
  • These support additional functionalities such as the COUNTD function.

Disadvantages of Extract Connection

  • This connection is not suggested while working with sensitive data, as the extract is saved locally and hence can be shared with anyone.
  • This requires a manual refresh.

Sample Exam Questions from this Topic

Can we automatically refresh our Extract data?

  1. True
  2. False

Solution: True

Why should we not use Extract Data?

  1. They are slow.
  2. They might expose confidential data.
  3. They are difficult to use.
  4. They can’t be refreshed.

Solution: They might expose confidential data.

What is the extension for extracted data?

  1. .hyper
  2. .twb
  3. .tds
  4. .tbm

Solution: .hyper

By default, Tableau starts ____ connection.

  1. Live
  2. Extract

Solution: Live

Which connection should we use for real-time data?

  1. Live
  2. Extract

Solution: Live

Use the link to get access to free Tableau certification dumps (Valid till 6 July 2022):

https://www.udemy.com/course/tableau-desktop-specialist-certification-dumps-2022/?couponCode=E3F08189FA4BAFAE72BC

References:

[1] Tableau Help | Tableau Software

[2] Personal Notes

[3]Tableau Desktop Specialist Exam (New Pattern — 2021) — Apisero

Thanks for Reading!

Feel free to give claps so I know how helpful this post was for you, and share it on your social networks, this would be very helpful for me.

If you like this article and want to learn more about Machine Learning, Data Science, Python, BI. Please consider subscribing to my newsletter:

Daksh Trehan’s Newsletter.

Find me on Web: www.dakshtrehan.com

Connect with me at LinkedIn: www.linkedin.com/in/dakshtrehan

Read my Tech blogs: www.dakshtrehan.medium.com

Connect with me at Instagram: www.instagram.com/_daksh_trehan_

--

--