NewDataLabs

ETL – what possibilities does it give us?

2020-11-05

Nowadays, every object that surrounds us, that we use or want to buy, leaves a trace in the form of data. At different stages, this trace can be saved in different ways and in different databases.

To carry out a full analysis, we first need to extract the relevant data from its source. But this is just the beginning: different sources mean different standards and different formats. Data must be consistent and homogeneous to give a clear and true message. In addition, every company wants to learn different things from its data.

That is why it is so important to prepare the data according to the needs, requirements and standards of each client individually. This is the transformation stage, which makes the data compatible so that it can be analyzed as a whole. Finally, the extracted and transformed data must be loaded into the target database, which will be the basis of our analysis.

What possibilities does the ETL tool give us?
  • Collecting data from different sources (different types of databases) and different departments of the company (production, logistics, sales, accounting), and building data warehouses from them
  • Importing data from old, no longer used analytical systems into new ones
  • Integrating company data after acquisitions, mergers or purchases, i.e. all activities related to the development and growth of the company
  • Linking our data with the databases of suppliers and business partners

The applications of an ETL tool go beyond these four typical examples, but they clearly show how important ETL is in everyday business operations. Imagine how laborious working with data would be if we had to be experts on every data source processed in our company, and if every processing, search and cleansing step had to be done by hand, every day. How much work, time and resources would such an approach waste, and would it be worth it? Especially when tools exist that let us spend that effort far more efficiently and effectively.

    [Image: Tableau Prep interface]

    Extraction

    The extraction process may seem to be the easiest of the data preparation steps, but it is not. At this stage, many problems arise with access to data and with the compatibility of source and target data formats. For data that is easily accessible, the extraction procedures will of course be simple; for data that is not directly accessible, they will be correspondingly more complicated.

    Source data of different types are usually organized into some structure. It is important that the extraction procedures already pre-select the data, because extracting redundant or insignificant data adversely affects the efficiency of the ETL process and of the data warehouse being created.
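The idea of pre-selecting data during extraction can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the `orders` table, its columns and the cut-off date are hypothetical, and an in-memory SQLite database stands in for a real source system.

```python
import sqlite3

def extract_orders(conn, since):
    """Extract only the columns and rows the analysis needs."""
    query = (
        "SELECT order_id, customer_id, amount, order_date "
        "FROM orders WHERE order_date >= ? ORDER BY order_id"
    )
    return conn.execute(query, (since,)).fetchall()

# Hypothetical source database, built in memory for illustration only.
source = sqlite3.connect(":memory:")
source.execute(
    "CREATE TABLE orders (order_id INTEGER, customer_id INTEGER, "
    "amount REAL, order_date TEXT, internal_note TEXT)"
)
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?, ?)",
    [
        (1, 10, 99.5, "2019-12-30", "legacy"),
        (2, 11, 20.0, "2020-03-14", "ok"),
        (3, 10, 45.0, "2020-10-01", "ok"),
    ],
)

# Old rows and the irrelevant internal_note column never leave the source.
rows = extract_orders(source, "2020-01-01")
```

Filtering in the query itself, rather than after the data has been copied, keeps the amount of transferred and stored data small.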

    Transformation

    During the transformation process, we check whether the records from the source databases are correct, need to be cleaned, or are unusable. This keeps the target database tidy and prevents it from filling up with junk data.

    The simplest conversion procedures convert numerical data into text and vice versa, fill in empty values, change formats (numbers and dates), and change values, e.g. by converting units. Transformation also allows us to unify values and maintain the integrity of the data being prepared.
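These simple conversions can be sketched in a few lines of Python. The record layout (`quantity`, `weight_lb`, `order_date`), the default value of 0 and the source date format are assumptions made for this example only.

```python
from datetime import datetime

LB_TO_KG = 0.45359237  # exact definition of the international pound

def transform_record(raw):
    """Apply simple conversions to one raw record (a dict of strings)."""
    # Text to number, with an empty value filled in by a default of 0.
    quantity = int(raw["quantity"]) if raw["quantity"].strip() else 0
    # Unit conversion: pounds to kilograms.
    weight_kg = round(float(raw["weight_lb"]) * LB_TO_KG, 3)
    # Date format change: DD.MM.YYYY to ISO 8601.
    order_date = datetime.strptime(raw["order_date"], "%d.%m.%Y").date().isoformat()
    return {"quantity": quantity, "weight_kg": weight_kg, "order_date": order_date}

record = transform_record(
    {"quantity": " ", "weight_lb": "10", "order_date": "05.11.2020"}
)
```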

    Before the data can reach the data warehouse, it must be cleaned, integrated and mapped. This includes:

    • detecting and, where possible, correcting spelling and dictionary errors, e.g. a misspelled city name when the full list of cities is available in the database,
    • validating formats, e.g. checking that telephone numbers have the right number of digits,
    • detecting and removing inconsistencies between the name of an attribute and its content, e.g. a customer’s first and last name entered in the ‘Company name’ field,
    • normalizing values, e.g. replacing empty strings and strings of spaces with NULL,
    • completing data from external information sources, e.g. filling in missing postal codes based on addresses.
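Three of the cleaning steps listed above can be sketched with the Python standard library. This is a rough illustration, not a complete data quality solution: the list of known cities, the similarity cut-off of 0.8 and the nine-digit phone rule are all assumptions chosen for the example.

```python
import difflib
import re

# Hypothetical reference list; in practice it would come from the database.
KNOWN_CITIES = ["Wrocław", "Warszawa", "Kraków", "Gdańsk"]

def normalize_blank(value):
    """Replace empty strings and strings of spaces with None (NULL)."""
    if value is None or value.strip() == "":
        return None
    return value.strip()

def fix_city(value):
    """Correct a misspelled city name if a close match is known."""
    matches = difflib.get_close_matches(value, KNOWN_CITIES, n=1, cutoff=0.8)
    return matches[0] if matches else value

def is_valid_phone(value, digits=9):
    """Check that a phone number contains the expected number of digits."""
    return len(re.sub(r"\D", "", value)) == digits

cleaned = {
    "company": normalize_blank("   "),
    "city": fix_city("Wroclaw"),
    "phone_ok": is_valid_phone("123-456-789"),
}
```

Values that cannot be corrected automatically, such as a city with no close match, are left unchanged here so that they can be reviewed rather than silently altered.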

    Data Loading

    Loading is the last and most important step of the ETL process. It places the previously extracted and transformed data in the data warehouse.
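A loading step might look like the following Python sketch. It assumes a hypothetical `sales` table and uses an in-memory SQLite database in place of a real data warehouse; the whole batch is written in a single transaction so that a failed load leaves no partial data behind.

```python
import sqlite3

def load_rows(conn, rows):
    """Load transformed rows into the target table inside one transaction."""
    with conn:  # commits on success, rolls back if an exception is raised
        conn.execute(
            "CREATE TABLE IF NOT EXISTS sales "
            "(order_id INTEGER PRIMARY KEY, amount REAL, order_date TEXT)"
        )
        # INSERT OR REPLACE makes a repeated load update existing orders
        # instead of failing on the primary key.
        conn.executemany("INSERT OR REPLACE INTO sales VALUES (?, ?, ?)", rows)

warehouse = sqlite3.connect(":memory:")  # stand-in for the target warehouse
load_rows(warehouse, [(2, 20.0, "2020-03-14"), (3, 45.0, "2020-10-01")])
load_rows(warehouse, [(3, 50.0, "2020-10-01")])  # reloading order 3 updates it
total = warehouse.execute("SELECT COUNT(*), SUM(amount) FROM sales").fetchone()
```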

    Use of ETL and impact on business

    Data, together with its availability and real-time analysis, has become the basis of modern business. How you collect, process, combine, store, visualize and analyze your data is now more important than ever before.

    Whether you’re looking for data from databases, streaming services, files or other sources, choosing the right toolkit is crucial. A modern ETL solution, designed and built for today’s business environment where data is processed in real time, may be the best solution for your business.

    Contrary to appearances, the choice of an ETL tool is one of the most important steps in data analysis. How the data is prepared and processed, the final shape it takes, and the time these processes require will always be the foundation of our analyses. The ETL tool will be at the heart of our data retrieval, processing and delivery system. Therefore, its versatility should be one of the key criteria when choosing a specific solution.

    How to select an ETL tool?

    We can distinguish five areas to take into account:

    • Connecting to data: the ETL tool should be able to communicate with any data source, regardless of its origin (databases, streaming services, files).
    • Efficiency: transferring and modifying data requires adequate computing power, so an ETL tool should be able to cope with the growing volume of our data in the future.
    • Transformation flexibility: matching, merging and changing data is crucial for transforming it, and most of these operations should be possible with simple drag and drop.
    • Data quality: the data we start working with is not yet cleaned or structured. It will only reveal its secrets to us once it is homogeneous and ordered.
    • Flexible data collection options: an ETL tool should let you decide whether to refresh data at intervals, and whether to refresh only the latest data, the whole database, or only a specific part of it.
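The difference between refreshing the whole database and refreshing only the latest data can be sketched as follows. The `updated_at` field, the mode names and the stored timestamp of the previous load are hypothetical; real ETL tools expose this choice through their own settings.

```python
def select_rows_to_refresh(rows, mode, last_loaded=None):
    """Pick rows to reload: everything, or only rows newer than the last load."""
    if mode == "full":
        return list(rows)
    if mode == "incremental":
        # ISO 8601 date strings compare correctly as plain text.
        return [
            r for r in rows
            if last_loaded is None or r["updated_at"] > last_loaded
        ]
    raise ValueError(f"unknown refresh mode: {mode}")

source_rows = [
    {"id": 1, "updated_at": "2020-10-01"},
    {"id": 2, "updated_at": "2020-11-03"},
    {"id": 3, "updated_at": "2020-11-05"},
]

full = select_rows_to_refresh(source_rows, "full")
latest = select_rows_to_refresh(source_rows, "incremental", last_loaded="2020-11-01")
```

An incremental refresh moves less data per run, while a full refresh is simpler and also picks up changes made to older records.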

    There are many ready-made ETL solutions on the market, such as SAS ETL Studio, Alteryx, Oracle Warehouse Builder and Pentaho Data Integration. They all provide a wide range of functionality and connectivity to a variety of data sources. Implementing them is usually not a problem, but the purchase cost can be a hard barrier for companies without a sufficiently large budget.

    This makes Tableau Prep all the more interesting, especially if we plan to use Tableau as our main Business Intelligence tool. Tableau Prep is included in the Tableau Creator license and therefore entails no additional cost. It offers useful ways to analyze the metadata (i.e. data about the data) and to transform, group, aggregate and cleanse the data itself before it is passed to Tableau. The ETL process can be designed in a simple, graphical way, and the results of individual transformations can be checked in real time. The output is a ready-made Tableau extract or a CSV file.

    Importantly, once a process (a so-called Tableau flow, .tfl) has been defined, it can later be run on a defined schedule or on demand.

    Why use ETL?

    ETL processes have a history of almost 50 years, and throughout that time companies have relied on ETL for a consolidated view of their data. With ETL, organizations can analyze data stored in multiple locations and formats and use it to make faster and better business decisions.
