{"id":2558,"date":"2023-11-28T15:56:22","date_gmt":"2023-11-28T14:56:22","guid":{"rendered":"https:\/\/newdatalabs.com\/en\/?p=2558"},"modified":"2023-12-14T11:58:19","modified_gmt":"2023-12-14T10:58:19","slug":"visualizing-the-distribution-of-data","status":"publish","type":"post","link":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/","title":{"rendered":"Visualizing the Distribution of Data"},"content":{"rendered":"\n<p>Visualizing the Distribution of Data. At the early stage of analysis, it is vital to know the data with which you are working. Are these numerical or descriptive values? Which variables are important? And how are these values distributed? The last element is key in understanding your data, as it helps find answers to the essential questions \u2013 What is the range of variables? How are they distributed? Is their distribution symmetrical? Are there any dominant characteristics or significant outliers? The answers to the above questions will allow you to build knowledge about the data you are working with. How to facilitate the understanding of the distribution of variables using data visualization tools?<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Using a histogram to analyze numerical variables<\/h2>\n\n\n\n<p><\/p>\n\n\n\n<p><a href=\"https:\/\/newdatalabs.com\/histogram-tajemnice\/\"><strong>Histogram<\/strong><\/a>&nbsp;is a specific bar chart, where the Y-axis counts the values, and the X-axis counts the categories or ranges of the numeral value. This type of chart can be easily created in Tableau \u2013 all you need to do is move the required measure to your chart to be able to see the option <em>Histogram<\/em> in <em>Show me<\/em>:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"956\" height=\"664\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e1.png\" alt=\"\" class=\"wp-image-2562 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e1.png 956w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e1-300x208.png 300w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e1-768x533.png 768w\" data-sizes=\"(max-width: 956px) 100vw, 956px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 956px; --smush-placeholder-aspect-ratio: 956\/664;\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>As is always the case, the chart created by <em>Show me<\/em> requires further processing. Let\u2019s start with the interpretation of the chart. The distribution is symmetrical; however, it is slightly outstretched on both sides. This indicates that there are outliers which hinder the actual analysis. At the same time, they are so small that they cannot be identified on the chart. Let\u2019s try to remove these values by limiting the scope of the <em>Profit<\/em> variable:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"950\" height=\"664\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e2.png\" alt=\"\" class=\"wp-image-2563 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e2.png 950w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e2-300x210.png 300w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e2-768x537.png 768w\" data-sizes=\"(max-width: 950px) 100vw, 950px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 950px; --smush-placeholder-aspect-ratio: 950\/664;\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>As you can clearly see, most of the values fall in the range of 0 to 100. You can reduce the size of <em>bin<\/em> on the X-axis to better understand the distribution of variables in the range close to 0:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"768\" height=\"534\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e3-768x534-1.png\" alt=\"\" class=\"wp-image-2564 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e3-768x534-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e3-768x534-1-300x209.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/534;\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>It is also possible to analyze the histogram as a <em>line chart<\/em> or <em>area chart<\/em>, by applying another axis with a line or area chart:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e4-768x534-1.png\"><img decoding=\"async\" width=\"768\" height=\"534\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e4-768x534-1.png\" alt=\"\" class=\"wp-image-2565 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e4-768x534-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e4-768x534-1-300x209.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/534;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The distribution of variables for categories<\/h2>\n\n\n\n<p><\/p>\n\n\n\n<p>Let\u2019s try to analyze the distribution to get a better understanding of the categories. For this purpose, we will assume that we want to see the distribution of profit according to such categories as <em>Region<\/em> and <em>State<\/em>:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"768\" height=\"532\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e5-768x532-1.png\" alt=\"\" class=\"wp-image-2566 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e5-768x532-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e5-768x532-1-300x208.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/532;\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>The problem of this approach is that the points overlap each other, making it more difficult to identify where the data is centred. You can resolve this issue by reducing <em>Opacity<\/em>:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e6-768x534-1.png\"><img decoding=\"async\" width=\"768\" height=\"534\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e6-768x534-1.png\" alt=\"\" class=\"wp-image-2567 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e6-768x534-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e6-768x534-1-300x209.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/534;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>If the clarity is still not good enough, you can change the above <em>dot plot<\/em> to a <em>jitter plot<\/em>, by adding random distribution of the data points in the category:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e7-768x531-1.png\"><img decoding=\"async\" width=\"768\" height=\"531\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e7-768x531-1.png\" alt=\"\" class=\"wp-image-2568 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e7-768x531-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e7-768x531-1-300x207.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/531;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>This way you can easily identify the data points, as overlapping is now limited. However, this can give rise to questions, such as what is shown on the Y-axis. Users will usually try to seek the additional meaning of the vertical position of a given data point, whereas it is a random and unsignificant number.<br><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">If there are a lot of points \u2013 use a Box Plot<\/h2>\n\n\n\n<p><\/p>\n\n\n\n<p>If you had a larger number of data points, it would be hard to make any conclusions based on the <em>dot plot,<\/em> or <em>jitter plot<\/em>. In this case, it is better to use a <em>Box Plot<\/em> \u2013 it is a visualization technique that allows to depict the statistics of a set of points:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e8-768x533-1.png\"><img decoding=\"async\" width=\"768\" height=\"533\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e8-768x533-1.png\" alt=\"\" class=\"wp-image-2569 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e8-768x533-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e8-768x533-1-300x208.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/533;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>How to interpret the box plot? The main point is the median, from which the 25<sup>th<\/sup> and 75<sup>th<\/sup> percentiles are measured. The area between them forms a box, and the line in the middle, which divides the box into two parts, is the median. Another elements are whiskers, which show the lowest and the highest values in the dataset, however their distance is not greater than 1.5 times the range between the 25<sup>th<\/sup> and 75<sup>th<\/sup> percentiles (it is called the quartile range, which is the width of the box). All values except for the whiskers are considered as the <em>outliers<\/em>:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e9-768x533-1.png\"><img decoding=\"async\" width=\"768\" height=\"533\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e9-768x533-1.png\" alt=\"\" class=\"wp-image-2570 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e9-768x533-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e9-768x533-1-300x208.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/533;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><em>A Box plot<\/em>\u00a0is a useful technique for visualising datasets. However, it\u2019s important to bear in mind that not all users know how to interpret it. Therefore, when working with this chart you should use appropriate labels to avoid misinterpretation.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The distribution of variables according to several categories<\/h2>\n\n\n\n<p><\/p>\n\n\n\n<p>Up to now, we have been visualising one numerical variable, and according to one dimension. However, if you need to analyze the distribution according to two categories, you can use a <em>heat map<\/em>, or &#8211; in other words &#8211; a <em>highlight table<\/em>. Extending the table by adding bar charts on the sides will allow you to analyze another aspect by comparing the accumulated values:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full is-resized\"><img decoding=\"async\" width=\"588\" height=\"786\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e10.png\" alt=\"\" class=\"wp-image-2571 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e10.png 588w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e10-224x300.png 224w\" data-sizes=\"(max-width: 588px) 100vw, 588px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 588px; --smush-placeholder-aspect-ratio: 588\/786;\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>It is a very useful way to expand the tables which are the most popular among users,\u00a0 with additional visual aspects.<\/p>\n\n\n\n<p>Another approach is to analyze the distribution of two numeral variables at the same time. In the case of a large number of data points, it allows to view how they are distributed. In the below example, a <em>scatter plot<\/em> is the main chart, and it has been extended with <em>marginal histograms<\/em>:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e11-1-768x612-1.png\"><img decoding=\"async\" width=\"768\" height=\"612\" data-src=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e11-1-768x612-1.png\" alt=\"\" class=\"wp-image-2561 lazyload\" data-srcset=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e11-1-768x612-1.png 768w, https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/e11-1-768x612-1-300x239.png 300w\" data-sizes=\"(max-width: 768px) 100vw, 768px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 768px; --smush-placeholder-aspect-ratio: 768\/612;\" \/><\/a><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Visualization of distribution is always a simplification<\/h2>\n\n\n\n<p><\/p>\n\n\n\n<p>The distribution of data provides the summary information such as what ranges of values it includes, what their variability is like, whether the distribution is symmetric, which values are the most common, and whether there are any outliers. Due to a large number of data points, such aggregative approach is usually the most suitable, since showing all observations may be overwhelming and unreadable to the users. The above methods don\u2019t cover all of the visualization techniques available, however, &nbsp;they are the most practical. They are also the easiest to prepare and understand. Apart from these tools, there are also such charts as a <em>beeswarm<\/em> \u2013 similar to a <em>jitter plot<\/em>, however here the data points are distributed evenly. A <em>violin chart<\/em> is another option which enables you to create a histogram by combining the <em>area chart<\/em> with the <em>box plot<\/em>. As you can see, there are various options available, but the best approach is to rely on simplicity and usability, while keeping in mind the needs of the end users of your visualizations.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>Mateusz Karmalski Tableau Author<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>Tableau Author<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Visualizing the Distribution of Data. At the early stage of analysis, it is vital to know the data with which you are working. Are these numerical or descriptive values? Which variables are important? And how are these values distributed? The last element is key in understanding your data, as it helps find answers to the [&hellip;]<\/p>\n","protected":false},"author":5,"featured_media":2573,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"content-sidebar","footnotes":""},"categories":[9,5],"tags":[],"class_list":{"0":"post-2558","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-blog","8":"category-tableau","9":"entry"},"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.8 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Visualizing the Distribution of Data - NewDataLabs<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Visualizing the Distribution of Data - NewDataLabs\" \/>\n<meta property=\"og:description\" content=\"Visualizing the Distribution of Data. At the early stage of analysis, it is vital to know the data with which you are working. Are these numerical or descriptive values? Which variables are important? And how are these values distributed? The last element is key in understanding your data, as it helps find answers to the [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\" \/>\n<meta property=\"og:site_name\" content=\"NewDataLabs\" \/>\n<meta property=\"article:published_time\" content=\"2023-11-28T14:56:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-12-14T10:58:19+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png\" \/>\n\t<meta property=\"og:image:width\" content=\"360\" \/>\n\t<meta property=\"og:image:height\" content=\"216\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Gosia\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Gosia\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\"},\"author\":{\"name\":\"Gosia\",\"@id\":\"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7\"},\"headline\":\"Visualizing the Distribution of Data\",\"datePublished\":\"2023-11-28T14:56:22+00:00\",\"dateModified\":\"2023-12-14T10:58:19+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\"},\"wordCount\":1017,\"commentCount\":0,\"image\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png\",\"articleSection\":[\"Blog\",\"Tableau\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\",\"url\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\",\"name\":\"Visualizing the Distribution of Data - NewDataLabs\",\"isPartOf\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png\",\"datePublished\":\"2023-11-28T14:56:22+00:00\",\"dateModified\":\"2023-12-14T10:58:19+00:00\",\"author\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7\"},\"breadcrumb\":{\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage\",\"url\":\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png\",\"contentUrl\":\"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png\",\"width\":360,\"height\":216},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Strona g\u0142\u00f3wna\",\"item\":\"https:\/\/newdatalabs.com\/en\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Visualizing the Distribution of Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/newdatalabs.com\/en\/#website\",\"url\":\"https:\/\/newdatalabs.com\/en\/\",\"name\":\"NewDataLabs\",\"description\":\"Tableau - Business Intelligence Tools\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/newdatalabs.com\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7\",\"name\":\"Gosia\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/445673971de6d34fc47aeb482636a72b51e216f3e84ffb2d993c8f129d0ffd79?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/445673971de6d34fc47aeb482636a72b51e216f3e84ffb2d993c8f129d0ffd79?s=96&d=mm&r=g\",\"caption\":\"Gosia\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Visualizing the Distribution of Data - NewDataLabs","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/","og_locale":"en_US","og_type":"article","og_title":"Visualizing the Distribution of Data - NewDataLabs","og_description":"Visualizing the Distribution of Data. At the early stage of analysis, it is vital to know the data with which you are working. Are these numerical or descriptive values? Which variables are important? And how are these values distributed? The last element is key in understanding your data, as it helps find answers to the [&hellip;]","og_url":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/","og_site_name":"NewDataLabs","article_published_time":"2023-11-28T14:56:22+00:00","article_modified_time":"2023-12-14T10:58:19+00:00","og_image":[{"width":360,"height":216,"url":"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png","type":"image\/png"}],"author":"Gosia","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Gosia","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#article","isPartOf":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/"},"author":{"name":"Gosia","@id":"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7"},"headline":"Visualizing the Distribution of Data","datePublished":"2023-11-28T14:56:22+00:00","dateModified":"2023-12-14T10:58:19+00:00","mainEntityOfPage":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/"},"wordCount":1017,"commentCount":0,"image":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage"},"thumbnailUrl":"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png","articleSection":["Blog","Tableau"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/","url":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/","name":"Visualizing the Distribution of Data - NewDataLabs","isPartOf":{"@id":"https:\/\/newdatalabs.com\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage"},"image":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage"},"thumbnailUrl":"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png","datePublished":"2023-11-28T14:56:22+00:00","dateModified":"2023-12-14T10:58:19+00:00","author":{"@id":"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7"},"breadcrumb":{"@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#primaryimage","url":"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png","contentUrl":"https:\/\/newdatalabs.com\/en\/wp-content\/uploads\/2023\/11\/blog.png","width":360,"height":216},{"@type":"BreadcrumbList","@id":"https:\/\/newdatalabs.com\/en\/visualizing-the-distribution-of-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Strona g\u0142\u00f3wna","item":"https:\/\/newdatalabs.com\/en\/"},{"@type":"ListItem","position":2,"name":"Visualizing the Distribution of Data"}]},{"@type":"WebSite","@id":"https:\/\/newdatalabs.com\/en\/#website","url":"https:\/\/newdatalabs.com\/en\/","name":"NewDataLabs","description":"Tableau - Business Intelligence Tools","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/newdatalabs.com\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/e72471586cc4c4d811ff645bc4ae64f7","name":"Gosia","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/newdatalabs.com\/en\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/445673971de6d34fc47aeb482636a72b51e216f3e84ffb2d993c8f129d0ffd79?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/445673971de6d34fc47aeb482636a72b51e216f3e84ffb2d993c8f129d0ffd79?s=96&d=mm&r=g","caption":"Gosia"}}]}},"_links":{"self":[{"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/posts\/2558","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/comments?post=2558"}],"version-history":[{"count":4,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/posts\/2558\/revisions"}],"predecessor-version":[{"id":2598,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/posts\/2558\/revisions\/2598"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/media\/2573"}],"wp:attachment":[{"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/media?parent=2558"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/categories?post=2558"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newdatalabs.com\/en\/wp-json\/wp\/v2\/tags?post=2558"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}