## Gestalt Principles

Gestalt principles refer to the guiding principles of how people interpret and perceive what they see. These principles can be used in the design of effective data visualizations. The principles generally describe how people define order and meaning in things that they see. We will limit our discussion to the four Gestalt principles that are most closely related to the design of data visualizations: similarity, proximity, enclosure, and connection. An understanding of these principles can help in creating more effective data visualizations and help differentiate between clutter and meaningful design in data visualizations.

The Gestalt principle of similarity states that people consider objects with similar characteristics as belonging to the same group. These characteristics could be color, shape, size, orientation, or any preattentive attribute. When a data visualization includes objects with similar characteristics, it is important to understand that this communicates to the audience that these objects should be seen as belonging to the same group. Figure 3.16 is a portion of what was shown in Figure 3.10, but here we are using it to represent the Gestalt principle of similarity. The audience will perceive objects that are the same color, or same shape, as belonging to the same group. We need to understand this when we design a visualization and make sure that we only use similar characteristics for objects when they belong to the same group.

## Proximity

The Gestalt principle of proximity states that people consider objects that are physically close to one another as belonging to a group. People will generally seek to collect objects that are near each other into a group and separate objects that are far from one another into different groups. The principle of proximity is apparent in many data visualization charts, including scatter charts.

Consider a firm that would like to perform a market segmentation analysis of its customers to learn more about the customers who purchase its products. The company has collected data on the ages and annual incomes of its customers. A simple scatter chart of the age and income of customers is shown in Figure 3.17. Here, our natural inclination is to view this as three distinct groups of customers based on the proximity of the points. This is an example of the Gestalt principle of proximity.
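The grouping that the eye performs here can be imitated with a small sketch: link any two points that lie within some distance of each other, and treat each connected set of points as one group. The customer values, the `max_gap` threshold, and the function name `group_by_proximity` below are all hypothetical and chosen only for illustration; they are not the data behind Figure 3.17.

```python
from math import dist

def group_by_proximity(points, max_gap):
    """Link points whose distance is at most max_gap; return the connected groups."""
    parent = list(range(len(points)))  # each point starts as its own group

    def find(i):
        # Follow parent links to the representative of i's group.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if dist(points[i], points[j]) <= max_gap:
                parent[find(i)] = find(j)  # merge the two groups

    groups = {}
    for i, point in enumerate(points):
        groups.setdefault(find(i), []).append(point)
    return list(groups.values())

# Hypothetical customers: (age in years, annual income in $1000s).
customers = [
    (24, 30), (26, 34), (28, 31),
    (41, 62), (43, 66), (45, 60),
    (60, 95), (62, 99), (64, 93),
]

groups = group_by_proximity(customers, max_gap=10)
for group in groups:
    print(group)
```

Note that a viewer judges proximity by distance on the page, so the axis scales of the chart affect which points appear close; the raw-unit distance used here is only a stand-in for that.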

The Gestalt principle of enclosure states that objects that are physically enclosed together are seen as belonging to the same group. We can illustrate this principle using two modified versions of Figure 3.17. First, we can simply reinforce the grouping already suggested by proximity by creating an enclosure around the points that are close together (see Figure 3.18a). Alternatively, suppose that there is a third attribute of the customers, other than annual income and age, that can be used to group these customers, such as educational background. If we want to visually indicate which customers have similar educational backgrounds, then we can use the principle of enclosure to do so even when those customers do not appear close together in the chart. This is shown in Figure 3.18b. Note that the enclosure can be indicated in multiple ways in a chart. In Figure 3.18a we have used shaded areas to enclose points. In Figure 3.18b we have used dashed boxes. In general, we only need to create a suggestion of enclosure for the audience to view the objects being enclosed as members of the same group.
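As a rough sketch of how an enclosure might be computed before it is drawn, the snippet below finds a padded rectangle around each group of points. The segment names, point values, and the helper `enclosing_box` are hypothetical; a charting tool would then render each rectangle as a shaded area or a dashed outline.

```python
def enclosing_box(points, padding=1):
    """Return (x_min, y_min, x_max, y_max) of a padded rectangle around points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs) - padding, min(ys) - padding,
            max(xs) + padding, max(ys) + padding)

# Hypothetical customer segments: (age in years, annual income in $1000s).
segments = {
    "segment A": [(24, 30), (26, 34), (28, 31)],
    "segment B": [(41, 62), (43, 66), (45, 60)],
}

# One rectangle per segment; the padding keeps markers from touching the border.
boxes = {name: enclosing_box(pts, padding=2) for name, pts in segments.items()}
for name, box in boxes.items():
    print(name, box)
```

A rectangle works well when the enclosed points are already near one another. When group members are scattered among other points, a single rectangle would also surround non-members, so separate smaller enclosures are a safer choice.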

## Data-Ink Ratio

The concepts of preattentive attributes and Gestalt principles are valuable in understanding features that can be used to visualize data and how visualizations are processed by the mind. However, it is easy to overuse any of the features and diminish the effectiveness of the feature to differentiate and draw attention. A guiding principle for effective data visualizations is that the table or graph should illustrate the data to help the audience generate insights and understanding. The table or graph should not be so cluttered as to disguise the data or be difficult to interpret.

A common way of thinking about this principle is the idea of maximizing the data-ink ratio. The data-ink ratio measures the proportion of “data-ink” to the total amount of ink used in a table or chart, where data-ink is the ink used that is necessary to convey the meaning of the data to the audience. Non-data-ink is ink used in a table or chart that serves no useful purpose in conveying the data to the audience. Note in Figure 3.11a that the pie chart uses color and a legend to differentiate between the eight managers. The bar chart in this figure communicates the same information without either of these features, and so has a higher data-ink ratio.
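The definition above can be written compactly as a fraction, using only the quantities already named in the text:

$$
\text{data-ink ratio} = \frac{\text{data-ink}}{\text{total ink used in the table or chart}}
$$

Because data-ink is part of the total ink, the ratio lies between 0 and 1. Removing non-data-ink shrinks the denominator while leaving the numerator unchanged, which moves the ratio toward 1.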

Let us consider the case of Diaphanous Industries, a firm that produces fine silk clothing products. Diaphanous is interested in tracking the sales of one of its most popular items, a particular style of scarf. Table 3.1 and Figure 3.20 provide examples of a table and chart with low data-ink ratios used to display sales of this style of scarf. The data used in this table and figure represent product sales by day. Both of these examples are similar to tables and charts generated with Excel using common default settings. In Table 3.1, most of the gridlines serve no useful purpose. Likewise, in Figure 3.20, the gridlines in the chart add little additional information. In both cases, most of these lines can be deleted without reducing the information conveyed. However, an important piece of information is missing from Figure 3.20: titles for axes. Generally, axes should always be labeled in a chart. There are rare exceptions to this where both the meaning and unit of measure are obvious, such as when the axis displays the names of months (i.e., “January,” “February,” “March,” etc.). For most charts, we recommend labeling the axes to avoid the possibility of misinterpretation by the audience and to reduce the cognitive load required by the audience.

