Spinner logo OtasAI

OtasAI

Did You Know?


Data Visualization - Density Contour Chart

Introduction

The Density Contour Chart is a powerful visualization tool for representing the density of data points in a two-dimensional space. It is particularly useful for understanding the distribution and concentration of data within a dataset. OtasML, a visual machine learning tool, includes a Density Contour Chart feature within its data preparation model. This article outlines how to configure the Density Contour Chart feature for effective data visualization.

Configurations

The Density Contour Chart tool in OtasML offers various configuration options, allowing users to customize the visualization to meet their specific needs. Below are the key configurations and options available:

Subset

  • Default Value: None
  • Description: This option allows users to select specific columns for visualizing their data. By specifying the subset of columns, users can focus on the data values of interest, ensuring the density contour chart provides meaningful insights.

Height

  • Default Value: None
  • Description: This option allows users to provide a specific height value for the chart. Only integer values are allowed, and this helps maintain the aspect ratio of the chart, ensuring the visualization remains clear and proportional.

Color

  • Default Value: None
  • Description: This option allows users to specify the color of different elements in the chart. By customizing the colors, users can enhance the visual appearance of the chart, making it easier to distinguish between different data points.

Histfunc

  • Default Value: Count
  • Description: Specifies how the data should be aggregated and represented within each cell of the histogram-like grid that forms the basis of the density contour plot. Users can choose from several functions:
    • Count: Each cell represents the count of data points within that cell, indicating data density.
    • Sum: Each cell represents the sum of a specified numerical variable within that cell.
    • Avg: Each cell represents the average (mean) of a specified numerical variable within that cell.
    • Min: Each cell represents the minimum value of a specified numerical variable within that cell.
    • Max: Each cell represents the maximum value of a specified numerical variable within that cell.

Marginal X

  • Default Value: None
  • Description: Refers to additional one-dimensional density plots that show the distribution of data along the X-axis. These plots are displayed on the sides of the main two-dimensional density plot, providing insights into the distribution of the X variable.

Marginal Y

  • Default Value: None
  • Description: Refers to additional one-dimensional density plots that show the distribution of data along the Y-axis. These plots are displayed on the sides of the main two-dimensional density plot, providing insights into the distribution of the Y variable.

Histnorm

  • Default Value: None
  • Description: Allows users to control how the data is normalized or scaled when creating the contour plot based on a histogram-like representation. Users can choose from several normalization options:
    • Percent: Data is normalized to show the percentage of data points within each cell.
    • Probability: Data is normalized so that the integral over the entire range is 1, turning the histogram into a probability density function.
    • Density: Data is normalized to create a density estimate, where the integral over the entire range is the total number of data points.

X

  • Default Value: None
  • Description: The x-axis represents the values of the dataset. Users can define which values will be represented along the x-axis.

Y

  • Default Value: None
  • Description: The y-axis represents the cumulative proportion or count of data points that are less than or equal to each value on the x-axis. Users can define how the cumulative distribution will be represented.

Z

  • Default Value: None
  • Description: The Z values represent the estimated density of data points for each (x, y) coordinate pair. Higher Z values correspond to areas with higher data point density, while lower Z values correspond to areas with lower density.

Interactive Button: Preview

To enhance user experience and provide greater control over the Density Contour Chart visualization, the tool includes a Preview button:

  • Preview: This button allows users to see the effects of their configuration in real-time without permanently applying the changes. By clicking Preview, users can visually assess how the Density Contour Chart will appear based on the current configurations, ensuring that the visualization is appropriate before committing to any changes.

Conclusion

The Density Contour Chart tool in OtasML provides a comprehensive solution for visualizing data density in a two-dimensional space. By allowing users to select specific columns, customize colors, and define aggregation and normalization options, the tool offers flexibility and control over the data visualization process. The inclusion of an interactive Preview button further enhances the user experience, ensuring confidence in the Density Contour Chart configuration. OtasML continues to empower users with intuitive and effective tools, making data visualization a seamless and integral part of the machine learning workflow.

Tools

A+ A-

Version

1.1