Data Preparation

Collect, clean, and organize data for analysis. Apply operations and expressions to generate customized data.

Perform Data Preparation

Follow these steps:

  1. Configure a Data Source (for example, S3).

  2. Once the Data Source is configured and saved, you can view its design time data on the schema columns that are reflected in the Data Preparation window.

This allows you to build the application while interacting with the data.

  1. To access Data Preparation, click on the eye icon of any component. The component’s schema will be displayed in the Data Preparation window.

By default, the data appears in Summary View with the Profile Pane visible.

Data_Preparation


Actions Available in Data Preparation

Sorting

  • Sort: Sort the column entries either by count (ascending or descending) or by domain (alphabetically).

Transformations

  • Operations: Quickly apply the most popular transformations to column values.

    Operations that can be performed:

    • Filter

    • Transform

    • Missing Value Replacement

    • Analyze

    • Pivot

    • Group By

    • Rename Column

    • Create New Column

    • Remove Column

    • Enable PII Masking (Available on Data Sources)

Views & Data Interaction

  • Profile Pane: Displays the distribution of data in each column.

  • Data Pane: Represents the schema as columns divided into records.

Both views support operations on columns and allow editing column alias.

Create New Columns

Add a new column by specifying a name, expression, and values.

Keep/Remove Column

Select the columns to either keep or remove for further processing.

Display & Reload Options

  • Display Columns:

    • Choose which columns to display while working on transformations.

    • The columns will remain in the dataset schema but will not be shown in the data preparation window.

  • Reload Data for Preview: Reloads the data for the selected component for preview.

Export & Close

  • Download Result: Download the schema result for a specific component.

  • Close: Exit the transformation preview page and return to the canvas.


Visualize Binary Data

For the components that support reading binary data type, you can visualize the data.

On the data preparation page, for the column containing binary data, click the icon as shown in the image below:

visualize-binary-data

A new modal Window will open where you can visualize the binary data.

visualize-binary-data

Top