Varicent ELT Help Center

Undersample

Use the Undersample tool to address imbalanced data labels in classification problems. The tool reduces the majority classes to the size of the minority class.

Tip

Use the Undersample tool only if you have a sufficient amount of data. Undersampling a small data set can discard useful data.
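
As a rough illustration of the idea, the following Python sketch performs random undersampling on a small in-memory data set. It is not the Undersample tool's implementation, which this page does not describe; the function name `undersample`, the `churned` column, and the sample rows are all hypothetical. It assumes that each class is reduced to the size of the smallest class by randomly dropping rows.

```python
import random
from collections import defaultdict


def undersample(rows, target_column, seed=0):
    """Randomly drop rows so every class has as many rows as the smallest class.

    Hypothetical helper for illustration only; not the tool's actual logic.
    """
    rng = random.Random(seed)

    # Group rows by their label in the target column.
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[target_column]].append(row)

    # The minority class sets the size that every class is reduced to.
    minority_size = min(len(group) for group in by_label.values())

    balanced = []
    for group in by_label.values():
        # Sample without replacement, so no row is duplicated.
        balanced.extend(rng.sample(group, minority_size))
    return balanced


# Hypothetical data: 8 "no" rows (majority) and 2 "yes" rows (minority).
rows = [{"id": i, "churned": "no"} for i in range(8)]
rows += [{"id": i, "churned": "yes"} for i in range(8, 10)]

balanced = undersample(rows, target_column="churned")
print(len(rows), "->", len(balanced))  # prints: 10 -> 4
```

In this sketch, six of the eight majority rows are discarded to balance the labels. That is why undersampling is best suited to data sets large enough to absorb the loss.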

Configuration

Use the following steps and options to configure the Undersample tool.

Configuring the Undersample tool
  1. Go to the Pipes module from the side navigation bar.

  2. From the Pipes tab, click an existing pipe to open, or create a new pipe. To create a new pipe, read the Creating a pipe documentation.

  3. In the Pipe builder, add your data source.

  4. On the canvas toolbar, click + Tool.

  5. In the Tools modal search bar, type Undersample.

    Tip

    You can also find the Undersample tool in the Data section.

  6. Click + Add Tool.

  7. Connect the tool to your data set.

  8. In the configuration pane, enter the following information:

    Table 77. Undersample tool configuration

    Field            Description
    Target column    Select the column that contains the class labels to balance by undersampling.