Undersample
To address the imbalance in data labels, reduce majority classes to minority classes.
To address the imbalance in data labels, reduce majority classes to minority classes.
Tip
You can only use it if you have a sufficient amount of data. Undersampling small data sets can cause you to lose useful data.
When to use this tool
Use to address classification problems in your data.
Configuration
Use the following configuration options to help configure the Undersample tool.
In Varicent ELT, go to the Pipes module.
On the Pipes tab, find the pipe you want to work with. Click the pipe to open.
In your Pipe builder add your data source.
On the canvas toolbar, click
+ Tool.
In the Tools modal search bar, type Undersample.
Tip
You can also find the Undersample tool in the Data section.
Click + Add Tool.
Connect the tool to your data set.
In the configuration pane, enter the following information:
Table 61. Undersample tool configurationField
Description
Target column
Select the target column to use as an undersample of data.