Oversample
Generate more data or address classification problems in your data.
To address the imbalance in data, supplement the data with multiple copies of some of the minority classes.
A new column is added to indicate that Oversample generated the record.
When to use this tool
Use the Oversample tool to generate more data or to address classification problems in your data.
Configuration
Use the following configuration options to help configure the Oversample tool.
In Varicent ELT, go to the Pipes module.
On the Pipes tab, find the pipe you want to work with. Click the pipe to open.
In your Pipe builder add your data source.
On the canvas toolbar, click
+ Tool.
In the Tools modal search bar, type Oversample.
Tip
You can also find the Oversample tool in the Data section.
Click + Add Tool.
Connect the tool to your data set.
In the configuration pane, enter the following information:
Table 59. Oversample tool configurationField
Description
Target column
Select the column to use for the target column.
Oversample column
Enter the name to use for the oversample column.