Split
Randomly divide a data set into two smaller sets, sized by the percentage you specify. Use the Split tool when you want to work with smaller subsets of your data.
Input and output
To use this tool, you need an existing data set.
When you run the tool, it splits the data, giving you two output nodes to further transform your data.
Configuration
Follow these steps to add the Split tool to a pipe and configure it.
Go to the Pipes module from the side navigation bar.
From the Pipes tab, click an existing pipe to open, or create a new pipe. To create a new pipe, read the Creating a pipe documentation.
In the Pipe builder, add a data source to your pipe. For more information on adding a data source, see the Data Input tool.
Click +Tool. The Tools modal opens, where you can add tools, such as the Aggregate tool, to your pipe.
In the Tools modal, search for Split. Click + Add tool.
Tip: You can also find the Split tool in the Organize section.
Click the tool node and drag the line to the next tool to connect the tools. If you need to undo the action, click the line and then click Unlink.
In the configuration pane, enter the Top output percentage, or click the arrows to increase or decrease the number. The bottom output automatically receives the remaining percentage. For example, if you enter 80% for the top output, the bottom output is set to 20%.
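To make the percentage behavior concrete, here is a minimal Python sketch of a random percentage split. It is an illustration only and not the Split tool's actual implementation: the function name split_rows, the seed parameter, and the rounding of the cut point are all assumptions made for this example.

```python
import random


def split_rows(rows, top_percent, seed=None):
    """Randomly split rows into (top, bottom) by percentage.

    Hypothetical helper: the name, the seed argument, and the rounding
    rule are assumptions for illustration, not the tool's documented behavior.
    """
    if not 0 <= top_percent <= 100:
        raise ValueError("top_percent must be between 0 and 100")

    # Shuffle a copy so the caller's data is left untouched.
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)

    # The top output gets top_percent of the rows; the bottom gets the rest.
    cut = round(len(shuffled) * top_percent / 100)
    return shuffled[:cut], shuffled[cut:]


rows = list(range(1, 11))  # ten example rows
top, bottom = split_rows(rows, top_percent=80, seed=42)
print(len(top), len(bottom))  # 8 2
```

Every input row lands in exactly one of the two outputs, which mirrors how the bottom output is derived from the top percentage rather than set independently.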