Skip to main content

Symon.AI help center

Predictor

Train the model to predict a column with a fixed set of values.

This tool considers all classification modelling tools and regression modelling tools. Symon.AI uses the tool with the best score to make a prediction.

Note

The speed and quality slider is more of a spectrum. This setting determines the number of machine learning models considered. Models that do not support the training data are automatically excluded.

This tool adds two columns to your data: a prediction and a probability. The probability is the likelihood that the prediction is accurate.

When you run the tool, the data is automatically split: 80% of the data is used for training. The remaining 20% is used for testing. Each model being considered is trained and evaluated to select the one with the best score. This is done 5 times to predict the test values (the 20% of your data). The final score is the average of all 5 scores.

Tip

You can configure this tool without using the configuration menu.

In the Add tool menu, start typing the first few letters of the tool name and press tab to auto-complete. Then start typing the name of the column you want to use and press tab to auto-complete.

Explain Waterfall visuals

From the Run page, you can access the Explain link to open the Waterfall visual. The run results are available when you select one of the Classifier, Predictor or Regressor tools.

Accessing Explain Waterfall visuals from the Run tab
  1. In your selected pipe, go to the Run page. Ensure that you have previously built and run the pipe.

  2. Click View from the side panel.

  3. In the data, the Explain link appears. Click to open the Waterfall visual.

Accessing Explain Waterfall visuals from the Build tab
  1. In your selected pipe go to the Build tab and select the classifier node in a built pipe.

  2. Open the row viewer.

  3. Click on the Explain link.

  4. Click to open the Waterfall visual.

Note

If you have an explainable tool upstream, you can still get an error message with one of the following issues:

  • The schema has changed in the export. For example, a missing column or an extra column is present.

  • There are multiple explainable tools in the pipe upstream.

  • The pipe changed and the calculation is now invalid.

If there is no Explain link, there is no explainable tool selected in the pipe upstream.

When to use this tool

Use this when you want to make a prediction, but you're unfamiliar with classification and regression modelling tools. After you're comfortable with the difference between those tools, switch to using the Regressor or Classifier tool.

What is Smart exclude?

Following a successful build using the Predictor tool, Smart exclude identifies and automatically excludes columns that don’t help predict the target column. Smart exclude will only consider columns not already manually excluded. If you want to disable this setting to troubleshoot, test, or run a calculation that is taking too long, go to the Advanced settings under the Configure tab.