Feature Selection Options

 

FS Options

 

Information-based measure

Calculates the strength of the relationship between each input field and the target (last) field. The strength of the relationship is measured using information theory. This measure is only applicable when the target field is categorical or discrete numeric.

 

Correlation coefficients

Calculates the strength of the relationship between each input field and the target (last) field. The strength of the relationship is measured using correlation coefficients. This measure is only applicable when the target field is numeric.

 

Display feature selection results

A report dialog can be displayed to show the scores for each field.

 

Select best fields for output

Selects the best fields for use in a later part of the current stream.

 

Number of fields to select

Enter the number of best-scoring fields that will be selected. Alternatively, click on the spin control to increase or decrease the current value. The default value is 1.

 

Continuous field step size

The information gain measure can be slow for continuous numeric fields with a large number of records. Increasing the step size will increase the speed of the algorithm but may reduce the accuracy of the results.

You can click on the spin control to increase or decrease the current step size for the information-based method. The default value is 1. We recommend that you leave the setting at 1 unless the process is very slow for some fields, in which case increase the value until you obtain an acceptable rate of progress.