Outlier filter
From Piki
The outlier remover filter eliminates outliers from data.
| Oultier remover | |
| Name | Oultier remover |
| Deployable | No |
| Static | Yes |
| Sample modifier | Yes |
| Feature modifier | No |
Contents |
Usage
The outlier remover is used to purge the data of samples that are outside the range of normal values. This filter can only be used with static systems and should not be used with time series data.
Operation
The outlier remover has two basic operational modes: statistical and histogram modes. The statistical (or sigma) mode removes samples that fall outside of a normal distribution. This works very well when the data roughly follows a bell shaped distribution.
The histogram mode removes samples that occur less than a specified frequency in a histogram of the features of the data unit are removed. This will work with any distribution but is sensitive to the histogram settings.
The two modes can be combined.
Settings
The settings can be modified using the settings browser.
| Outlier remover settings |
|---|
|
|
General advice
- Be careful not to remove too much with the outlier remover. It is advisable to keep the validation set intact.
See also
- Filter - Article covering general filter principles.
- List of Filter components - List of all available filters.
- Data unit - Article explaining data units in detail.
- Data - Article explaining essential data concepts.


