SGN Documentation

View the Project on GitHub solgenomics/sgn

Home   3. Managing Breeding Data  

3.22 Managing Outliers in Dataset


3.22.1 What is Outliers Functionality in Dataset ?

As in step 2.1.2 -> Saving the wizard selections we can create a dataset.

The dataset incorporates a feature to identify outlier points, which we may choose to exclude from a specific dataset. It’s important to note that these exclusions only apply at the dataset level, and no data is permanently removed from the database. Additionally, outlier categorization can be modified at any time, and these changes are visible to all other functionalities within the system.

Each dataset stores a wholly unique set of outlier points, completely independent of any other dataset in the database. Outliers are specifically designated for traits within datasets, exclusively encompassing phenotype data. If a particular dataset lacks traits as a part of wizard selection, this functionality is not available.

Each trait has its own set of defined outliers.

3.22.2 Accessing Trait Visualisation

Once you’ve selected a specific trait, the web application provides access to a visualisation of the data points associated with that trait.

3.22.3 Interpreting Visual Elements

Once you’ve selected a specific trait, the web application provides access to a visualisation of the data points associated with that trait.

3.22.4 Choosing Cut-Off Values

You have two fundamental options for setting cut-off points:

3.22.5 Setting Deviation Multiplier

The slider allows you to specify the deviation multiplier from a central point, which influences the cut-off values.

3.22.6 Utilising Graph Controls

Beneath the graph, you’ll find four buttons, each serving a distinct function:

These tools and functions are designed to provide you with control and insights when working with data visualisation and outliers..