Main Content

Percentile Plot

In theSimBiology Model Analyzerapp, you can visualize time course data and its corresponding statistics using a percentile plot. The plot shows curves of summary statistics (percentiles or mean and standard deviation) over time. You can also view the raw data along with summary statistics.

You can choose between two methods to aggregate time-varying data and compute summary statistics. For details, seeInterpolation MethodandTime Point Binning Method

To show a percentile plot, select a data source that contains time courses in theBrowserpane, then clickpercentilein thePlotsection on theHometab. For a workflow example, seeVisualize Simulation Statistics and Overlay Experimental Data Using Percentile Plot

Display Options

Each response in a percentile plot has three display options, which you can configure.

  • Percentiles— Shows the percentile curves. By default, the plot shows 5th and 95th percentiles. Scan programs with more than 40 samples use percentile plots as default plots. You can change this default cutoff inPreferences>Programs>Plots.This is the default display type for simulation data.

  • Mean— Shows the mean and standard deviation of response data at each time point. This is the default display type for experimental data.

  • Raw Data— Shows the original response data points at each time point.

Responses section with menu showing the three display

Percentiles Options

ThePercentilessection provides the following options to configure the percentile curves.

Option Description
Show percentiles (%) Percentiles to plot. Enter one or more nonnegative numbers between 0 and 100 or any MATLAB®expression that results in a nonnegative number or vector of values between 0 and 100 in ascending order.
Show median 逻辑flag to show or hide the median line in the plot.
显示风格 Display format to show lines, shading, or both.

Mean Options

TheMeansection provides the following options to configure the plots.

Option Description
Show mean 逻辑flag to show the mean response value at each time point. The plot uses the markeroto represent mean values.
Show standard deviation 逻辑flag to show ±1 standard deviation of the response value at each time point. The plot shows error bars to indicate the standard deviations.
Show min/max

逻辑flag to show the derived minimum and maximum response values at each time point.

To calculate these values, the app first interpolates all the time courses to a common time vector and then calculates the statistics, such as min, max, mean, and standard deviation, at each time point on the common time vector across all interpolated time courses. Hence the interpolated maximum and minimum values at a given time point shown in the percentile plot may not match those values of the raw data exactly.

The plot uses the marker*to indicate the minimum and maximum values.

显示风格 Display format to show lines, markers, or both.

Data Options

每个数据源的百分位的阴谋,你can select the data aggregation method and related options in the corresponding data section of theProperty Editorpane. The app provides two data aggregation methods:interpolationandbinning.推荐使用interpolationfor densely recorded data, such as simulated model responses. Use time pointbinningfor sparsely recorded data, such as experimental data.

If you are plotting multiple responses or slicing the data by a covariate or parameter, the app performs data aggregation independently for each response or data slice. Different responses can have different automatic interpolation or binning results.

Each data aggregation method has its own set of options. You can change these options for each data source independently from other data sources.

Interpolation

If you selectinterpolationas the data aggregation method, the app interpolates all time courses onto a common time vector. The summary statistics are calculated on the interpolated data at every time point in the common time vector. For details, seeInterpolation Method.You can also specify a custom common time vector and the interpolation method used for data aggregation. The next table summarizes the available options.

Option Description
Time vector

Common time vector onto which all time courses are interpolated before calculating the summary statistics.

The default optionauto选择使用一个共同的时间向量的向量of equidistant points between the minimum and maximum time points specified in the data.

Alternatively, specify the time vector as a sequence of numbers or MATLAB expression that evaluates to a vector of strictly increasing numbers greater than or equal to zero.

Interpolation method

Method used to interpolate time courses onto a common time vector. The app callsinterp1with the specifiedmethod(default islinear). For simulation data, the app treats multiple response values at the same time point as a discontinuity and performs piecewise interpolation between such time points. For experimental data, the app treats these response values at the same time point as repeated measurements and uses the mean of all measurements at the same time point.

Show raw data fraction (%) Percentage of raw (original) time courses or data shown in the plot. Enter a nonnegative integer between 0 and 100.

For details, seeInterpolation Method

Binning

If you selectbinningas the data aggregation method, the app clusters the data points into different bins based on their time values. It calculates summary statistics for data within each bin and plots the statistics at the centroid of each bin, which is the mean of time values of data within that bin. For details, seeTime Point Binning Method.The next table summarizes the available options.

Option Description
Binning method
  • auto— The app scans over several possible numbers of bins and calculates thekmeans(Statistics and Machine Learning Toolbox)clustering solution for each of these values. It then determines the optimal numbers of bins by selecting the solution that minimizes theDavies-Bouldin Criterion(Statistics and Machine Learning Toolbox)

  • specify number of bins— As an alternative toauto, you can specify a custom number of bins to use for the k-means algorithm.

  • specify bin edges— Specify the exact edges or boundaries for each bin to avoid thekmeansclustering.

Show bin edges 逻辑flag to display vertical lines that indicate the bin boundaries.
Show raw data fraction (%) Percentage of raw (original) time courses or data shown in the plot. Enter an integer between 0 and 100.

For details, seeTime Point Binning Method

Interpolation Method

When you are usinginterpolationas the data aggregation method, the app calculates summary statistics using the following steps.

  1. The app calculates a common time vector as a vector of equidistant time points between the minimum and maximum time points in the data across all the groups (or runs) in each data slice or obtains the time vector by using the code specified in theTime vectoroption.

  2. It then interpolates the response time course for each group or run onto the common time vector usinginterp1with the method specified in theInterpolation methodoption. For simulation data, the app treats multiple response values at the same time point as a discontinuity and performs piecewise interpolation between such time points. For experimental data, the app treats data at the same time point as repeated measurements and uses the mean of all measurements at those time points.

  3. The app then calculates the corresponding statistics, such as percentiles, mean, max, standard deviation, for each time point in the common time vector across all groups for that time point in the interpolated time courses.

    Note

    Because of interpolation, calculated maximum and minimum values might be different than those values from the original data.

  4. It then generates a plot using the calculated statistics against the common time vector according to the显示风格option.

Time Point Binning Method

When you are usingbinningas the data aggregation method, the app calculates summary statistics using the following steps.

  1. The app partitions the data intonbins using only the time values for each data point. It does not consider any similarities in measurement values. By default, the data is binned using thekmeans(Statistics and Machine Learning Toolbox)algorithm, and you can also specify a custom number of bins or specific bin edges (or boundaries).

  2. The app calculates summary statistics for each bin.

  3. It obtains the common time vector by calculating the mean time value for each bin.

  4. The app then generates a plot using the computed statistics from step 2 against the mean time value for each bin from step 3 according to the显示风格option.

See Also

Related Topics