import
 
adding title
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
This is a documentation for [[Siril]]'s statistics, given by the graphical user interface from the contextual menu of images (right-clicking in it) then selecting 'Statistics...', from the <tt>Image Information</tt> submenu of the application menu after also selecting 'Statistics...' or using the [[Siril:Commands#stats|stat command]].
=Siril's statistics=


'''Mean:''' this is the arithmetic mean, also known as average or arithmetic average. This is computed by doing the sum of the values divided by the number of values.
This is a documentation for [[Siril]]'s statistics, given by the graphical user interface (GUI) from the contextual menu of images (right-clicking in it) then selecting <tt>Statistics...</tt>, from the <tt>Image Information</tt> submenu of the application menu after also selecting <tt>Statistics...</tt> or using the [[Siril:Commands#stat|stat command]]. Note that when using the GUI, it is possible to draw a selection in the loaded image and that when doing so, the statistics are computed on the pixels of region. Many of these values are measures of [https://en.wikipedia.org/wiki/Statistical_dispersion statistical dispersion].


'''Median:''' this is the value separating the higher half from the lower half of a data sample. Generally, it represents the value of the background of an astronomical image.
'''Mean:''' this is the [https://en.wikipedia.org/wiki/Arithmetic_mean arithmetic mean], also known as average or arithmetic average. This is computed by doing the sum of the pixel values divided by the number of pixels in an image channel.


'''Sigma:''' this is a measure of the amount of dispersion of the image pixels. The sigma value of a sub image containing only the background will represent the noise of the image.
'''Median:''' the [https://en.wikipedia.org/wiki/Median median] is the value separating the higher half from the lower half of a dataset. Generally, it represents the value of the background of an astronomical image.


'''Background noise:''' this is a measure of estimated noise in image background level, for pixels having a value low enough to be considered as background. It is an iterative process based on k.sigma, so there is no fixed threshold for the ''low enough''.
'''Sigma:''' also known as the [https://en.wikipedia.org/wiki/Standard_deviation standard deviation], noted σ, this is a measure of dispersion of the image pixels based on squared differences from the average. The sigma value of a sub image containing only the background will represent the noise of the image.


'''Average Deviation:''' In order to understand what the average deviation is, one needs to understand what the term absolute deviation is. Absolute deviation is the distance between each value in the dataset and that dataset's mean or median. Taking all of these absolute deviations, finding the average, and the mean average deviation is computed.
'''Background noise:''' ''(available by the GUI from the <tt>Image Information</tt> submenu of the application menu after selecting <tt>Noise estimation</tt>, also displayed at the end of stacking)'' this is a measure of estimated noise in image background level, for pixels having a value low enough to be considered as background. It is an iterative process based on k.sigma (a factor of the standard deviation above the median), so there is no fixed threshold for the ''low enough''.


'''Median Absolute Deviation (MAD):''' this is a robust measure of how spread out a set of data is. The absolute deviation and standard deviation are also measures of dispersion, but they are more affected by extremely high or extremely low values.
'''avgDev:''' the Average Deviation, also called AAD for [https://en.wikipedia.org/wiki/Average_absolute_deviation average absolute deviation] or mean absolute deviation. In order to understand what the average deviation is, one needs to understand what the term absolute deviation is. Absolute deviation is the distance between each value in the dataset and that dataset's mean (in this instance) or median (for MAD below). Taking all of these absolute deviations, finding the average, and the mean average deviation is computed. To simplify, if standard deviation is the squared deviation from the mean, this is the linear version of it.


'''Bi-Weight Mid-Variance (BWMV):''' This is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root of this value.
'''MAD:''' the [https://en.wikipedia.org/wiki/Median_absolute_deviation Median Absolute Deviation] is a robust measure of how spread out a set of data is. The absolute deviation and standard deviation are also measures of dispersion, but they are more affected by extremely high or extremely low values. It is similar to the average deviation above, but relative to the median instead of the mean.


'''Location and Scale:''' In order to align the histograms of the different frames for normalization before stacking, one needs to compute ''where'' they are in terms of level and ''how wide'' they are in terms of spread. A valid estimator of location could be taken as the median while the MAD or the <tt>sqrt(BWMV)</tt> (sqrt means square root) could be used for scale. However, in order to give more robustness to the measures, the pixels more than 6 x MAD away from the median are discarded. On this clipped dataset, the median and <tt>sqrt(BWMV)</tt> are re-computed and used as location and scale estimators respectively.
'''BWMV:''' The [https://en.wikipedia.org/wiki/Robust_measures_of_scale#The_biweight_midvariance biweight midvariance] is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root (marked as <tt>sqrt(BWMV)</tt>) of this value.
 
'''Location and Scale:''' [https://en.wikipedia.org/wiki/Location%E2%80%93scale_family these parameters], often colloquially called ''scale and offset'', are not displayed in the user interfaces but are computed internally by Siril. In order to align the histograms of the different images for normalization before stacking, one needs to compute ''where'' they are in terms of level and ''how wide'' they are in terms of spread. A valid estimator of location could be taken as the median while the MAD or the <tt>sqrt(BWMV)</tt> could be used for scale. However, in order to give more robustness to the measures, the pixels more than 6 x MAD away from the median are discarded. On this clipped dataset, the median and <tt>sqrt(BWMV)</tt> are re-computed and used as location and scale estimators respectively. They are computed relative to the ''reference image'' of a sequence in Siril.

Latest revision as of 18:53, 10 May 2021

Siril's statistics

This is a documentation for Siril's statistics, given by the graphical user interface (GUI) from the contextual menu of images (right-clicking in it) then selecting Statistics..., from the Image Information submenu of the application menu after also selecting Statistics... or using the stat command. Note that when using the GUI, it is possible to draw a selection in the loaded image and that when doing so, the statistics are computed on the pixels of region. Many of these values are measures of statistical dispersion.

Mean: this is the arithmetic mean, also known as average or arithmetic average. This is computed by doing the sum of the pixel values divided by the number of pixels in an image channel.

Median: the median is the value separating the higher half from the lower half of a dataset. Generally, it represents the value of the background of an astronomical image.

Sigma: also known as the standard deviation, noted σ, this is a measure of dispersion of the image pixels based on squared differences from the average. The sigma value of a sub image containing only the background will represent the noise of the image.

Background noise: (available by the GUI from the Image Information submenu of the application menu after selecting Noise estimation, also displayed at the end of stacking) this is a measure of estimated noise in image background level, for pixels having a value low enough to be considered as background. It is an iterative process based on k.sigma (a factor of the standard deviation above the median), so there is no fixed threshold for the low enough.

avgDev: the Average Deviation, also called AAD for average absolute deviation or mean absolute deviation. In order to understand what the average deviation is, one needs to understand what the term absolute deviation is. Absolute deviation is the distance between each value in the dataset and that dataset's mean (in this instance) or median (for MAD below). Taking all of these absolute deviations, finding the average, and the mean average deviation is computed. To simplify, if standard deviation is the squared deviation from the mean, this is the linear version of it.

MAD: the Median Absolute Deviation is a robust measure of how spread out a set of data is. The absolute deviation and standard deviation are also measures of dispersion, but they are more affected by extremely high or extremely low values. It is similar to the average deviation above, but relative to the median instead of the mean.

BWMV: The biweight midvariance is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root (marked as sqrt(BWMV)) of this value.

Location and Scale: these parameters, often colloquially called scale and offset, are not displayed in the user interfaces but are computed internally by Siril. In order to align the histograms of the different images for normalization before stacking, one needs to compute where they are in terms of level and how wide they are in terms of spread. A valid estimator of location could be taken as the median while the MAD or the sqrt(BWMV) could be used for scale. However, in order to give more robustness to the measures, the pixels more than 6 x MAD away from the median are discarded. On this clipped dataset, the median and sqrt(BWMV) are re-computed and used as location and scale estimators respectively. They are computed relative to the reference image of a sequence in Siril.