links are more accurate descriptions |
location and scale update |
||
Line 15: | Line 15: | ||
'''BWMV:''' The [https://en.wikipedia.org/wiki/Robust_measures_of_scale#The_biweight_midvariance biweight midvariance] is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root (marked as <tt>sqrt(BWMV)</tt>) of this value. | '''BWMV:''' The [https://en.wikipedia.org/wiki/Robust_measures_of_scale#The_biweight_midvariance biweight midvariance] is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root (marked as <tt>sqrt(BWMV)</tt>) of this value. | ||
'''Location and Scale:''' In order to align the histograms of the different | '''Location and Scale:''' [https://en.wikipedia.org/wiki/Location%E2%80%93scale_family these parameters], often colloquially called ''scale and offset'', are not displayed in the user interfaces but are computed internally by Siril. In order to align the histograms of the different images for normalization before stacking, one needs to compute ''where'' they are in terms of level and ''how wide'' they are in terms of spread. A valid estimator of location could be taken as the median while the MAD or the <tt>sqrt(BWMV)</tt> could be used for scale. However, in order to give more robustness to the measures, the pixels more than 6 x MAD away from the median are discarded. On this clipped dataset, the median and <tt>sqrt(BWMV)</tt> are re-computed and used as location and scale estimators respectively. They are computed relative to the ''reference image'' of a sequence in Siril. |
Revision as of 22:07, 9 May 2021
This is a documentation for Siril's statistics, given by the graphical user interface (GUI) from the contextual menu of images (right-clicking in it) then selecting Statistics..., from the Image Information submenu of the application menu after also selecting Statistics... or using the stat command. Note that when using the GUI, it is possible to draw a selection in the loaded image and that when doing so, the statistics are computed on the pixels of region. Many of these values are measures of statistical dispersion.
Mean: this is the arithmetic mean, also known as average or arithmetic average. This is computed by doing the sum of the pixel values divided by the number of pixels in an image channel.
Median: the median is the value separating the higher half from the lower half of a dataset. Generally, it represents the value of the background of an astronomical image.
Sigma: also known as the standard deviation, noted σ, this is a measure of dispersion of the image pixels based on squared differences from the average. The sigma value of a sub image containing only the background will represent the noise of the image.
Background noise: (available by the GUI from the Image Information submenu of the application menu after selecting Noise estimation, also displayed at the end of stacking) this is a measure of estimated noise in image background level, for pixels having a value low enough to be considered as background. It is an iterative process based on k.sigma (a factor of the standard deviation above the median), so there is no fixed threshold for the low enough.
avgDev: the Average Deviation, also called AAD for average absolute deviation or mean absolute deviation. In order to understand what the average deviation is, one needs to understand what the term absolute deviation is. Absolute deviation is the distance between each value in the dataset and that dataset's mean (in this instance) or median (for MAD below). Taking all of these absolute deviations, finding the average, and the mean average deviation is computed. To simplify, if standard deviation is the squared deviation from the mean, this is the linear version of it.
MAD: the Median Absolute Deviation is a robust measure of how spread out a set of data is. The absolute deviation and standard deviation are also measures of dispersion, but they are more affected by extremely high or extremely low values. It is similar to the average deviation above, but relative to the median instead of the mean.
BWMV: The biweight midvariance is yet another tool to measure dispersion of a dataset, even more robust than others cited above to outliers. It discards the data points too far way from the median and computes a weighted variance, weights decreasing as the data points are further way from the median. The estimator of dispersion is the square root (marked as sqrt(BWMV)) of this value.
Location and Scale: these parameters, often colloquially called scale and offset, are not displayed in the user interfaces but are computed internally by Siril. In order to align the histograms of the different images for normalization before stacking, one needs to compute where they are in terms of level and how wide they are in terms of spread. A valid estimator of location could be taken as the median while the MAD or the sqrt(BWMV) could be used for scale. However, in order to give more robustness to the measures, the pixels more than 6 x MAD away from the median are discarded. On this clipped dataset, the median and sqrt(BWMV) are re-computed and used as location and scale estimators respectively. They are computed relative to the reference image of a sequence in Siril.