Measures of central tendency and dispersion
Measures of Central Tendency The central tendency measures the position of the center or midpoint of a dataset. These measures provide valuable insights int...
Measures of Central Tendency The central tendency measures the position of the center or midpoint of a dataset. These measures provide valuable insights int...
Measures of Central Tendency
The central tendency measures the position of the center or midpoint of a dataset. These measures provide valuable insights into the central tendency of a dataset, including the mean, median, and mode.
Mean (X̄): The mean is calculated by adding up all the values in a dataset and dividing the sum by the total number of values. It is a robust measure that is not sensitive to outliers.
Median (Q2): The median is the middle value in a dataset when arranged in order from smallest to largest. It is not affected by outliers and is a robust measure that is commonly used.
Mode (Mo): The mode is the value that appears most frequently in a dataset. It is a measure of central tendency that is only applicable to numerical data.
Measures of Dispersion
The dispersion measures the spread or variability of a dataset. These measures provide insights into how spread out the data is and help identify outliers.
Variance (Var): The variance is a measure of how spread out a dataset is. It is calculated by calculating the average of the squared differences between each data point and the mean. The variance is a robust measure that is not affected by outliers.
Standard deviation (σ): The standard deviation is a measure of how much the data is spread out. It is calculated by taking the square root of the variance. The standard deviation is a robust measure that is commonly used.
Range: The range is the difference between the highest and lowest values in a dataset. It is a measure of dispersion that is not affected by outliers.
By understanding measures of central tendency and dispersion, we can gain valuable insights into the central tendency and spread of a dataset. This information can be used to identify central tendencies, assess the spread of data, and identify outliers