Question 1

When should I use mean vs median?

Accepted Answer

Use the median when your data is skewed or has outliers — income, house prices, response times, and biological measurements like viral load all tend to be right-skewed. The median is robust to extreme values. Use the mean when your data is roughly symmetric and normally distributed — exam scores, measurement errors, and most physical measurements tend to be normally distributed. When in doubt, report both: they tell different stories. If mean and median are very different, your data is skewed and the median is the more honest "typical" value.

Question 2

What does a large standard deviation mean?

Accepted Answer

A large standard deviation (relative to the mean) means the data is spread widely around the average — individuals vary greatly from the typical value. A small SD means data is clustered tightly around the mean. In context: a class with mean exam score 70 and SD 5 is homogeneous — most students score between 60–80. A class with mean 70 and SD 20 is heterogeneous — scores range widely from under 30 to above 100. SD has no absolute interpretation; it's meaningful relative to the scale of the data or compared between datasets measuring the same thing.

Question 3

What is the difference between population and sample statistics?

Accepted Answer

A population is every member of the group you're interested in; a sample is a subset you've measured. Population standard deviation (σ) divides by n; sample standard deviation (s) divides by n−1 (Bessel's correction). If you've measured every student in a class, use population formulas. If you've sampled 30 students from a school of 1,000, use sample formulas. In practice, almost all real-world data analysis uses sample statistics because true populations are rarely fully measurable. Most calculators and software use n−1 by default, which is correct for the overwhelming majority of use cases.

Question 4

What is the mode useful for?

Accepted Answer

Mode is most useful for categorical data (the most common colour, the most popular product, the most frequent survey response) and for discrete data with a natural "typical" value. For continuous measurements like height or temperature, the mode is rarely meaningful — with enough decimal places, every value appears exactly once. In bimodal distributions (two distinct humps), the mode reveals structure that mean and median miss: a bimodal height distribution might indicate you're mixing men and women in one dataset. For shoe sizes, mode is far more useful to a retailer than mean ("buy more of size 8") even though mean is technically calculable.

Descriptive Statistics

Related calculators

About the Descriptive Statistics

How it works

Tips to improve your result

Frequently asked questions

When should I use mean vs median?

What does a large standard deviation mean?

What is the difference between population and sample statistics?

What is the mode useful for?