Basic StatisticsBeginner· 8 min read

Mean, Mode and Median: How to Summarize a Data Set

Learn to summarize data sets with mean, mode and median — and discover when each measure tells the most truthful story about your data.

RF

Renato Freitas

Updated on May 5, 2026

What are measures of central tendency?

When we have a set of numbers, we want to find a single value that 'represents' it. That central value makes comparisons, summaries and decision-making easier. The three most commonly used measures for this purpose are the mean, the mode and the median — each one reveals a different aspect of the data set.

Imagine the grades of a class: 4, 5, 5, 6, 7, 8, 10. With more than seven values, it is hard to say at a glance 'how the class did'. Measures of central tendency turn that set into a single number that carries essential information.

🧮 Try it yourself — CalcSim

Want more features? Download CalcSim IA app

Arithmetic mean: add and divide

The arithmetic mean is calculated by adding all values and dividing by the number of elements. For the grades 4, 5, 5, 6, 7, 8, 10, the sum is 45 and there are 7 values, so the mean is 45 ÷ 7 ≈ 6.43.

The mean works very well when data are symmetric and there are no extreme values. However, a single very high or very low value — called an outlier — pulls the mean significantly. Think of salaries at a company: if 10 employees earn $3,000 and the CEO earns $200,000, the average salary exceeds $20,000, a figure that does not represent the majority.

That is why the mean is a good measure when data are relatively uniform. In standardized school assessments, for example, it works very well because grades tend to be distributed in a balanced way.

  • Formula: Mean = (sum of all values) ÷ (number of values)
  • Sensitive to outliers — one extreme value changes the result significantly
  • Ideal for symmetric data without large discrepancies

Mode: the most frequent value

The mode is the value that appears most often in the set. In the grades example 4, 5, 5, 6, 7, 8, 10, the number 5 appears twice, making it the mode. A set can be unimodal (one mode), bimodal (two modes), or have no mode at all when every value appears the same number of times.

The mode is especially useful with categorical data. The best-selling car color, the most-ordered clothing size, or the most frequently cited neighborhood in complaints are examples where the mode directly answers 'what is most common?'. In market analysis and inventory planning, the mode guides practical decisions.

Median: the middle value

The median is the value that occupies the central position when data are sorted. With 7 grades (4, 5, 5, 6, 7, 8, 10), the central position is the 4th, so the median is 6. When there is an even number of values, the median is the average of the two middle values.

The great advantage of the median is its robustness against outliers. Going back to the salary example: if 10 employees earn $3,000 and the CEO earns $200,000, the median will be $3,000 — a value that truly represents the reality of the majority. That is why income surveys, property prices and other skewed data sets usually report the median rather than the mean.

Choosing the right measure makes all the difference in data interpretation. Symmetric distributions call for the mean; data with outliers or skewness call for the median; categorical or frequency data call for the mode.

Frequently asked questions

Can I calculate the mode of decimal or fractional data?

Yes, but it is rare for decimals to repeat exactly in continuous data. In those cases, it is common to group data into intervals and identify the most frequent interval, called the modal class.

Why is the median better than the mean for salaries?

Because salaries tend to have an asymmetric distribution with a few very high values. The median ignores those extremes and shows what the 'middle worker' actually earns, while the mean would be inflated by the highest earners.

When a data set has two modes, which one should I use?

You can report both. A bimodal set may indicate that there are two distinct groups in the sample, which is valuable information in itself. For example, two modes in height data may indicate the sample mixes adults and children.

Are the mean, mode and median always different?

No. In perfectly symmetric distributions (such as the normal distribution), all three coincide. In simple symmetric data like 1, 2, 3, 4, 5 the mean and median are equal (3), although there is no mode.

How do I calculate the median with an even number of values?

Sort the values, identify the two in the middle and calculate their average. For example, for 3, 5, 7, 9 the two middle values are 5 and 7, and the median is (5 + 7) ÷ 2 = 6.

Was this article helpful?

Rate with stars to help us improve the content.

Sign in to rate this article.

Still have questions?

The AI Professor explains step by step

Ask a question in natural language and get a personalised explanation about Basic Statistics — or any other topic.

Prefer to solve it on your phone?

Download the free app →

Keep learning