Mean

This material is an extract from our National 5 Mathematics: Curriculum Breakdown course led by instructor Andrew Eadie.

What is Mean?

The mean, or alternatively, the “arithmetic average”, is the sum of the values in a dataset divided by the number of values in the dataset.

$\begin{aligned}\textbf{Mean}=\frac{\textbf{Sum of values in set}}{\textbf{Number of values in set}}\end{aligned}$

The purpose of the mean is to calculate a figure which is representative of the dataset as a whole, or representative of the dataset “on average”.

For example, imagine a school has three classes of S4 pupils – Class A, Class B and Class C – who have all sat their National 5 Maths prelim. Each class has 10 pupils.

Class A’s results (as percentages) were as follows: 70, 70, 71, 74, 78, 79, 81, 82, 83, 88.

The mean or “average” result for Class A can be calculated:

$\begin{aligned}&\text{Class A Mean}=\frac{\text{Sum of results}}{\text{Number of results}} \\[16pt]&\text{Class A Mean}=\frac{70+70+71+74+78+79+81+82+83+88}{10} \\[12pt]&\text{Class A Mean}=\frac{776}{10} \\[12pt]&\text{Class A Mean}=77.6\% \\[16pt]&\text{Members of Class A scored 77.6\% on average.}\end{aligned}$
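The same calculation can be checked in a few lines of code. The following is a minimal Python sketch rather than part of the course material: the names `class_a` and `mean` are chosen here purely for illustration, and the scores are the ones that appear in the worked calculation.

```python
# Class A's prelim results as percentages, copied from the worked calculation.
class_a = [70, 70, 71, 74, 78, 79, 81, 82, 83, 88]


def mean(values):
    """Return the sum of the values divided by the number of values."""
    return sum(values) / len(values)


class_a_mean = mean(class_a)
print(f"Class A mean: {class_a_mean}%")  # Class A mean: 77.6%
```

The function is just the formula written out: `sum(values)` is the numerator and `len(values)` is the denominator.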

Looking at the spread of results across Class A, would you agree that a mean of 77.6% is a fairly good representation of what the typical class member might have achieved in the prelim? Let’s add a purple line to the graph of results to represent the mean score:

As you can see, the mean falls somewhere in the middle of the range of results, with nobody really scoring wildly more or less than 77.6%, so I think it offers a fairly accurate measure of how the class performed as a whole.

Now Class B. Class B’s results (as percentages) were as follows: 5, 72, 72, 75, 76, 79, 80, 81, 87, 89.

The mean or “average” result for Class B can be calculated:

$\begin{aligned}&\text{Class B Mean}=\frac{\text{Sum of results}}{\text{Number of results}} \\[16pt]&\text{Class B Mean}=\frac{5+72+72+75+76+79+80+81+87+89}{10} \\[12pt]&\text{Class B Mean}=\frac{716}{10} \\[12pt]&\text{Class B Mean}=71.6\% \\[16pt]&\text{Members of Class B scored 71.6\% on average.}\end{aligned}$

Do you think that a 71.6% average accurately describes what the typical member of Class B scored? Personally, I don’t think it does. Let’s add a blue line to the graph of results to represent the mean score:

Comparing the results to the mean, we can see that everyone except the one student who bombed the test with only 5% actually scored more than the mean result of 71.6%. With 9 out of 10 students scoring above the mean, the average is not a particularly good measure of the typical class member’s score in this case.
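The claim that 9 of the 10 pupils beat the mean can be checked by comparing each score against it. This is a rough Python sketch; the variable names are illustrative assumptions, and the scores are taken from the Class B calculation.

```python
# Class B's prelim results as percentages, copied from the worked calculation.
class_b = [5, 72, 72, 75, 76, 79, 80, 81, 87, 89]

class_b_mean = sum(class_b) / len(class_b)  # 716 / 10

# Keep only the scores that are strictly greater than the mean.
above_mean = [score for score in class_b if score > class_b_mean]

print(len(above_mean), "of", len(class_b), "pupils scored above the mean")
# 9 of 10 pupils scored above the mean
```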

So what went wrong? Well, like Class A, all of Class B’s students scored in the 70-90% range, except for that one pupil who got only 5%. This pupil is a statistical outlier: a result which lies outside of the overall pattern of a distribution (the distribution in this case being Class B’s prelim results). Statistical outliers in a dataset can skew the mean quite considerably, causing it to become a less accurate representation of the dataset as a whole.
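One way to see how much a single outlier moves the mean is to recompute it with that result left out. The sketch below is a hypothetical "what if" for illustration only. It is not a suggestion that outliers should be discarded, and the names used are assumptions.

```python
class_b = [5, 72, 72, 75, 76, 79, 80, 81, 87, 89]


def mean(values):
    """Return the sum of the values divided by the number of values."""
    return sum(values) / len(values)


# Hypothetical comparison: drop the single 5% result and recompute.
class_b_without_outlier = [score for score in class_b if score != 5]

mean_with = mean(class_b)                     # 716 / 10
mean_without = mean(class_b_without_outlier)  # 711 / 9

print(mean_with, mean_without)  # 71.6 79.0
```

A single result out of ten pulls the mean down by 7.4 percentage points, from 79% to 71.6%.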

The mean or average is often used improperly to gauge where “roughly the middle” of a dataset might be. The mean is not appropriate for this purpose. As we have seen with Class B, the mean can be a poor measure of central tendency because it is greatly influenced by statistical outliers in the dataset. (To actually find where the middle of a dataset is, you should use the median.)

Let’s now look at Class C. Class C’s results (as percentages) were as follows: 4, 12, 25, 37, 46, 51, 63, 71, 87, 95.

The mean or “average” result for Class C can be calculated:

$\begin{aligned}&\text{Class C Mean}=\frac{\text{Sum of results}}{\text{Number of results}} \\[16pt]&\text{Class C Mean}=\frac{4+12+25+37+46+51+63+71+87+95}{10} \\[12pt]&\text{Class C Mean}=\frac{491}{10} \\[12pt]&\text{Class C Mean}=49.1\% \\[16pt]&\text{Members of Class C scored 49.1\% on average.}\end{aligned}$

Do you think that a 49.1% average accurately describes what the typical member of Class C scored? I think it’s quite clear that this does not describe the class accurately at all! Let’s add a green line to the graph of results to represent the mean score:

If you’re one of the students who scored 4% or 12%, a 49.1% average might sound quite good to you, but if you’re one of the students who scored 87% or 95%, you might be quite annoyed to hear people were saying your class only has a 49.1% average!

The problem with Class C’s set of scores is that they are highly dispersed – meaning that they are highly variable and cover a very wide range of results. For a dataset with such a high level of dispersion, the mean will not be a very accurate representation of the typical class member’s score.
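The text describes dispersion only in words. One simple way to put a number on it is the range, meaning the highest value minus the lowest. The Python sketch below uses the range as an assumed stand-in for "dispersion"; the function name `score_range` is made up for this example.

```python
# Scores copied from the worked calculations for Class A and Class C.
class_a = [70, 70, 71, 74, 78, 79, 81, 82, 83, 88]
class_c = [4, 12, 25, 37, 46, 51, 63, 71, 87, 95]


def score_range(values):
    """Return the gap between the highest and lowest value."""
    return max(values) - min(values)


print(score_range(class_a))  # 18
print(score_range(class_c))  # 91
```

Class A's results span 18 percentage points, while Class C's span 91, which is why a single figure such as the mean says much less about a typical member of Class C.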

There are a few outcomes we can draw from this exercise:

1) For narrow datasets with low dispersion (like Class A, where all the results were quite close to each other), the mean offers an accurate measure of a typical result.

2) For wide datasets with large dispersion (like Class C, where the results were very variable and covered a large range), the mean does not offer an accurate measure of a typical result.

3) Statistical outliers in a dataset can skew the mean quite considerably (like Class B, where all the results were close to each other except for the one student who got 5%), causing it to become a less accurate representation of the dataset as a whole.

4) The mean or average does not represent the “middle” of a dataset. Depending on the dataset, sometimes the mean can be very near the middle (like with Class A), but for other datasets it may not be (like Class B). To find the middle of a dataset, one should calculate the median.
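To compare the mean with the median for the three classes, Python's standard `statistics` module can be used. This is a sketch for illustration only, and the dictionary layout and names are assumptions. For an even number of values, `statistics.median` returns the mean of the two middle values once the data is sorted.

```python
from statistics import mean, median

# Scores copied from the worked calculations for each class.
classes = {
    "A": [70, 70, 71, 74, 78, 79, 81, 82, 83, 88],
    "B": [5, 72, 72, 75, 76, 79, 80, 81, 87, 89],
    "C": [4, 12, 25, 37, 46, 51, 63, 71, 87, 95],
}

# Map each class name to a (mean, median) pair.
summary = {name: (mean(scores), median(scores)) for name, scores in classes.items()}

for name, (avg, mid) in summary.items():
    print(f"Class {name}: mean = {avg}, median = {mid}")
# Class A: mean = 77.6, median = 78.5
# Class B: mean = 71.6, median = 77.5
# Class C: mean = 49.1, median = 48.5
```

For Class A the two figures nearly coincide, while for Class B the outlier drags the mean almost six points below the median. In Class C the mean and median happen to be close, but with results this spread out neither describes a typical pupil well.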

As a final point, remember that a mean must always be read in context. Take for example a standard die:

This has 6 unique sides, numbered 1 through 6. On any one roll, it is equally likely that the die could show any value from 1 to 6, but on average, the die rolls a value of 3.5:

$\begin{aligned}&\text{Mean}=\frac{\text{Sum of possible results}}{\text{Number of possible results}} \\[16pt]&\text{Mean}=\frac{1+2+3+4+5+6}{6} \\[12pt]&\text{Mean}=\frac{21}{6} \\[12pt]&\text{Mean}=3.5\end{aligned}$

This is what I mean when I say a mean must be read in context. Yes, the die produces a value of 3.5 on average, but no single roll can actually show a value of 3.5 (since there is no side with 3.5 dots!). In this case, no single result can be the same as the average value.

Sometimes, it is important to look more closely at the context of a mean calculation to fully understand it. If you looked at the calculation of the mean alone without understanding the limitations of a die (i.e. it can only show whole number values from 1 to 6), then you might think it was actually possible to roll a 3.5!
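The die example can be expressed as a short check: compute the mean of the six faces, then test whether that value is itself a face. This is a minimal Python sketch with illustrative names.

```python
# The six faces of a standard die.
faces = [1, 2, 3, 4, 5, 6]

die_mean = sum(faces) / len(faces)  # 21 / 6

print(die_mean)           # 3.5
print(die_mean in faces)  # False
```

The second line of output is the point of the example: the mean is a property of the whole set of outcomes and need not be a value that any single roll can show.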

Key Outcomes

The mean, or alternatively the “arithmetic average”, is the sum of the values in a dataset divided by the number of values in the dataset.

$\begin{aligned}\text{Mean}=\frac{\text{Sum of values in set}}{\text{Number of values in set}}\end{aligned}$

The purpose of the mean is to calculate a figure which is representative of the dataset as a whole, or representative of the dataset “on average”.

For narrow datasets with low dispersion, the mean offers an accurate measure of a typical result.

For wide datasets with large dispersion, the mean does not offer an accurate measure of a typical result.

Statistical outliers (results which lie outside of the overall pattern of a distribution) in a dataset can skew the mean quite considerably, causing it to become a less accurate representation of the dataset as a whole.

The mean or average does not represent the “middle” of a dataset. Depending on the dataset, sometimes the mean can be very near the middle, but for other datasets it may not be. To find the middle of a dataset, one should calculate the median.

