Averages

Summary

  • Mean, Median, Mode are three principle measures of location. In general, these will not all give the same numerical value for a given dataset/distribution.
  • Mean is defined by the formula:  \frac { \sum { { x }_{ i } } }{ n } \quad =\quad \frac { { x }_{ 1 }\quad +\quad { x }_{ 2 }\quad +\quad ....\quad +\quad { x }_{ n } }{ n } .
  • Mode is the most frequently occuring number.
  • Range is the largest number minus the smallest number.
  • Median is the middle number of an ordered set given by the formula: M\quad =\quad \frac { { x }_{ 1 }\quad +\quad { x }_{ 2 } }{ 2 }

Average can be described as the number that gives us a sense of a central tendency, or a number that is more representative of a set. There are number of ways in which you can measure the central tendency, or a average of a set of numbers. The four basic ones are Mean, Median, Mode and Range.

Mean

Mean is the most frequent measure of central tendency, which is simply the ‘average’ of that data.

In everyday life when someone asks you to take out the average of a set of numbers, they want you to do is to figure out the mean and not median, mode or range. To do that you add the numbers in the set and then divide it by the total number of numbers in the set. Take a look at this simple example to make things more clearer.

Say we have a set of:

[32, 28, 67, 39, 19]

And we are asked to find the mean of the this data.

We add all the numbers together and divide it by the total number of numbers.

In this case it will be:

\frac { 32\quad +\quad 28\quad +\quad 67\quad +\quad 39\quad +\quad 19\quad }{ 5 } \quad =\quad 37

We can thus say that the formula for the mean is:

\frac { \sum { { x }_{ i } } }{ n } \quad =\quad \frac { { x }_{ 1 }\quad +\quad { x }_{ 2 }\quad +\quad ....\quad +\quad { x }_{ n } }{ n }

(Where \sum is known as the sum of)

We also use the group data to compute particular descriptive statistics, specifically the mean. In which we perform the appropriate calculations by groups.

Example #1

Class intervalMidpointFrequencyfx{ fx }^{ 2 }
120 - 130125112515625
130 - 140135454072900
140 - 1501455725105125
150 - 1601556930144150
160 - 17016571155190575
170 - 1801755875153152
180 - 190185118534225
Total, \sum294535715725

Hence , the grouped mean is :

\bar { x } =\quad \frac { \sum { fx } }{ \sum { f } } \quad \quad =\quad \frac { 4535 }{ 29 } \quad =\quad 156.4

Median

Another method for measuring the average, is by finding the middle value of an (ascending) ordered set.

In other words, putting all the numbers in order from the lowest to the highest and taking the middle number is known as median.

So let’s take a look at its example.

Say you have a set of numbers:

[32, 28, 67, 39, 19]

So, for this data set n = 5

We put all the numbers in order from the lowest to the highest:

[19, 28, 32, 39, 67]

Take the middle value, which in this case is an explicit value, so the average of this would be 32.

However if you have a set of numbers where n is an even number in that case we take average of the values either side of the ‘midpoint’

Hence:

M\quad =\quad \frac { { x }_{ 1 }\quad +\quad { x }_{ 2 } }{ 2 }

Example #2

[19, 24, 28, 32, 32, 37, 39, 44, 48]

After putting them order we will take the two middle values, add them together and then divide by 2.

M\quad =\quad \frac { 32\quad +\quad 37 }{ 2 } \quad =\quad 34.5

Similarly, if we have a grouped data, in that case the value median is determined by the following formula:

\frac { n\quad +\quad 1 }{ 2 } th\quad value

From the previous example we know that our n was 10 so putting it in the formula we get:

\frac { 10\quad +\quad 1 }{ 2 } \quad =\quad \frac { 11 }{ 2 } \quad =\quad 5.5th\quad value

By 5.5 we know that there is no explicit value, so from our ordered dataset we take the 5th and the 6th value which are 32 and 37 and put it into the midpoint formula to find our answer.

Another way we might have to find median is through histograms. If you look at the example below you can see that the highest column is 25, put it in our formula for grouped data:

\frac { 25\quad +\quad 1 }{ 2 } \quad =\quad 13, we know now that our median is the 13th value. We then draw a frequency polygon, find the 13th value on the vertical axis and once you hit the polygon line, move down to find the score.

This is the median

Range

To find the range of a set of data, all you have to do is minus the smallest number in the set by the largest number in the set.

Example #3

The Jaeger family drove through 6 midwestern states on their summer vacation. Gasoline prices varied from state to state. What is the range of gasoline prices?

[$1.79, $1.61, $1.96, $2.09, $1.84, $1.75]

First of all we have to sort the data out into an ascending order, from the greatest to smallest, so we get:

[$1.61, $1.75, $1.79, $1.84, $1.96, $2.09]

Then we take the highest number which is 2.09 and minus it from the smallest number which is 1.61 to find the range:

$2.09 – $1.61 = $0.48

Mode

The mode of a data set is the number that occurs most frequently in the set. To easily find the mode, put the numbers in order from least to greatest and count how many times each number occurs.

The number that occurs the most is the mode.

Example:

[2, 5, 5, 7, 7, 7, 8, 8, 9]

since 7 is the most occuring number in this dataset, that will be the mode.

Similarly , on a histogram, in order to find the mode you just have to see which class division is the tallest. Like the in the example below, you can see the the tallest interval is 4, so the mode will be 4.

Refrences

  1. https://www.mathgoodies.com/lessons/vol8/range
  2. https://www.google.com.pk/search?q=median+on+a+histogram&rlz=1C1AVNG_enGB624GB668&source=lnms&tbm=isch&sa=X&ved=0ahUKEwjY9afvmonfAhVxSRUIHYDGBZoQ_AUIDigB&biw=1164&bih=598#imgrc=JiBvBdtn3jZNdM:
  3. Statistics 1 by J.S Abdey