Quartiles

Quarters are a significant part of everyday life. Whether you're referring to the coins jingling in your pocket or the way you're slicing up a cake, quarters are likely to make an appearance. The separation of objects and data into four parts is an inescapable part of mathematics. One aspect of statistics where some kind of quarter is involved is the finding of quartiles. So read on to find out how quartiles and quarters are related!

Get started

Millions of flashcards designed to help you ace your studies

Sign up for free

Need help?
Meet our AI Assistant

Upload Icon

Create flashcards automatically from your own documents.

   Upload Documents
Upload Dots

FC Phone Screen

Need help with
Quartiles?
Ask our AI Assistant

Review generated flashcards

Sign up for free
You have reached the daily AI limit

Start learning or create your own AI flashcards

StudySmarter Editorial Team

Team Quartiles Teachers

  • 10 minutes reading time
  • Checked by StudySmarter Editorial Team
Save Article Save Article
Contents
Contents

Jump to a key chapter

    Definition of a Quartile

    Suppose you have a set of observations, and you want to find the quartiles. You might think that since quartiles and quarters share the same root word (quart, meaning one fourth) that the quartile is just one quarter of your data. Not quite, but close!

    A quartile is a division of observations into four defined intervals based on the values of the observations and how they compare to the rest of the set of observations.

    That sounds a lot more complicated than it is. To put it simply, the quartiles mark off \(25\%\) of the data, and so the quartiles are often named for the percentages:

    • the first quartile, \(Q_1\), is the \(25^{\text{th}}\) percentile;

    • the second quartile, \(Q_2\), is the \(50^{\text{th}}\) percentile; and

    • the third quartile, \(Q_3\), is the \(75^{\text{th}}\) percentile.

    The second quartile is always the median of your data set.

    If you have an odd number of observations in your data set, exclude the median when finding the quartiles!

    Let's look at a quick example.

    Suppose you have the data set \(\{43, 52, 68, 79, 94, 100, 113\}\). Find the first, second, and third quartile.

    Solution:

    The observations in the data set are already ordered, which is handy! There are \(n=7\) observations, which is an odd number, so you will need to exclude the median of the set when calculating the quartiles. In this case the median is \(79\), so that is what you would exclude.

    The lower half of the data set would be \(\{43, 52, 68\}\), and the median of that data set is \(52\), so the first quartile is \(Q_1 = 52\).

    The second quartile is the median (which is the point you excluded since there are an odd number of data points), so the second quartile is \(Q_2 = 79\).

    The upper half of the data set would be \(\{94, 100, 113\}\), and the median of that data set is \(100\), so the third quartile is \(Q_3 = 100\).

    Let's look at some more examples of how graphs, medians, and quartiles are related.

    Median and Quartiles

    If you have a normal distribution the mean and the median are the same. Since the graph of the normal distribution is symmetric, the second quartile is in the center of the graph. Normal distributions also have the properties that for the mean \(\mu\) and standard deviation \(\sigma\):

    • \(Q_1 = \mu - 0.675\sigma \);

    • \(Q_2 = \mu\); and

    • \(Q_3 = \mu + 0.675\sigma \).

    You might wonder how someone came up with that formula. Remember for the standard normal distribution, has \(\mu=0\) and \(\sigma = 1\). In addition, the \(z\)-score that gives you \(25\%\) of the area under the curve between \(0\) and \(z\) is \(z=0.675\). So all the above formulas do is shift the standard normal distribution information to a normal distribution that has a different mean and standard deviation.

    Quartiles A normal distribution graph with lines indicating where its quartile values are located. There are three lines, dividing the curve into four quarters. Each line represents a quartile value. The quartiles run from lower quartile, to median to the upper quartile StudySmarter Originals

    Fig. 1 - Quartiles for a normal distribution graph.

    If you are looking at a box plot it is relatively easy to see where the quartiles are by looking at the plot. A box plot is designed to show you exactly where the quartiles are.

    Quartiles Diagram showing where the quartile values are located on a box plot. The lower quartile makes up the left boundary of the ‘box’ and the upper quartile makes up the right boundary. The second quartile is the median. StudySmarter OriginalsFig. 2 - Box plot with the upper and lower quartiles marked along with the median.

    What if you have skew data? In the graph below the data is skewed left. You can still mark the quartiles, and they will correspond to the median of the data, not the mean.

    Quartiles skew data showing the median is not located in the center of the data set StudySmarterFig 3. - Skew data showing the quartiles.

    Types of Quartile

    There are two main types of quartiles, the upper quartile, and the lower quartile. Both of them depend on having an ordered set of \(n\) observations.

    The upper quartile is the median of the upper half of the data set.

    The lower quartile is the median of the lower half of the data set.

    If \(n\) is odd, the median of the entire data set is excluded when calculating the upper and lower quartiles.

    Remember that the median is more resistant to outliers or skew in your data. That is why the upper and lower quartiles use the median and not the mean in their calculations.

    Let's look at a quick example.

    Suppose you have the data set as in the previous example, \(\{43, 52, 68, 79, 94, 100, 113\}\). Find the upper and lower quartiles.

    Solution:

    The lower half of the data set would be \(\{43, 52, 68 \}\), and the median is \(52\), so the lower quartile would be \(52\). Notice that this is the same as the \(Q_1\).

    The upper half of the data set would be \(\{94, 100, 113\}\), and the median is 100, so the upper quartile is \(100\), which is the same as \(Q_3\).

    Will it always be the case that the upper quartile is the same as \(Q_3\) and the lower quartile is the same as \(Q_1\)? Yes, because that is the way they are defined! Just be sure let's look at another example.

    Take the data set \(\{6, 8, 15, 36, 40, 41, 41, 43, 43, 48\}\). Find \(Q_1\), \(Q_2\), and \(Q_3\), along with the upper and lower quartiles.

    Solution:

    For this data set there are an even number of points, so none will need to be set aside when finding the quartiles. The points are already ordered, and the median of the set is

    \[\text{median} = \frac{40+41}{2} = 40.5 = Q_2.\]

    The lower half of the data set is \(\{6, 8, 15, 36, 40\}\), and the median of that set is \(15\). So

    \[\text{lower quartile} = 15 = Q_1.\]

    The upper half of the data set is \(\{41, 41, 43, 43, 48\}\), and the median of that set is \(43\). So

    \[\text{upper quartile} = 43 = Q_3.\]

    Quartile Range

    The range of your data set is relatively simple to find.

    The range of the data set is the highest value minus the lowest value.

    The interquartile range is another term you might come across.

    The interquartile range, often abbreviated IQR, is the difference between the upper and lower quartiles.

    IQR = upper quartile - lower quartile.

    Remember that the standard deviation is a measure of how spread out the data in a set is; in other words, it measures how far the observations tend to fall from the center of the data set (the mean).

    The interquartile range is also a measure of how spread out the data in a set is by looking at the middle half of the data:

    • if the interquartile range is small, the middle half of the data is tightly clustered, indicating low variability; but

    • if the interquartile range is large, the middle half of the data is more spread out, indicating higher variability.

    By looking at the middle half of the data, the interquartile range avoids being influenced by outliers or extreme values.

    You might also see a reference to the semi-interquartile range.

    The semi-interquartile range is half of the interquartile range.

    Let's take a look at an example of finding these items for a data set.

    Let's go back to the data set \(\{43, 52, 68, 79, 94, 100, 113\}\). You already know that \(Q_1 = 52\), \(Q_2 = 79\), and \(Q_3 = 100\). Find the range, the interquartile range, and the semi-interquartile range.

    Solution:

    The range is the highest value in the set minus the lowest value in the set, so

    \[\text{range} = 113-43 = 70.\]

    The interquartile range is

    \[\begin{align}\text{IQR} &= Q_3 - Q_1 \\ &= 100-52\\ &=48. \end{align}\]

    The semi-interquartile range is half of the IQR, so

    \[\text{semi-interquartile range} = \frac{48}{2}=24.\]

    More examples are always good!

    Quartile Examples

    Let's take a look at some more examples involving quartiles.

    Suppose your data set is \(\{0,0,0,0,0,0,0,0,0,0,1,1,1,3, 17, 300\}\). Find the first, second, and third quartiles, along with the range and interquartile range.

    Solution:

    This data set is a bit unusual since \(0\) is repeated \(10\) times in it! The range is the largest value minus the largest value, so

    \[\text{range} = 100 - 0 = 100.\]

    There are \(16\) observations in the data set, and the median is \(0\). So \(Q_2 = 0\). In fact, the lower half of the data set is all zeros, so \(Q_1=0\) as well.

    For the upper half of the data set you have \(\{0,0,1,1,1,3, 17, 300\}\), and the median of that set is \(1\), so \(Q_3=1\). Finding the IQR gives you

    \[\text{IQR} = Q_3-Q_1 = 1-0=1.\]

    The low value for IQR indicates that the middle half of the data doesn't have much variability. You can list the quartiles of the data:

    • lowest \(25\%\) of observations: \(\{0,0,0,0\}\);
    • highest \(25\%\) of observations: \(\{1, 3, 17, 300\}\); and
    • middle \(50\%\) of observations: \(\{0, 0, 0, 0,0,0, 1, 1\}\).

    As you can see the middle \(50\%\) of the observations are clustered around zero, which is why the IQR is so small. It isn't affected by the extreme values in the highest \(25\%\) of the observations.

    The next example demonstrates how you can use box plots to determine the quartiles and calculate the interquartile and semi-interquartile ranges.

    The box plot below shows the distribution of ages of pets in a doggie daycare where age is measured in years. Using the information presented on the box plot, calculate the interquartile range as well as the semi-interquartile range.

    Quartiles Box plot showing the distribution of children's ages at Sally's Daycare. The left boundary of the box coincides with 5.5 on the axis below the plot and the right boundary of the box coincides with 10 on the axis. The median coincides with 8.5 on the axis. The left "whisker" ends at 2 and the right "whisker" ends at 14 StudySmarter OriginalsFig. 4 - Box plot of pets in a doggie daycare.

    Solution:

    First, you need to find the values of \(Q_1\) and \(Q_3\). This can be done by reading them off the box plot:

    • \(Q_1 = 5.5\); and
    • \(Q_3 = 10\).

    Next, calculate the interquartile range using the formula:

    \[ \begin{align} IQR & = Q_3 - Q_1 \\ &= 10-5.5 \\ &= 4.5. \end{align}\]

    Finally, calculate the semi-interquartile range:

    \[ \begin{align} \text{semi-interquartile range} &= \frac{IQR}{2} \\&= \frac{4.5}{2} \\&= 2.25.\end{align}\]

    The units on each of the answers are years.

    Quartiles - Key takeaways

    • Each quartile represents \(25\%\) of the values of a set of ordered data.
    • The lower quartile is denoted by \(Q_1\) and the upper quartile is denoted by \(Q_3\). The second quartile is denoted by \(Q_2\).
    • To calculate \(Q_2\), calculate the median of the data set.
    • The upper and lower quartiles are calculated by determining the medians of the upper and lower halves of the data set respectively. If there is an odd number of values in the data set, then the median is excluded when calculating the upper and lower quartiles.
    • The interquartile range is equal to \(Q_3 - Q_1\) and the semi-interquartile range is equal to half the interquartile range.
    Frequently Asked Questions about Quartiles

    What is a quartile? 

    A quartile represents 25% of the values in an ordered set of data.

    How to find the upper quartile? 

    The upper quartile is median of the upper 50% of your data.  Sort your data into ascending order, and take the median of the highest 50% to find the upper quartile.

    Is the second quartile the median? 

    Yes.  The median of your data set is by definition the second quartile. 

    What is a lower quartile? 

    The lower quartile is the median of the lower 50% of your data when it is ordered. 

    How to find lower quartile? 

    The lower quartile is median of the lower 50% of your data.  Sort your data into ascending order, and take the median of the lowest 50% to find the lower quartile.

    Save Article

    Test your knowledge with multiple choice flashcards

    True or False: The interquartile range is calculated by subtracting the smallest data value in a set from the largest.

    The semi-interquartile range is equal to _____.

    True or False: The second quartile is the median of a set of data.

    Next

    Discover learning materials with the free StudySmarter app

    Sign up for free
    1
    About StudySmarter

    StudySmarter is a globally recognized educational technology company, offering a holistic learning platform designed for students of all ages and educational levels. Our platform provides learning support for a wide range of subjects, including STEM, Social Sciences, and Languages and also helps students to successfully master various tests and exams worldwide, such as GCSE, A Level, SAT, ACT, Abitur, and more. We offer an extensive library of learning materials, including interactive flashcards, comprehensive textbook solutions, and detailed explanations. The cutting-edge technology and tools we provide help students create their own learning materials. StudySmarter’s content is not only expert-verified but also regularly updated to ensure accuracy and relevance.

    Learn more
    StudySmarter Editorial Team

    Team Math Teachers

    • 10 minutes reading time
    • Checked by StudySmarter Editorial Team
    Save Explanation Save Explanation

    Study anywhere. Anytime.Across all devices.

    Sign-up for free

    Sign up to highlight and take notes. It’s 100% free.

    Join over 22 million students in learning with our StudySmarter App

    The first learning app that truly has everything you need to ace your exams in one place

    • Flashcards & Quizzes
    • AI Study Assistant
    • Study Planner
    • Mock-Exams
    • Smart Note-Taking
    Join over 22 million students in learning with our StudySmarter App
    Sign up with Email