Comparing Data

You have probably already come across methods of analysing and interpreting data in given data distributions. In many real-world applications, we are required to compare information between multiple data sets. Let's look at how to compare data between data distributions.

Last Updated: 27.02.2023

Comparing data distributions

When comparing multiple data distributions, you can comment on:

A measure of location – a single value that summarises an entire data set. The mean and the median are measures of location.
A measure of spread – a value that describes the variability of the data, i.e. how close together or far apart the points in a data set are. The standard deviation and the interquartile range are measures of spread.

You can compare different data distributions using the mean and standard deviation, or using the median and interquartile range. When data sets contain extreme values and/or outliers, the median and interquartile range are usually more appropriate, because extreme values pull the mean towards them and inflate the standard deviation.

Do not use the median and standard deviation together or the mean and interquartile ranges together.
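The effect of an extreme value on each pair of measures can be checked with a short Python sketch. The two lists below are hypothetical data made up for illustration, and the `summarise` helper is not part of the original material. It uses the population standard deviation (dividing by n), and the "inclusive" quartile method from Python's `statistics` module is an assumption; other quartile conventions can give slightly different values.

```python
import statistics


def summarise(data):
    """Return the mean, standard deviation, median and interquartile range."""
    # quantiles() with n=4 returns the three cut points Q1, Q2 (median), Q3.
    q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "mean": statistics.mean(data),
        "sd": statistics.pstdev(data),  # population SD (divides by n)
        "median": statistics.median(data),
        "iqr": q3 - q1,
    }


# Hypothetical data: the second list replaces the largest value with an outlier.
typical = [12, 14, 15, 15, 16, 17, 18]
with_outlier = [12, 14, 15, 15, 16, 17, 95]

a = summarise(typical)
b = summarise(with_outlier)

for name in ("mean", "sd", "median", "iqr"):
    print(f"{name:>6}: typical = {a[name]:.2f}, with outlier = {b[name]:.2f}")
```

In this made-up example the median and interquartile range are identical for both lists, while the mean and standard deviation change substantially once the outlier is present.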

Let's explore the concept further with the help of examples.

Comparing mean and standard deviations of data sets

The daily mean temperatures during August (31 days) are recorded at Heathrow and Leeming. For Heathrow, ∑x=562 and ∑x²=10301.2. For Leeming, the mean temperature was 15.6°C with a standard deviation of 2.01°C.

a) Calculate the mean and standard deviation for Heathrow. b) Compare the data for Heathrow with that of Leeming.

Solutions

a) For Heathrow,

$\begin{align} \text{mean} &= \frac{\sum{x}}{n} \\ &= \frac{562}{31} = 18.1^{\circ}\text{C} \end{align}$

$\text{standard deviation} = \sqrt{\frac{\sum{x^2}}{n} - \left(\frac{\sum{x}}{n}\right)^2} = \sqrt{\frac{10301.2}{31} - \left(\frac{562}{31}\right)^2} = 1.91^{\circ}\text{C}$

b) The mean temperature at Heathrow during August (18.1°C) was higher than at Leeming (15.6°C), and the spread of temperatures at Heathrow (standard deviation 1.91°C) was slightly smaller than at Leeming (2.01°C).
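The same calculation from summary totals can be sketched in Python. The function name `mean_and_sd` is hypothetical, and the sketch assumes the standard deviation divides by n, as in the formula above.

```python
import math


def mean_and_sd(n, sum_x, sum_x2):
    """Mean and standard deviation from n, Σx and Σx² (dividing by n)."""
    mean = sum_x / n
    variance = sum_x2 / n - mean ** 2
    return mean, math.sqrt(variance)


# Heathrow: 31 daily readings in August, totals taken from the question.
heathrow_mean, heathrow_sd = mean_and_sd(31, 562, 10301.2)

# Leeming's mean and standard deviation are given directly in the question.
leeming_mean, leeming_sd = 15.6, 2.01

print(f"Heathrow: mean = {heathrow_mean:.1f}, sd = {heathrow_sd:.2f}")
# prints "Heathrow: mean = 18.1, sd = 1.91"
print("Higher mean:", "Heathrow" if heathrow_mean > leeming_mean else "Leeming")
print("Greater spread:", "Heathrow" if heathrow_sd > leeming_sd else "Leeming")
```

Working from ∑x and ∑x² means the individual daily readings are not needed, which is why exam questions often supply only these totals.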

A company collects the delivery times in minutes for suppliers A and B for a period of 20 days. The following is the result of the data collected. Compare the performance of the two suppliers.

Supplier	∑x	∑x²
A	360	18000
B	300	29000

Solutions

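For supplier A,

$\begin{align} \text{mean}_A &= \frac{\sum x}{n} = \frac{360}{20} = 18 \text{ minutes} \\ \text{standard deviation}_A &= \sqrt{\frac{\sum x^2}{n} - \left(\frac{\sum x}{n}\right)^2} = \sqrt{\frac{18000}{20} - 18^2} = \sqrt{576} = 24 \text{ minutes} \end{align}$

For supplier B,

$\begin{align} \text{mean}_B &= \frac{\sum x}{n} = \frac{300}{20} = 15 \text{ minutes} \\ \text{standard deviation}_B &= \sqrt{\frac{\sum x^2}{n} - \left(\frac{\sum x}{n}\right)^2} = \sqrt{\frac{29000}{20} - 15^2} = \sqrt{1225} = 35 \text{ minutes} \end{align}$

From the above information, we see that supplier A has a longer mean delivery time (18 minutes against 15 minutes), while supplier B has a greater spread in delivery times (a standard deviation of 35 minutes against 24 minutes).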

Consider the above example in a real-world context. If the company wants to keep one of its suppliers and let go of the other, it could compare the above data just like we have. If the priority of the company is to reduce delivery times on average, it would favour supplier B. If the priority on the other hand is greater reliability, it would favour the supplier with less variability, and that would be supplier A.

Comparing median and interquartile range of data sets

The students of two different sections sit an exam. The quartiles and median marks of each section are provided. Compare the performance of the two sections.

Section	$Q_{1}$	median	$Q_{3}$
Section 1	58	71	87
Section 2	62	74	83

Solutions

The interquartile range for Section 1 = Q₃ - Q₁ = 87 - 58 = 29

The interquartile range for Section 2 = Q₃ - Q₁ = 83 - 62 = 21

From the given data, we see that the median mark is higher for Section 2 (74 against 71), while the variability of marks is higher in Section 1 (an interquartile range of 29 against 21).
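The comparison of the two sections can be written as a small Python sketch. The `QuartileSummary` class is a hypothetical helper introduced here for illustration; the quartile values are those given in the table.

```python
from dataclasses import dataclass


@dataclass
class QuartileSummary:
    """Lower quartile, median and upper quartile of one data set."""
    q1: float
    median: float
    q3: float

    @property
    def iqr(self) -> float:
        # Interquartile range = Q3 - Q1
        return self.q3 - self.q1


section_1 = QuartileSummary(q1=58, median=71, q3=87)
section_2 = QuartileSummary(q1=62, median=74, q3=83)

# Location: compare the medians. Spread: compare the interquartile ranges.
higher_median = "Section 2" if section_2.median > section_1.median else "Section 1"
more_variable = "Section 1" if section_1.iqr > section_2.iqr else "Section 2"

print(f"IQR: Section 1 = {section_1.iqr}, Section 2 = {section_2.iqr}")
print(f"Higher median: {higher_median}; greater variability: {more_variable}")
```

Keeping the median and interquartile range together in one summary reflects the pairing rule stated earlier: the median is reported with the interquartile range, not with the standard deviation.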

A company collects the delivery times for two suppliers, A and B, over a period of 20 days. The median delivery time was 4 hours for supplier A and 3 hours for supplier B. The interquartile range was 0.8 hours for supplier A and 1.5 hours for supplier B.

Compare the performance of the suppliers in terms of speed and reliability.

Solutions

Supplier B appears to be faster, with a lower median delivery time (3 hours against 4 hours). Supplier A appears to be more reliable, with a lower spread in delivery times (an interquartile range of 0.8 hours against 1.5 hours).

Comparing Data - Key takeaways

In many real-world applications we are required to compare information between multiple data sets.
When comparing multiple data distributions, you can comment on:
- a measure of location
- a measure of spread
You can compare different data distributions using the mean and standard deviation, or using the median and interquartile ranges.


Frequently Asked Questions about Comparing Data

Why are bar graphs useful for comparing data?

Bar graphs allow you to easily visualise the measures of location and spread.

What is the importance of comparing data?

In many real-world applications, we are required to compare information between multiple data sets to make better-informed decisions.


How we ensure our content is accurate and trustworthy?

At StudySmarter, we have created a learning platform that serves millions of students. Meet the people who work hard to deliver fact based content as well as making sure it is verified.

Content Creation Process:

Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.


Content Quality Monitored by:

Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.

