Curriculum
Mean Median Mode are the most commonly used measures of central tendency in Statistics, Data Analytics, Data Science, Business Analytics, Machine Learning, Artificial Intelligence, and Business Intelligence. These statistical measures help summarize large datasets by identifying their central or typical value.
Organizations use Mean Median Mode to understand customer behavior, analyze business performance, evaluate sales trends, measure employee productivity, and support decision-making.
Mean Median Mode are widely used in:
Understanding Mean Median Mode is essential because they provide a quick overview of data distribution and help analysts interpret datasets efficiently.
Central Tendency refers to the statistical concept of identifying the center or typical value of a dataset.
The three main measures of central tendency are:
These measures help summarize large amounts of data into a single representative value.
Large datasets can be difficult to interpret.
Mean Median Mode help:
Benefits include:
These measures are fundamental to Data Analytics.
The Mean is the average value of a dataset.
To calculate the Mean:
Formula:
xˉ=∑x/n​
Where:
Dataset:
10, 20, 30, 40, 50
Calculation:
(10 + 20 + 30 + 40 + 50) / 5
Result:
30
The Mean is:
30
Applications:
Example:
import numpy as np
data = [
10,
20,
30,
40,
50
]
print(
np.mean(data)
)
Output:
30.0
Applications:
Data Analytics.
Business Reporting.
Benefits include:
Applications:
Business Intelligence.
Machine Learning.
Mean is sensitive to extreme values.
Example:
Dataset:
10, 20, 30, 40, 500
Mean:
120
The value 500 significantly affects the result.
This is called the effect of an outlier.
The Median is the middle value in a sorted dataset.
Steps:
The Median is less affected by outliers.
Dataset:
10, 20, 30, 40, 50
Middle value:
30
Median:
30
Applications:
Income Analysis.
Customer Analytics.
Dataset:
10, 20, 30, 40
Middle values:
20 and 30
Calculation:
(20 + 30) / 2
Result:
25
Median:
25
Example:
import numpy as np
data = [
10,
20,
30,
40,
50
]
print(
np.median(data)
)
Output:
30.0
Applications:
Financial Analytics.
Market Research.
Benefits include:
Applications:
Business Analytics.
Economic Studies.
Median does not use every value in the dataset.
Some information may be ignored.
The Mode is the value that occurs most frequently in a dataset.
Unlike Mean and Median, a dataset may have:
Dataset:
10, 20, 20, 30, 40
Most frequent value:
20
Mode:
20
Applications:
Customer Preferences.
Product Demand Analysis.
Dataset:
10, 10, 20, 20, 30
Modes:
10 and 20
This is called a bimodal dataset.
Example:
from scipy import stats
data = [
10,
20,
20,
30,
40
]
print(
stats.mode(data)
)
Output:
ModeResult(
mode=20
)
Applications:
Customer Analytics.
Retail Analytics.
Benefits include:
Applications:
Market Analysis.
Product Popularity Studies.
A dataset may have:
This can make interpretation difficult.
| Measure | Description |
|---|---|
| Mean | Average Value |
| Median | Middle Value |
| Mode | Most Frequent Value |
Applications:
Data Summarization.
Business Reporting.
Dataset:
10, 20, 20, 30, 100
Mean:
36
Median:
20
Mode:
20
Observation:
The outlier affects the Mean but not the Median.
Choosing the correct measure improves analytical accuracy.
Data Analysts use Mean Median Mode for:
Benefits:
Better business insights.
Business Analysts use Mean Median Mode for:
Benefits:
Improved decision-making.
Machine Learning projects use Mean Median Mode for:
Benefits:
Improved model performance.
Example:
import numpy as np
from scipy import stats
sales = [
10000,
15000,
15000,
20000,
25000
]
print(
np.mean(sales)
)
print(
np.median(sales)
)
print(
stats.mode(sales)
)
Applications:
Sales analysis.
Revenue reporting.
Can distort Mean values.
May produce misleading results.
Can lead to incorrect conclusions.
Can affect analysis quality.
Avoiding these mistakes improves analytical accuracy.
Understand distribution.
Improve reliability.
Gain better insights.
Improve decision-making.
Ensure accuracy.
These practices support professional analytics.
Benefits include:
Mean Median Mode are essential skills for Data Analysts and Data Scientists.
After completing this lesson, you will be able to:
Mean is the average value of a dataset.
Median is the middle value in a sorted dataset.
Mode is the most frequently occurring value.
Mean is affected by outliers.
Median is resistant to outliers.
Yes. A dataset can have more than one mode.
They summarize large datasets and help analysts understand data quickly.
They provide foundational statistical insights used in data analysis and machine learning.
Want to master Python, SQL, Power BI, and Data Analytics?
WhatsApp us