Blog

Central Tendency: Sample Mean and Population Mean

What are sample mean and population mean?

Sample mean and Population mean are two key statistics used to measure the central tendency within a dataset. While they seem similar, they represent averages from different sources of data:

Sample Mean (x̄)

This represents the average value calculated from a subset (sample) of a larger population. It’s an estimate of the true population mean, as collecting data from the entire population can be impractical or impossible.

x̄ = Σ(xi) / n

Where:

  • Σ (sigma) represents the sum of all elements.
  • xi (x-subscript-i) represents each individual value in the sample data set.
  • n represents the total number of elements in the sample.

Population Mean (μ)

This represents the average value of all elements in a population. It’s the true average, but often not directly obtainable due to limitations in data collection.

μ = Σ(xi) / N

Where:

  • Σ (sigma) represents the sum of all elements.
  • xi (x-subscript-i) represents each individual value in the population data set.
  • N represents the total number of elements in the population (ideally, all elements would be included).

Example

Consider a dataset representing the heights (in cm) of 5 students (sample): {170, 165, 182, 178, 159}.

  • Sample Mean:
x̄ = (170 + 165 + 182 + 178 + 159) / 5
x̄ = 854 / 5
x̄ = 170.8 cm
  • (Ideally) Population Mean, if you had data for all 10 students:
μ = (total height of all 10 students) / 10

Analogy: Measuring Student Grades

Imagine a school with 1000 students (population). Calculating the grades of every student to find the average score would be a massive undertaking. Instead, we can randomly select a smaller group of students (sample), say 50, and calculate their average grade (sample mean). This sample mean provides an estimate of how the entire student body might have performed (population mean).

Key Differences and When to Use Each:

FeatureSample Mean (x̄)Population Mean (μ)
RepresentsAverage of a sampleAverage of the entire population
Calculation MethodSum of sample values divided by sample size (n)Sum of all population values divided by population size (N)
Use CaseEstimate population mean when complete data collection is not feasible.Ideal scenario when data for the entire population is available.

Calculating Sample Mean and Population Mean in Python:

Here’s a Python code example demonstrating how to calculate both sample mean, and population mean:

Sample Mean

sample_data = [85, 72, 90, 88, 65, 78, 92, 83, 87, 69, 80, 75, 95, 82, 70, 89, 68, 98, 81, 71, 91, 84, 77, 94, 86, 73, 97, 80, 74, 93, 67, 99, 79, 76, 90, 88, 66, 96, 72] 

population_data = sample_data 

sample_mean = sum(sample_data) / len(sample_data)

print("Sample Mean:", sample_mean)
Sample Mean: 81.64

Population Mean

population_data = [172, 168, 180, 176, 161, 173, 178, 166, 170, 183, 164, 174, 186, 170, 158, 175, 160, 176, 181, 165, 171, 167, 179, 174, 162, 172, 178, 163, 170, 182, 160, 173, 185, 169, 157, 174, 159, 175, 180, 164, 169, 165, 177, 172, 158, 170, 176, 161, 171, 177, 162, 168, 180, 167, 172, 178, 164, 170, 183, 159, 171, 184, 166]

population_mean = sum(population_data) / len(population_data)

print("Population Mean (if available):", population_mean)
Population Mean (height in cm): 171.24

This example highlights the distinction between sample mean and population mean using different datasets. Remember that in real-world scenarios, you might not always have access to complete population data, making sample means crucial for drawing inferences about the larger population.

By understanding the distinction between sample mean and population mean, you can effectively interpret and utilize central tendency measures in your data analysis tasks.

We hope you found the information helpful! If you learned something valuable, consider sharing it with your friends, family, and social networks.


Reference: Khan Academy

Also Read:

Spread the love

View Comments

Recent Posts

SQL Interview Question at Zomato for a Data analyst Position (0-3 Years) – | Shared By An Experienced Data Analyst

SQL Interview Question at Zomato: These questions were recently asked in interview at Zomato, you…

5 days ago

The Ultimate Guide to SQL Indexing and Query Optimization

Introduction: SQL Indexing and Query Optimization SQL indexing is a critical concept that can drastically…

2 weeks ago

SQL Interview Questions Asked In Walmart For Data Analyst Post | CTC – 18 LPA | Learn With Curious Club!!

This article is about the SQL Interview Questions asked by Walmart for their Data Analyst…

2 weeks ago

SQL Interview Questions for Deloitte Data Engineer Roles: Your Ultimate Prep Guide

You must be able to answer these SQL Interview Questions if you are applying for…

3 weeks ago

Data Analyst SQL Interview Questions | EY (Ernst & Young) | Shared By An Experienced Data Analyst

This article tackles common SQL Interview Questions asked by EY, offering detailed solutions and explanations…

3 weeks ago

1164 Product Price at a Given Date

1164. Product Price at a Given Date: Learn how to track and select price from…

1 month ago