Gaussian Distribution (Normal Distribution)
The Normal distribution, also called the Gaussian distribution, is the most important distribution in statistics and machine learning.
We all would have heard, during our yearly performance review that management has adjusted your rating to fit in a belt curve. Below is that bell curve. π€£ βthis is the shape of the normal distribution!

1. What is the Normal Distribution?
A normal distribution is a continuous probability distribution that is symmetric about its mean, showing that data near the mean are more frequent in occurrence than data far from the mean. In graph form, the normal distribution will appear as a bell curve.
2. Key Properties
- Random Variable (
): Can take any real value from to . - Symmetry: The distribution is perfectly symmetric about the mean.
- Parameters:
- Mean (
): The center of the distribution (also the median and mode). - Variance (
): Measures the spread of the distribution. - Standard Deviation (
): The square root of the variance.
- Mean (
- Probability Density Function (PDF):
3. Visualizing the Parameters
-
Mean (
): Determines the location of the peak (center) of the curve. - Increasing
shifts the curve to the right. - Decreasing
shifts the curve to the left.

- Increasing
-
Standard Deviation (
): Controls the width (spread) of the curve. - Small
: Narrow, tall curve. - Large
: Wide, flat curve.

- Small
4. The Empirical Rule (68-95-99.7 Rule)
This rule tells us how data is distributed in a normal distribution:
- 68% of values fall within 1 standard deviation of the mean.
- 95% of values fall within 2 standard deviations of the mean.
- 99.7% of values fall within 3 standard deviations of the mean.
5. Standard Normal Distribution
The standard normal distribution is a special case where:
- The random variable is called
.
The PDF becomes:
where

6. Example: Heights of Students
Suppose the heights of students in a school are normally distributed with a mean (
(a) What is the probability that a randomly chosen student is taller than 185 cm?
Step 1: Convert to z-score
Step 2: Find the probability using the standard normal table
- From the z-table,
- So,
Final Answer:
There is a 6.68% chance that a randomly chosen student is taller than 185 cm.
(b) What percentage of students are between 160 cm and 180 cm?
Step 1: Convert both values to z-scores
- For 160 cm:
- For 180 cm:
Step 2: Find probabilities from the z-table
Step 3: Subtract to find the probability between
Final Answer:
About 68.26% of students are between 160 cm and 180 cm tall.
- Many natural phenomena (heights, test scores, measurement errors) follow a normal distribution.
- The Central Limit Theorem states that the sum (or average) of many independent random variables tends toward a normal distribution, even if the original variables themselves are not normally distributed.
- Used in hypothesis testing, confidence intervals, and many machine learning algorithms.