Gaussian Distribution (Normal Distribution)

The Normal distribution, also called the Gaussian distribution, is the most important distribution in statistics and machine learning.

We all would have heard, during our yearly performance review that management has adjusted your rating to fit in a belt curve. Below is that bell curve. 🤣 —this is the shape of the normal distribution!

1. What is the Normal Distribution?

A normal distribution is a continuous probability distribution that is symmetric about its mean, showing that data near the mean are more frequent in occurrence than data far from the mean. In graph form, the normal distribution will appear as a bell curve.

2. Key Properties

Random Variable ( $X$ ): Can take any real value from $- \infty$ to $+ \infty$ .
Symmetry: The distribution is perfectly symmetric about the mean.
Parameters:
- Mean ( $μ$ ): The center of the distribution (also the median and mode).
- Variance ( $σ^{2}$ ): Measures the spread of the distribution.
- Standard Deviation ( $σ$ ): The square root of the variance.
Probability Density Function (PDF): $f (x; μ, σ) = \frac{1}{σ \sqrt{2 π}} e^{- \frac{(x - μ)^{2}}{2 σ^{2}}}, - \infty < x < + \infty$

3. Visualizing the Parameters

Mean ( $μ$ ): Determines the location of the peak (center) of the curve.
- Increasing $μ$ shifts the curve to the right.
- Decreasing $μ$ shifts the curve to the left.
Standard Deviation ( $σ$ ): Controls the width (spread) of the curve.
- Small $σ$ : Narrow, tall curve.
- Large $σ$ : Wide, flat curve.

4. The Empirical Rule (68-95-99.7 Rule)

This rule tells us how data is distributed in a normal distribution:

68% of values fall within 1 standard deviation of the mean.
95% of values fall within 2 standard deviations of the mean.
99.7% of values fall within 3 standard deviations of the mean.

5. Standard Normal Distribution

The standard normal distribution is a special case where:

$μ = 0$
$σ = 1$
The random variable is called $Z$ .

The PDF becomes:

f (z) = \frac{1}{\sqrt{2 π}} e^{- \frac{z^{2}}{2}}

where $z = \frac{x - μ}{σ}$ is called the z-score.

6. Example: Heights of Students

Suppose the heights of students in a school are normally distributed with a mean ( $μ$ ) of 170 cm and a standard deviation ( $σ$ ) of 10 cm.

(a) What is the probability that a randomly chosen student is taller than 185 cm?

Step 1: Convert to z-score

z = \frac{x - μ}{σ} = \frac{185 - 170}{10} = 1.5

Step 2: Find the probability using the standard normal table

$P (X > 185) = P (Z > 1.5)$
From the z-table, $P (Z < 1.5) \approx 0.9332$
So, $P (Z > 1.5) = 1 - 0.9332 = 0.0668$

Final Answer:
There is a 6.68% chance that a randomly chosen student is taller than 185 cm.

(b) What percentage of students are between 160 cm and 180 cm?

Step 1: Convert both values to z-scores

For 160 cm: $z_{1} = \frac{160 - 170}{10} = - 1$
For 180 cm: $z_{2} = \frac{180 - 170}{10} = 1$

Step 2: Find probabilities from the z-table

$P (Z < 1) \approx 0.8413$
$P (Z < - 1) \approx 0.1587$

Step 3: Subtract to find the probability between

P (160 < X < 180) = P (Z < 1) - P (Z < - 1) = 0.8413 - 0.1587 = 0.6826

Final Answer:
About 68.26% of students are between 160 cm and 180 cm tall.

Why is the Normal Distribution Important?

Many natural phenomena (heights, test scores, measurement errors) follow a normal distribution.
The Central Limit Theorem states that the sum (or average) of many independent random variables tends toward a normal distribution, even if the original variables themselves are not normally distributed.
Used in hypothesis testing, confidence intervals, and many machine learning algorithms.