1. The Normal Distribution S1
Key Properties of the Normal Distribution
Shape Properties
- Symmetric about the mean μ
- Mean = Median = Mode = μ
- Bell-shaped curve — single peak at μ
- Asymptotic to x-axis (tails never touch)
- Total area under curve = 1
- 68% of data within μ±σ
- 95% within μ±2σ
- 99.7% within μ±3σ
Effect of μ and σ
- Increasing μ shifts the curve right
- Decreasing μ shifts the curve left
- Larger σ → flatter, wider curve
- Smaller σ → taller, narrower curve
- The area under the curve is always 1 regardless of μ and σ
Standardisation — The Z-Score
Standardisation Formula
The table gives Φ(z) = P(Z < z) for z > 0. For negative z: use Φ(−z) = 1−Φ(z).
Key values: Φ(1.645) ≈ 0.95, Φ(1.960) ≈ 0.975, Φ(2.326) ≈ 0.99, Φ(2.576) ≈ 0.995
These give critical values for common significance levels in hypothesis testing.
📐 Worked Example 1 — Normal Probabilities
X ~ N(50, 16). Find: (a) P(X < 54) (b) P(X > 46) (c) P(44 < X < 58) (d) P(|X−50| < 3)
P=Φ(2)−Φ(−1.5)=0.9772−(1−0.9332)=0.9772−0.0668=0.9104
📐 Worked Example 2 — Finding μ or σ from a Probability
(a) X~N(μ,25) and P(X<30)=0.8. Find μ. (b) X~N(60,σ²) and P(X>70)=0.1. Find σ.
(30−μ)/5=0.842 → 30−μ=4.21 → μ=25.79
(70−60)/σ=1.282 → σ=10/1.282=7.80
📐 Worked Example 3 — Finding Both μ and σ
X~N(μ,σ²). P(X<20)=0.2 and P(X<30)=0.8. Find μ and σ.
P(X<30)=0.8 → z₂=0.842
Subtract: 10/σ=1.684 → σ=5.94
2. Normal Approximations S1
Normal Approximation to the Binomial
P(X=k) → P(k−½ < Y < k+½)
P(X ≤ k) → P(Y < k+½) P(X < k) → P(Y < k−½)
P(X ≥ k) → P(Y > k−½) P(X > k) → P(Y > k+½)
where Y ~ N(np, npq)
📐 Worked Example 4 — Normal Approximation to Binomial
X~B(100, 0.4). Use Normal approximation to find P(X ≤ 35).
z=(35.5−40)/√24=(35.5−40)/4.899=−0.918
Normal Approximation to the Poisson
📐 Worked Example 5 — Normal Approximation to Poisson
X~Po(25). Use Normal approximation to find P(X ≥ 30).
3. Sampling Distributions and the Central Limit Theorem S1
Sampling Distribution Formulae
📐 Worked Example 6 — Sample Mean Distribution
The weight of apples is distributed with mean 120g and standard deviation 15g. A sample of 25 apples is selected. Find: (a) the distribution of the sample mean (b) P(X̄ > 125) (c) P(115 < X̄ < 122).
P(X̄>125)=1−Φ(1.667)=1−0.9525=0.0475
P=Φ(0.667)−Φ(−1.667)=0.7475−0.0475=0.7000
4. Hypothesis Testing — Population Mean S1
1. State H₀ and H₁ — Null hypothesis (no change/claimed value) and Alternative (what you suspect).
2. State significance level α (usually 5% or 1%).
3. State the test statistic and its distribution under H₀.
4. Calculate the test statistic from the sample data.
5. Find the critical value or calculate the p-value.
6. Compare test statistic to critical region (or p-value to α).
7. Conclude in context — use words about the original claim.
One-Tailed and Two-Tailed Tests
| Test Type | H₁ | Critical Region | Critical Value (5%) |
|---|---|---|---|
| One-tailed (upper) | μ > μ₀ | Z > z_α | Z > 1.645 |
| One-tailed (lower) | μ < μ₀ | Z < −z_α | Z < −1.645 |
| Two-tailed | μ ≠ μ₀ | |Z| > z_{α/2} | |Z| > 1.960 |
📐 Worked Example 7 — Z-Test for Population Mean
The weight of cereal boxes is claimed to be μ=500g with σ=8g. A sample of 36 boxes has mean 497g. Test at 5% significance level whether the population mean has decreased.
Significance level: α=5%=0.05
Test statistic: Z=(497−500)/(8/√36)=(−3)/(8/6)=−3×6/8=−2.25
Z=−2.25 < −1.645 → falls in the critical region.
📐 Worked Example 8 — Two-Tailed Z-Test
A machine produces bolts with diameter N(10mm, 0.04mm²). After maintenance, a sample of 20 bolts has mean 10.08mm. Test at 1% level whether the mean diameter has changed.
α=1%, two-tailed → critical values: Z=±2.576
Z=(10.08−10)/0.04472=0.08/0.04472=1.789
Conclusion: Do not reject H₀. Insufficient evidence at the 1% level to conclude the mean diameter has changed after maintenance.
5. Hypothesis Testing — Binomial Proportion S1
📐 Worked Example 9 — Binomial Hypothesis Test
A manufacturer claims 20% of items are defective. A quality inspector finds 9 defectives in a random sample of 30. Test at 5% level whether the proportion of defectives has increased.
Under H₀: X~B(30, 0.2). α=5%.
Using tables: P(X≤8)≈0.8867
P-value = 1−0.8867 = 0.1133
Conclusion: Do not reject H₀. There is insufficient evidence at the 5% level to conclude that the proportion of defectives has increased above 20%.
📐 Worked Example 10 — Critical Region for Binomial Test
X~B(20, p). Test H₀: p=0.3 vs H₁: p<0.3 at 5%. Find the critical region.
Calculate cumulative probabilities under B(20, 0.3):
P(X≤3)≈0.1071 > 0.05 ✗
Critical region: X ≤ 2 (actual significance level ≈ 3.55%)
Critical region: Find the boundary value c such that P(X ≤ c) < α (for lower tail). Reject H₀ if observation falls in critical region.
P-value: Calculate P(result as extreme or more extreme | H₀ true). Reject H₀ if p-value < α.
Both give the same conclusion — Cambridge questions may ask for either or both.
6. Type I and Type II Errors S1
Type II Error: Failing to reject H₀ when it is actually false — a false negative. Probability denoted β. Depends on the true value of the parameter.
| H₀ True | H₀ False | |
|---|---|---|
| Reject H₀ | Type I Error (probability=α) | Correct decision (Power=1−β) |
| Do not reject H₀ | Correct decision (probability=1−α) | Type II Error (probability=β) |
📐 Worked Example 11 — Type I and Type II Error Probabilities
X~B(20,p). Test H₀:p=0.4 vs H₁:p≠0.4 at 5%. Critical region is X≤4 or X≥13. (a) Find P(Type I Error). (b) Find P(Type II Error) when p=0.6.
= P(X≤4|B(20,0.4)) + P(X≥13|B(20,0.4))
≈ 0.0510 + P(X≥13)≈0.0510+0.0210=0.0210+0.0510=0.0210
More precisely: P(X≤4)≈0.0510, P(X≥13)≈1−P(X≤12)≈1−0.9790=0.0210
P(Type I)≈0.0510+0.0210=0.0720
= P(5≤X≤12 | B(20,0.6))
= P(X≤12|B(20,0.6)) − P(X≤4|B(20,0.6))
≈ 0.5841 − 0.0003 ≈ 0.5838
📝 Exam Practice Questions
Q1 [4 marks] — X~N(72, 100). Find: (a) P(X<80) (b) P(60<X<75) (c) The value of a such that P(X>a)=0.3.
(a) z=(80−72)/10=0.8. P(X<80)=Φ(0.8)=0.7881
(b) z₁=(60−72)/10=−1.2; z₂=(75−72)/10=0.3
P=Φ(0.3)−Φ(−1.2)=0.6179−0.1151=0.5028
(c) P(X>a)=0.3 → P(X<a)=0.7 → z=0.524
a=72+0.524×10=77.24
Q2 [4 marks] — X~N(μ,σ²). P(X<15)=0.25 and P(X<25)=0.75. Find μ and σ.
P(X<25)=0.75 → z₂=0.674
(15−μ)/σ=−0.674 ...(1); (25−μ)/σ=0.674 ...(2)
Subtract: 10/σ=1.348 → σ=7.42
μ=(15+25)/2=20 (by symmetry, since z-scores are equal and opposite)
Q3 [4 marks] — A sample of 50 is taken from a population with σ=12. The sample mean is 43.6. Test at 5% whether the population mean has increased from a claimed value of 41.
Under H₀: X̄~N(41, 144/50). SE=12/√50=1.697
Z=(43.6−41)/1.697=2.6/1.697=1.532
Critical value (5%, one-tail)=1.645. Z=1.532<1.645 → not in critical region.
Conclusion: Do not reject H₀. Insufficient evidence at 5% level that the mean has increased above 41.
Q4 [4 marks] — A six-sided die is suspected to be biased towards 6. In 60 rolls, a 6 appears 16 times. Test at 5% significance whether the die is biased. Use the Normal approximation with continuity correction.
np=10, nq=50 — Normal approx valid. Y~N(10, 60×1/6×5/6)=N(10, 25/3)
SE=√(25/3)=2.887
P(X≥16) ≈ P(Y>15.5)=(15.5−10)/2.887=5.5/2.887=1.905
P(Z>1.905)=1−Φ(1.905)=1−0.9717=0.0283<0.05
Reject H₀. Evidence at 5% that the die is biased towards 6.
Q5 [4 marks] — Records show that 30% of patients recover within a week. A new treatment is given to 15 patients and 8 recover within a week. (a) Test at 10% level whether the new treatment improves recovery. (b) What would be the conclusion at 5%?
P-value = P(X≥8) = 1−P(X≤7).
P(X≤7)=ΣP: using B(15,0.3): P(X≤7)≈0.9500
P-value=1−0.9500=0.0500
(a) p-value=0.05≤0.10 → Reject H₀ at 10%. Evidence that new treatment improves recovery.
(b) p-value=0.05≤0.05 → Also reject at 5% — borderline case.
Q6 [3 marks] — Explain the meaning of a Type I error in the context of Q5. State the probability of making a Type I error if the test is conducted at 5% significance.
The probability of a Type I error = the significance level = 0.05 (5%).
This is the probability of observing X≥8 when p=0.3 (i.e. getting a result this extreme by chance alone when the null hypothesis is true).