1. Representation of Data S1
Stem-and-Leaf Diagrams
Box-and-Whisker Plots
Minimum value | Lower Quartile (Q₁) | Median (Q₂) | Upper Quartile (Q₃) | Maximum value
Interquartile Range (IQR) = Q₃ − Q₁ — measures spread of middle 50%.
Outliers: Values below Q₁ − 1.5×IQR or above Q₃ + 1.5×IQR.
Histograms for Grouped Data
Frequency density = Frequency ÷ Class width
Frequency = Frequency density × Class width
📐 Worked Example 1 — Histogram and Frequency Density
Data on journey times (minutes): 0–10: 12, 10–20: 18, 20–40: 22, 40–70: 15. Draw the histogram frequency densities and estimate the number of journeys under 30 minutes.
0–10: 12/10=1.2 10–20: 18/10=1.8 20–40: 22/20=1.1 40–70: 15/30=0.5
= 12 + 18 + 22/2 = 12 + 18 + 11 = 41 journeys
Cumulative Frequency Curves
2. Measures of Location and Spread S1
Measures of Location
Measures of Spread — Variance and Standard Deviation
📐 Worked Example 2 — Mean and Variance
For the data: 3, 7, 7, 8, 10, 12, 14. Find the mean, variance, and standard deviation.
x̄ = 61/7 ≈ 8.714
σ² = 611/7 − (61/7)² = 87.286 − 75.918 = 11.368
📐 Worked Example 3 — Coding
Data has Σx=360, Σx²=15200, n=20. Using y=x−20, find the mean and standard deviation of x.
x̄=ȳ+20=18
σ²_y=Σy²/n−ȳ²=8800/20−4=440−4=436
Since y=x−20, σ_x=σ_y=√436≈20.9
Choosing the Right Measure
| Situation | Best Measure of Location | Best Measure of Spread |
|---|---|---|
| Symmetric distribution | Mean | Standard deviation |
| Skewed distribution | Median | IQR |
| Outliers present | Median | IQR |
| Categorical data | Mode | — |
| Further calculations needed | Mean | Standard deviation |
3. Probability S1
Fundamental Probability Rules
📐 Worked Example 4 — Conditional Probability
In a class: P(pass Maths) = 0.7, P(pass Physics) = 0.6, P(pass both) = 0.45. Find: (a) P(pass at least one) (b) P(pass Maths | pass Physics) (c) Are the events independent?
P(M|P)=0.75≠0.7=P(M). Therefore not independent.
Tree Diagrams and Venn Diagrams
📐 Worked Example 5 — Tree Diagram (Two-Stage)
A bag has 4 red and 6 blue balls. Two balls are drawn without replacement. Find the probability that both are the same colour.
P(BB) = (6/10)×(5/9) = 30/90 = 1/3
Permutations and Combinations
Counting Methods
📐 Worked Example 6 — Combinations in Probability
A committee of 4 is chosen from 6 men and 5 women. Find the probability that the committee has at least 2 women.
= (⁵C₂×⁶C₂ + ⁵C₃×⁶C₁ + ⁵C₄×⁶C₀)/330
= (10×15 + 10×6 + 5×1)/330
= (150+60+5)/330 = 215/330 = 43/66
4. Discrete Random Variables S1
Expectation and Variance of a DRV
📐 Worked Example 7 — Probability Distribution Table
A DRV X has P(X=x) = k(x+1) for x = 0, 1, 2, 3. Find k, E(X), and Var(X).
Var(X)=E(X²)−[E(X)]²=5−4=1
Geometric Distribution
X ~ Geo(p): P(X=r) = (1−p)^(r−1) p for r = 1, 2, 3, ...
E(X) = 1/p Var(X) = (1−p)/p²
📐 Worked Example 8 — Geometric Distribution
A biased coin has P(head) = 0.3. The coin is tossed until a head appears. Find: (a) P(X=4) (b) P(X≤3) (c) E(X) and Var(X)
= 0.3 + 0.7×0.3 + 0.7²×0.3 = 0.3+0.21+0.147 = 0.657
Alternative: P(X≤3) = 1−P(X>3) = 1−(0.7)³ = 1−0.343 = 0.657 ✓
Var(X) = 0.7/0.09 = 70/9 ≈ 7.78
5. Binomial Distribution S1
Binomial Distribution Formulae
📐 Worked Example 9 — Binomial Probabilities
X~B(10, 0.35). Find: (a) P(X=4) (b) P(X≥3) (c) E(X) and Var(X) (d) P(X=k) is maximum at k=?
= 210×0.01501×0.07542 ≈ 0.2377
P(0)=(0.65)¹⁰≈0.01346; P(1)=10×0.35×(0.65)⁹≈0.07249; P(2)=45×(0.35)²×(0.65)⁸≈0.17567
P(X≤2)≈0.26162. P(X≥3)≈0.7384
(n+1)p=11×0.35=3.85 → floor=3. Check: P(3)≈0.2522>P(4)≈0.2377 ✓
Modal value = 3
Normal Approximation to Binomial
6. Poisson Distribution S1
Poisson Distribution
📐 Worked Example 10 — Poisson Distribution
Calls arrive at a helpdesk at a rate of 4 per hour. Find: (a) P(exactly 3 calls in an hour) (b) P(at most 2 calls in 30 minutes) (c) P(at least 1 call in 15 minutes)
= e⁻²(1+2+2) = 5e⁻² ≈ 5×0.1353 = 0.6767
Binomial B(n,p): Fixed n trials, constant p, independent, count successes.
Geometric Geo(p): Count trials until first success, independent, constant p.
Poisson Po(λ): Count events in interval, random/independent, constant rate λ, E=Var=λ.
Normal N(μ,σ²): Continuous data, symmetric bell curve (Lesson 9).
📝 Exam Practice Questions
Q1 [4 marks] — Data: 12, 15, 18, 18, 20, 22, 25, 28, 30, 35. Find the median, quartiles, IQR, and identify any outliers using the 1.5×IQR rule.
Q₁ = median of lower half {12,15,18,18,20} = 18
Q₃ = median of upper half {22,25,28,30,35} = 28
IQR = 28−18 = 10
Lower fence: 18−15=3. Upper fence: 28+15=43. No values outside [3,43].
No outliers.
Q2 [3 marks] — For 50 values: Σx=480, Σx²=5200. Find the mean, variance, and standard deviation.
σ² = 5200/50 − 9.6² = 104 − 92.16 = 11.84
σ = √11.84 ≈ 3.44
Q3 [4 marks] — Events A and B satisfy P(A)=0.4, P(B)=0.5, P(A∪B)=0.7. (a) Find P(A∩B). (b) Find P(A|B). (c) Are A and B independent? (d) Are A and B mutually exclusive?
(b) P(A|B)=P(A∩B)/P(B)=0.2/0.5=0.4
(c) P(A|B)=0.4=P(A) ✓ → A and B are independent
(Also: P(A∩B)=0.2=P(A)×P(B)=0.4×0.5=0.2 ✓)
(d) P(A∩B)=0.2≠0 → not mutually exclusive
Q4 [4 marks] — A DRV X has the distribution: P(X=1)=0.1, P(X=2)=0.3, P(X=3)=k, P(X=4)=0.2, P(X=5)=0.1. Find k, E(X), Var(X), and E(3X−2).
E(X)=1(0.1)+2(0.3)+3(0.3)+4(0.2)+5(0.1)=0.1+0.6+0.9+0.8+0.5=2.9
E(X²)=1(0.1)+4(0.3)+9(0.3)+16(0.2)+25(0.1)=0.1+1.2+2.7+3.2+2.5=9.7
Var(X)=9.7−2.9²=9.7−8.41=1.29
E(3X−2)=3E(X)−2=8.7−2=6.7
Q5 [3 marks] — X~B(15, 0.4). Find P(X=6), P(4≤X≤7), and the most likely value of X.
P(4≤X≤7)=P(4)+P(5)+P(6)+P(7)
≈0.1268+0.1859+0.2066+0.1771≈0.6964
Modal value: (n+1)p=16×0.4=6.4 → floor=6. Mode=6
Q6 [4 marks] — Flaws in a fabric occur at a rate of 2 per metre. (a) State the distribution of X = number of flaws in 3 metres. (b) Find P(X=5). (c) Find P(X>8). (d) In 0.5 m, find P(at least one flaw).
(b) P(X=5)=e⁻⁶×6⁵/5!=e⁻⁶×7776/120≈0.002479×64.8≈0.1606
(c) P(X>8)=1−P(X≤8). Using cumulative Poisson tables or calculation:
P(X≤8)≈0.8472. P(X>8)≈0.1528
(d) 0.5m: λ=1. P(X≥1)=1−e⁻¹≈0.6321