Probability Density Functions
A continuous random variable X takes values in an interval (or union of intervals). Unlike discrete variables, P(X = x) = 0 for any specific value. Instead, probability is defined over intervals through integration of the probability density function f(x).
Cumulative Distribution Function (CDF)
Properties of the CDF
| Property | Statement |
|---|---|
| Range | 0 ≤ F(x) ≤ 1 for all x |
| Monotone | F is non-decreasing: x₁ < x₂ ⟹ F(x₁) ≤ F(x₂) |
| Limits | F(x) → 0 as x → −∞ and F(x) → 1 as x → +∞ |
| Continuity | F is continuous for a CRV (unlike discrete RVs) |
| At boundaries | F(a) = 0 and F(b) = 1 at the endpoints of the support |
- A common question gives f(x) = kx² (or similar) over [a,b] and asks you to find k using the total area = 1 condition.
- Then asks you to find the CDF F(x) by integrating — always include the lower limit and write F(x) piecewise (0 for x < a, formula for a ≤ x ≤ b, 1 for x > b).
- Then asks for a specific probability — use the CDF rather than integrating again.
- Always state the support of the distribution (the range where f(x) > 0).
(i) Find k. (ii) Find F(x). (iii) Find P(0.5 ≤ X ≤ 1.5).
k∫₀²(2x − x²) dx = k[x² − x³/3]₀² = k(4 − 8/3) = k · 4/3 = 1
F(x) = ∫₀ˣ (3/4)t(2−t) dt = (3/4)[t² − t³/3]₀ˣ = (3/4)(x² − x³/3)
F(1.5) = (3/4)(2.25 − 1.125) = (3/4)(1.125) = 0.84375
F(0.5) = (3/4)(0.25 − 0.125/3) = (3/4)(0.2083) = 0.15625
Median, Quartiles, and Percentiles
Expectation & Variance
Also written Var(X) = E[(X−μ)²] = ∫(x−μ)² f(x) dx — but the computational form E(X²)−μ² is always faster.
Linear Transformations
Sums of Independent Random Variables
When X and Y are independent: Var(X − Y) = Var(X) + Var(Y) — variances always add, never subtract. This catches many students. For Var(2X): that is Var(aX) with a=2, giving 4Var(X). This is NOT the same as Var(X+X) from two independent copies.
= (3/4)[2x³/3 − x⁴/4]₀² = (3/4)(16/3 − 4) = (3/4)(4/3)
= (3/4)(8 − 32/5) = (3/4)(8/5)
Mode and Median vs Mean
| Measure | Definition | How to Find |
|---|---|---|
| Mean μ | ∫ x f(x) dx | Integrate x·f(x) over support |
| Median m | F(m) = 0.5 | Solve CDF = ½ for m |
| Mode | x where f(x) is maximum | Differentiate f(x), set f'(x)=0 |
| Quartile Q₁ | F(Q₁) = 0.25 | Solve CDF = ¼ |
| Quartile Q₃ | F(Q₃) = 0.75 | Solve CDF = ¾ |
Key Continuous Distributions
Cambridge 9231 P4 requires detailed knowledge of four continuous distributions. You must be able to state the PDF, CDF, mean, and variance for each — and recognise which applies from context.
F(x) = (x−a)/(b−a)
f(x) = 0 otherwise
F(x) = 1 − e^(−λx)
f(x) = 0 for x < 0
No closed-form CDF
Standardise: Z = (X−μ)/σ ~ N(0,1)
Support: 0 ≤ x ≤ 1
B(α,β) = Γ(α)Γ(β)/Γ(α+β)
The Exponential Distribution in Depth
The exponential distribution models the waiting time between events in a Poisson process with rate λ. It has the key memoryless property:
If events occur as a Poisson process at rate λ per unit time, then the waiting time between consecutive events follows Exp(λ). This link is often tested in P4 — you may need to switch between Poisson (discrete, counting events) and Exponential (continuous, modelling time).
The Normal Distribution — Standardisation
Linear Combinations of Normal Variables
If X ~ N(μ₁, σ₁²) and Y ~ N(μ₂, σ₂²) are independent:
Functions of a Random Variable
If Y = g(X) where X is a CRV, we need the distribution of Y. There are two standard methods in Cambridge 9231.
Method 1 — CDF Method (Always Works)
Rearrange the inequality to express in terms of X.
(valid for 0 ≤ y ≤ 4)
Method 2 — Change of Variable Formula (Monotone g)
Worked Examples
(i) Find a. (ii) Find F(x). (iii) Find the median. (iv) Find E(X) and Var(X).
a[x²/2]₀¹ + a[2x − x²/2]₁² = a(1/2) + a(4−2 − 2+1/2) = a/2 + a/2 = a = 1
For 1 < x ≤ 2: F(x) = F(1) + ∫₁ˣ (2−t) dt = 1/2 + [2t−t²/2]₁ˣ = 1/2 + (2x−x²/2) − (2−1/2)
= 1/2 + 2x − x²/2 − 3/2 = 2x − x²/2 − 1
E(X²) = ∫₀¹ x²·x dx + ∫₁² x²(2−x) dx = [x⁴/4]₀¹ + [2x³/3−x⁴/4]₁²
= 1/4 + (16/3−4) − (2/3−1/4) = 1/4 + 4/3 − 5/12 = 7/6
Var(X) = 7/6 − 1 = 1/6 ≈ 0.167
(i) State the distribution of T. (ii) Find P(T > 0.3). (iii) Given T > 0.1, find P(T > 0.4).
Practice Questions
(i) Show k = 4/27. (ii) Find F(x) for 0 ≤ x ≤ 3. (iii) Find the median, correct to 3 d.p.
(ii) F(x) = (4/27)∫₀ˣ t²(3−t) dt = (4/27)[t³−t⁴/4]₀ˣ = (4/27)(x³ − x⁴/4)
Try m=2: F(2) = 32/27 − 16/108 = 32/27 − 4/27 = 28/27 — too big; wait 28/27 > 1 — recheck.
F(2) = (4×8/27)(1−2/12) = (32/27)(10/12) = 320/324 = 0.988 — too big.
Try m=1.6: F(1.6) = 4(4.096)/27 − (6.5536)/108 = 0.608 − 0.061 = 0.547
Try m=1.5: F(1.5) = 4(3.375)/27 − 5.0625/108 = 0.500 − 0.047 = 0.453
Median ≈ between 1.5 and 1.6; bisect to get
Var(X) = (8−2)²/12 = 36/12 = 3
P(|X−5| < 1) = P(4 < X < 6) = (6−4)/(8−2) = 2/6 =
(i) Find the mean and standard deviation of T.
(ii) Find P(T > 8).
(iii) Find the value of t such that P(T < t) = 0.9.
(ii) P(T > 8) = e^(−0.2×8) = e^(−1.6) ≈
(iii) 1−e^(−0.2t) = 0.9 ⟹ e^(−0.2t) = 0.1 ⟹ t = ln(10)/0.2 =
F_X(x) = x² (integrate f(x)=2x).
F_Y(y) = P(1−X² ≤ y) = P(X² ≥ 1−y) = P(X ≥ √(1−y)) = 1 − F_X(√(1−y)) = 1 − (1−y) = y
f_Y(y) = d/dy [y] = 1
Interactive Distribution Tool
Select a distribution, adjust parameters, and see the PDF and CDF curves with live probability calculations.
Formula Sheet — Continuous Random Variables
- Finding k: Integrate f(x) over the stated support, set equal to 1, solve. Never integrate over the whole real line if the support is finite.
- Writing F(x): Always give F(x) piecewise — F(x) = 0 below support, formula in between, F(x) = 1 above support. Cambridge deducts marks for missing the boundary cases.
- Mode: Differentiate f(x) and set f'(x) = 0. Check it is a maximum (second derivative or sign check). The mode is where the PDF is highest, not where F = 0.5.
- Var(X−Y): This equals Var(X) + Var(Y) for independent X,Y. Never Var(X) − Var(Y).
- Functions of X: Always use the CDF method for safety — it always works. The Jacobian method is faster only for monotone transformations.
- Exponential memoryless: P(T > s+t | T > s) = P(T > t). State this explicitly — Cambridge awards a mark for invoking the memoryless property by name.