Concept

Binomial Distribution

The binomial distribution counts successes in a fixed number of independent Bernoulli trials. It sits inside the (a, b, 0) frequency class and is the right model whenever the trial count is fixed and the trial probability is constant.

Definition And Parameterization

A binomial random variable X has parameters m (the number of independent trials) and q (the probability of success on each trial). Each trial is a Bernoulli random variable with mean q, and the binomial is the sum of m independent Bernoullis. The Bernoulli is the special case m = 1.

The SOA Loss Models tables write the parameters as m and q. Earlier probability courses often use n and p. The mathematics is the same; only the letters change.

Probability mass function
P(X=k)=(mk)qk(1q)mk,k=0,1,,mP(X=k)=\binom{m}{k}q^k(1-q)^{m-k},\quad k=0,1,\ldots,m
Mean and variance
E[X]=mq,Var(X)=mq(1q)E[X]=mq,\qquad \operatorname{Var}(X)=mq(1-q)
Moment generating function
MX(t)=(1q+qet)mM_X(t)=\bigl(1-q+qe^{t}\bigr)^{m}

When To Use Binomial

Binomial is the right model when three conditions hold: a fixed number of trials, the same success probability on every trial, and independence across trials. The classic actuarial uses are number of policies with a claim in a portfolio of fixed size, number of survivors in a fixed cohort over a fixed period, and number of defaults among a fixed loan pool.

Pick Poisson instead when the count of trials is itself random or very large with small per-trial probability. Pick hypergeometric instead when sampling is without replacement from a small finite population so trials are not independent. Pick negative binomial when the observed variance exceeds the observed mean, which is incompatible with a binomial fit.

Maximum Likelihood Estimation

With independent observations X_1, ..., X_n drawn from Binomial(m, q) and m known, the maximum likelihood estimator for q is the sample mean divided by m. With m = 1 the MLE is just the sample proportion of successes.

If m is unknown, both m and q must be estimated jointly. The MLE for m has no clean closed form; in exam settings m is virtually always given.

Binomial MLE for q with known m
q^MLE=Xˉm\hat q_{\mathrm{MLE}}=\frac{\bar X}{m}

Worked Example: Claim Counts In A Fixed Portfolio

A book of 20 independent annual policies has a 0.08 probability that each one produces at least one claim during the year. Let X be the number of policies that produce a claim. Then X is Binomial(20, 0.08).

Probability that exactly two policies produce a claim is C(20, 2)(0.08)^2(0.92)^{18} = 190 × 0.0064 × 0.2229 ≈ 0.271. The mean is 1.6 and the variance is 1.472, so the coefficient of variation is √1.472 / 1.6 ≈ 0.758.

Computation
P(X=2)=(202)(0.08)2(0.92)180.271P(X=2)=\binom{20}{2}(0.08)^{2}(0.92)^{18}\approx 0.271

Worked Example: Survival In A Cohort

A cohort of 100 lives at age 65 each survive one year with probability 0.97, independently. The number of survivors S at age 66 is Binomial(100, 0.97). The expected number of survivors is 97; the variance is 100 × 0.97 × 0.03 = 2.91.

The probability that fewer than 95 survive can be approximated by the normal distribution because m × q × (1 − q) is not too small. With continuity correction, P(S ≤ 94) ≈ Φ((94.5 − 97) / √2.91) = Φ(−1.466) ≈ 0.071.

Worked Example: Sum Of Two Independent Binomials

If X is Binomial(m_1, q) and Y is Binomial(m_2, q) and X and Y are independent with the same q, then X + Y is Binomial(m_1 + m_2, q). The convolution is closed only when q is shared.

If m_1 = 8, m_2 = 12, q = 0.25, then X + Y is Binomial(20, 0.25) with mean 5 and variance 3.75. If the two q values differed, no binomial would fit the sum; the variance formula would still hold but the distribution would no longer be binomial.

Connections

Bernoulli is the m = 1 case. The Poisson distribution is the limit when m grows large and q shrinks with mq held at a constant λ; this is why Poisson is sometimes called the law of rare events.

Hypergeometric replaces independent trials with sampling without replacement; the binomial is the with-replacement analogue. As the population grows relative to the sample size, the two distributions converge.

Inside Loss Models, the binomial is a member of the (a, b, 0) class with a = -q/(1-q) and b = (m+1)q/(1-q). That parameterization is what enables the Panjer recursion for compound binomial aggregates.

Practice

Original exam practice

4 questions built from syllabus outcomes and released-exam patterns. The prompts and answers are original, so they train the skill without copying official exam text.

Binomial Fixed-Trials Drill

Original binomial checks for fixed trials, claim indicators, mean, variance, and Poisson approximation.

Exam P - 16 min
Source pattern: SOA Exam P binomial distribution outcomes; original fixed-portfolio prompts.
  1. Question 1/Calculation

    Exactly two claim policies

    A book has 20 independent policies. Each policy has an 8 percent chance of at least one claim during the year. Find the probability that exactly 2 policies have a claim.

    Solution and grading points

    Use the binomial model with 20 trials and success probability 0.08. The probability is C(20, 2) times 0.08 squared times 0.92 to the eighteenth power, about 0.271.

    • Identifies the fixed number of trials.
    • Uses the binomial mass function for exactly 2 successes.
    • Reports a probability near 0.271.
  2. Question 2/Calculation

    Mean and variance

    For 100 independent policies with a 3 percent claim probability each, find the mean and variance of the number of policies with a claim.

    Solution and grading points

    The mean is 100 times 0.03, or 3. The variance is 100 times 0.03 times 0.97, or 2.91.

    • Uses trials times probability for the mean.
    • Uses trials times probability times one minus probability for variance.
    • Keeps the count interpretation clear.
  3. Question 3/Written Answer

    Model choice

    Why is binomial a better model than Poisson when the problem gives exactly 50 independent policies and the same claim probability for each?

    Solution and grading points

    The fixed number of independent policies gives a fixed number of Bernoulli trials. Poisson is more natural for counts over exposure when the number of opportunities is not modeled as a fixed trial count.

    • Mentions the fixed number of trials.
    • Mentions the shared trial probability.
    • Distinguishes fixed-trial language from exposure-rate language.
  4. Question 4/Written Answer

    Rare-event approximation

    When can a Poisson distribution approximate a binomial claim count?

    Solution and grading points

    Use the approximation when the number of trials is large, the success probability is small, and the product of trials and success probability is a moderate stable mean.

    • Names large trial count.
    • Names small success probability.
    • Uses the product of trials and success probability as the Poisson mean.

References and official sources