Mathematical Statistics

The Binomial Probability Distribution

Samir Orujov, PhD

ADA University, School of Business

Information Communication Technologies Agency, Statistics Unit

2025-11-05

๐ŸŽฏ Learning Objectives

By the end of this lecture, you will be able to:

  • Identify and verify the five properties of binomial experiments in financial and economic contexts
  • Apply the binomial probability formula to calculate probabilities for investment, credit risk, and market scenarios
  • Interpret binomial probability distributions and their parameters in economic applications
  • Compute mean and variance of binomial random variables for financial modeling
  • Use maximum likelihood estimation to estimate success probabilities from sample data
  • Apply binomial models to real-world financial and economic problems

๐Ÿ“‹ Overview

๐Ÿ“š Topics Covered Today

  • Binomial Probability Distribution - Definition and mathematical properties
  • Binomial Experiments - Identifying conditions in finance and economics
  • Probability Calculations - Using formulas and statistical tables
  • Mean and Variance - Expected value and risk measures
  • Applications - Portfolio management, credit risk, market analysis
  • Maximum Likelihood Estimation - Parameter estimation from data

๐Ÿ“– Definition: Binomial Experiment

๐Ÿ“ Definition 1: Binomial Experiment

A binomial experiment possesses the following properties:

  1. ๐Ÿ”ข The experiment consists of a fixed number, \(n\), of identical trials
  2. โš–๏ธ Each trial results in one of two outcomes: success (\(S\)) or failure (\(F\))
  3. ๐ŸŽฏ The probability of success \(p\) remains constant from trial to trial, with \(q = (1 - p)\)
  4. ๐Ÿ”— The trials are independent of each other
  5. ๐Ÿ“Š The random variable \(Y\) counts the number of successes in \(n\) trials

๐Ÿค” Think-Pair-Share Activity

๐Ÿ’ญ Activity 1: Identifying Binomial Experiments (3 minutes)

๐Ÿง  Think (1 minute): Consider the following financial scenario individually:

โ€œA credit analyst reviews 10 loan applications from different borrowers. Each application is either approved or rejected based on credit scores. Historical data shows a 65% approval rate.โ€

๐Ÿ‘ฅ Pair (1 minute): Discuss with your neighbor - does this meet all five binomial experiment properties?

03:00

๐Ÿ—ฃ๏ธ Share (1 minute): Groups share their conclusions with the class

๐Ÿ“Œ Example 1: Credit Risk Management

๐Ÿ’ณ Example 1

A bankโ€™s loan portfolio contains four independent corporate loans, each with a historical 5% probability of default over the next year. The risk manager is interested in \(Y\), the number of loans that will default. Is this a binomial experiment?

โœ… Solution

Letโ€™s verify each property:

  • โœ… Fixed trials: Four independent loans (\(n = 4\))
  • โœ… Two outcomes:
    • Success (\(S\)): loan defaults
    • Failure (\(F\)): loan does not default
  • โœ… Constant probability: \(p = P(S) = 0.05\)
  • โœ… Independence: Loans are from different sectors/independent
  • โœ… Count successes: \(Y\) = number of defaults

Conclusion: Yes, this is a binomial experiment! The bank can use binomial distribution to assess portfolio risk.

๐Ÿ“Œ Example 2: Market Survey

๐Ÿ“Š Example 2

An economist surveys 10 randomly selected consumers from a large population where 40% prefer brand A over competitors. Let \(Y\) be the number who prefer brand A. Does this meet binomial requirements?

๐Ÿ’ก Important Remark

When sampling without replacement from a finite population, trials are technically dependent. However:

Rule of Thumb: If the sample size is less than 5% of the population (\(n < 0.05N\)), the dependency is negligible and we can approximate with the binomial distribution.

For large consumer markets, this condition is easily satisfied!

๐Ÿ“ Quiz #1: Identifying Binomial Experiments

Which of the following is NOT a binomial experiment?

  • Monitoring 20 stock trades where each results in profit or loss
  • Surveying investors until finding 5 who own cryptocurrency
  • Testing 15 products from a production line for defects
  • Checking 30 loan applications for completeness
02:00

๐Ÿงฎ The Binomial Distribution

๐Ÿ“ Definition 2: Binomial Distribution

A random variable \(Y\) has a binomial distribution based on \(n\) trials with success probability \(p\) if and only if:

\[ p(y) = \binom{n}{y}p^yq^{n-y}, \quad y=0,1,2,\dotsc, n \quad \text{and} \quad 0\le p \le 1 \]

where \(q = 1 - p\) and \(\binom{n}{y} = \frac{n!}{y!(n-y)!}\)

Notation: We write \(Y \sim \text{Binomial}(n, p)\) or \(Y \sim B(n, p)\)

๐Ÿ’ก Binomial Theorem

\[ (q+p)^n = \binom{n}{0}q^n+\binom{n}{1}pq^{n-1}+\binom{n}{2}p^2q^{n-2}+\cdots+\binom{n}{n}p^n \]

Since \(p + q = 1\), we have \((q+p)^n = 1\), confirming that \(\sum_{y=0}^{n}p(y) = 1\) โœ“

๐Ÿ” Interactive Exploration

๐ŸŒ Explore Binomial Distributions

Visit this interactive applet: Binomial Distribution Applet

Activity: Experiment with different parameter values:

  • ๐Ÿ“Š Set \(n = 10\), \(p = 0.5\) โ†’ Symmetric distribution (like a fair coin)
  • ๐Ÿ“‰ Set \(n = 10\), \(p = 0.1\) โ†’ Right-skewed (rare events)
  • ๐Ÿ“ˆ Set \(n = 10\), \(p = 0.9\) โ†’ Left-skewed (common events)

Financial Interpretation: Observe how changing default probability \(p\) affects the distribution shape!

๐Ÿค” Think-Pair-Share Activity #2

๐Ÿงฉ Activity 2: Understanding the Formula Components (4 minutes)

๐Ÿง  Think (1.5 min): Why do we multiply \(\binom{n}{y}\), \(p^y\), and \(q^{n-y}\) together? What does each component represent in a portfolio of risky assets?

๐Ÿ‘ฅ Pair (1.5 min): Explain your reasoning to your partner. - What is \(\binom{n}{y}\)? (Number of ways to select which assets default) - What is \(p^y\)? (Probability those specific assets default) - What is \(q^{n-y}\)? (Probability remaining assets donโ€™t default)

04:00

๐Ÿ—ฃ๏ธ Share (1 min): Volunteer pairs share insights with the class

๐Ÿ“Œ Example 3: Investment Portfolio

๐Ÿ’ผ Example 3

A venture capital fund invests in 5 startup companies. Historical data shows that 5% of startups fail within the first year. Find the probability that at least one startup fails.

๐Ÿ”ข Solution

Let \(Y\) = number of failed startups. Then \(Y \sim \text{Binomial}(n=5, p=0.05)\).

\[ P(\text{at least one fails}) = 1 - P(Y = 0) \]

\[ P(Y = 0) = \binom{5}{0}(0.05)^0(0.95)^5 = 1 \times 1 \times (0.95)^5 = 0.774 \]

Therefore: \[ P(Y \geq 1) = 1 - 0.774 = 0.226 \]

Interpretation: Thereโ€™s a 22.6% chance that at least one startup will fail, despite the individual failure rate being only 5%. This demonstrates the importance of diversification!

๐Ÿ“Œ Example 4: Pharmaceutical Investment

๐Ÿ’Š Example 4

A pharmaceutical company knows that 30% of experimental drugs pass clinical trials without their new technology. They claim a new screening method increases success. With the new method, 9 out of 10 drugs passed trials. If the method were worthless, whatโ€™s \(P(Y \geq 9)\)?

๐Ÿ“Š Solution Setup

Let \(Y\) = number of drugs passing trials. If method is worthless: \(p = 0.3\), \(n = 10\).

We need to find \(P(Y \geq 9) = P(Y = 9) + P(Y = 10)\)

๐Ÿ“Œ Example 4: Solution

Probability of exactly nine successes:

\[ P(Y = 9) = \binom{10}{9} (0.3)^9 (0.7)^1 = 10 \times (0.3)^9 \times 0.7 = 0.000138 \]

Probability of exactly ten successes:

\[ P(Y = 10) = \binom{10}{10} (0.3)^{10} (0.7)^0 = (0.3)^{10} = 0.000006 \]

Therefore: \[ P(Y \geq 9) = 0.000138 + 0.000006 = 0.000144 \]

๐ŸŽฏ Statistical Interpretation

If the screening method is ineffective, observing 9+ successes has probability 0.0144% (about 1 in 7,000!). This provides very strong evidence that the new screening method is effective. A pharmaceutical company would likely invest in this technology based on such results.

๐Ÿ“ Quiz #2: Calculating Probabilities

An investor makes 6 independent trades with 50% probability of profit on each. What is the probability of exactly 4 profitable trades?

Hint: Use \(n=6\), \(p=0.5\), \(y=4\) in the binomial formula.

  • 0.234
  • 0.313
  • 0.156
  • 0.500
03:00

๐Ÿ“Œ Example 5: Using Statistical Tables

๐Ÿ“‹ Example 5

A bond portfolio contains bonds with 5% default probability. If the fund holds 20 bonds, find the probability that at least four bonds default.

๐Ÿ”ข Solution

With \(Y \sim \text{Binomial}(n=20, p=0.05)\):

\[ P(Y\geq 4) = 1 - P(Y\leq 3) = 1 - \sum_{y=0}^3p(y) \]

We can calculate each term or use cumulative binomial tables. Using statistical software or tables:

\(P(Y \leq 3) \approx 0.9841\)

Therefore: \[ P(Y \geq 4) = 1 - 0.9841 = 0.0159 \]

Risk Assessment: Only about 1.6% chance of having 4+ defaults, suggesting the portfolio is relatively safe under current assumptions.

๐Ÿ“Š Using Binomial Tables

๐Ÿ“š Cumulative Binomial Tables

Tabulated values represent cumulative probabilities: \[ P(Y \leq a) = \sum_{y=0}^{a}p(y) \]

Steps for \(n=20\), \(p=0.05\), finding \(P(Y \leq 3)\):

  1. โžŠ Locate the section for \(n=20\)
  2. โž‹ Find the column for \(p=0.05\)
  3. โžŒ Read the cumulative probability at row \(y=3\)

Modern Approach: Use statistical software for exact calculations:

  • R: pbinom(3, 20, 0.05) gives \(P(Y \leq 3)\)

  • Python: scipy.stats.binom.cdf(3, 20, 0.05)

  • Excel: =BINOM.DIST(3, 20, 0.05, TRUE)

๐ŸŽฏ Key Theorem

๐Ÿ“ Theorem 1: Mean and Variance

Let \(Y\) be a binomial random variable based on \(n\) trials with success probability \(p\). Then:

\[ \mu = E(Y) = np \]

\[ \sigma^2 = V(Y) = npq = np(1-p) \]

\[ \sigma = SD(Y) = \sqrt{npq} \]

where \(q = 1 - p\).

๐Ÿ’ก Financial Interpretation

  • ๐Ÿ“Š Expected value \(E(Y) = np\): Expected number of defaults, claims, or successes
  • ๐Ÿ“‰ Variance \(V(Y) = npq\): Measure of risk/uncertainty; maximum when \(p = 0.5\)
  • ๐Ÿ“ Standard deviation \(\sigma\): Typical deviation from expected value (used in risk management)

๐Ÿค” Group Activity: Portfolio Risk

๐Ÿฆ Activity 3: Understanding Expected Value (5 minutes)

Scenario: A bankโ€™s credit card portfolio has 1000 accounts, each with 2% monthly default probability.

Tasks (in groups of 3-4):

1. Calculate expected number of defaults per month (1 min)

2. Calculate the standard deviation (1 min)

3. If portfolio grows to 10,000 accounts, what happens to expected defaults and standard deviation? Does the default probability change? (2 min)

05:00

4. Present your insights and discuss risk implications (1 min/group)

๐Ÿ“Œ Example 6: Market Research

๐Ÿ“Š Example 6

A market research firm surveys 20 randomly selected consumers about a new product. 6 respondents expressed interest in purchasing. Estimate \(p\), the true proportion of the population interested in the product.

๐Ÿ” Solution: Maximum Likelihood Approach

  • Let \(Y\) = number interested in product (observed \(Y = 6\))
  • \(Y \sim \text{Binomial}(n=20, p=?)\)
  • Likelihood of observing \(Y=6\):

\[ L(p) = P(Y = 6 | p) = \binom{20}{6} p^6 (1 - p)^{14} \]

Question: What value of \(p\) maximizes this likelihood?

๐ŸŽ“ Maximum Likelihood Estimation

To maximize \(L(p) = \binom{20}{6} p^6 (1 - p)^{14}\):

  • ๐ŸŽฏ \(\binom{20}{6}\) is constant relative to \(p\)
  • ๐Ÿ“ˆ Since \(\ln(x)\) is strictly increasing, maximize log-likelihood instead:

\[ \ln L(p) = \ln\binom{20}{6} + 6\ln(p) + 14\ln(1-p) \]

Differentiate with respect to \(p\):

\[ \frac{d\ln L}{dp} = \frac{6}{p} - \frac{14}{1-p} \]

๐Ÿ” Finding the Maximum Likelihood Estimate

Set the derivative equal to zero:

\[ \frac{6}{p} - \frac{14}{1-p} = 0 \]

Multiply both sides by \(p(1-p)\):

\[ 6(1-p) - 14p = 0 \]

\[ 6 - 6p - 14p = 0 \]

\[ 6 = 20p \]

\[ p = \frac{6}{20} = 0.3 \]

โœ… Solution Verification

โœ“ Final Solution

Check the second derivative: \[ \frac{d^2\ln L}{dp^2} = -\frac{6}{p^2} - \frac{14}{(1-p)^2} \]

This is always negative for \(0 < p < 1\), confirming a maximum at \(p = 6/20\).

Maximum Likelihood Estimate: \[ \boxed{\hat{p} = \frac{6}{20} = 0.30 = 30\%} \]

๐ŸŽฏ General Result

For binomial data, the Maximum Likelihood Estimator (MLE) is always: \[ \hat{p} = \frac{Y}{n} = \frac{\text{number of successes}}{\text{number of trials}} \]

This is simply the sample proportion - intuitive and optimal!

๐Ÿ“ Quiz #3: Mean and Variance

A loan portfolio has 25 loans, each with 40% default probability (independent). What are \(E(Y)\) and \(SD(Y)\)?

Hint: \(E(Y) = np\) and \(SD(Y) = \sqrt{npq}\)

  • \(E(Y) = 10\), \(SD(Y) = 2.45\)
  • \(E(Y) = 12\), \(SD(Y) = 3.00\)
  • \(E(Y) = 10\), \(SD(Y) = 6.00\)
  • \(E(Y) = 15\), \(SD(Y) = 2.45\)
03:00

๐Ÿค” Think-Pair-Share Activity #3

๐Ÿญ Activity 4: Quality Control Application (5 minutes)

Scenario: A manufacturing companyโ€™s historical defect rate is 2%. Todayโ€™s inspection found 5 defects in a sample of 50 products.

๐Ÿง  Think (2 min): Is this evidence that the defect rate has increased? Under \(p = 0.02\), whatโ€™s \(E(Y)\)?

๐Ÿ‘ฅ Pair (2 min): Calculate (or estimate) \(P(Y \geq 5)\) when \(n=50\), \(p=0.02\). Use the fact that \(E(Y) = 1\). Is 5 unusually high?

05:00

๐Ÿ—ฃ๏ธ Share (1 min): Present conclusions about whether quality has deteriorated

๐Ÿ“ Summary

โœ… Key Takeaways

Five Properties of Binomial Experiments:

  1. ๐Ÿ”ข Fixed number of trials \(n\)

  2. โš–๏ธ Two outcomes: success or failure

  3. ๐ŸŽฏ Constant probability \(p\)

  4. ๐Ÿ”— Independent trials

  5. ๐Ÿ“Š Count of successes \(Y\)

Essential Formulas: \[ p(y) = \binom{n}{y}p^yq^{n-y}, \quad E(Y) = np, \quad V(Y) = npq, \quad \hat{p}_{\text{MLE}} = \frac{Y}{n} \]

Applications: Credit risk, portfolio management, quality control, market research, insurance claims

๐Ÿ“š Practice Problems

๐Ÿ“ Homework Problems

  1. ๐Ÿ’ป Investment Portfolio: A venture capital fund invests in 10 startups. If 5% typically fail within one year, find:
    • \(P(\text{at most 2 fail})\)
    • Expected failures and standard deviation
    • Is observing 3 failures unusual?
  2. ๐Ÿฅ Insurance Claims: An insurance company expects 8% of policyholders to file claims. For 15 randomly selected policies:
    • Find \(P(\text{exactly 2 claims})\)
    • Calculate \(E(Y)\) and \(SD(Y)\)
    • If 4 file claims, should the company investigate?
  3. ๐Ÿ“ฑ Market Research: Historical data shows 35% of customers convert after seeing an ad. Sample of 20:
    • Find \(P(Y = 7)\)
    • Find \(P(Y \geq 10)\)
    • If 12 converted, estimate the new conversion rate using MLE

Due: Next class meeting

๐ŸŒ Additional Resources

๐Ÿ“– For Further Study

Interactive Tools:

Statistical Software:

  • ๐Ÿ“Š R: dbinom(), pbinom(), qbinom(), rbinom()

  • ๐Ÿ Python: scipy.stats.binom

  • ๐Ÿ“ˆ Excel: =BINOM.DIST()

Textbook Reading:

  • Chapter 3.4: The Binomial Distribution

  • Chapter 3.5: Applications and Examples

  • Review: Expected Value and Variance (Chapter 3.2)

๐Ÿ‘‹ Thank You!

๐Ÿ“ฌ Contact Information:

Samir Orujov
Assistant Professor
School of Business ADA University

๐Ÿ“ง Email: sorujov@ada.edu.az
๐Ÿข Office: D312 โฐ Office Hours: By appointment

๐Ÿ“… Next Class:

  • Topic: Poisson Distribution

  • Reading: Chapter 3.6

  • Preparation: Review binomial concepts

โฐ Reminders:

  • โœ… Complete all homework problems

  • โœ… Review MLE concept thoroughly

  • โœ… Work hard

โ“ Questions?

๐Ÿ’ฌ Open Discussion (5 minutes)

Feel free to ask questions about:

  • ๐Ÿ“Š Binomial distribution properties and assumptions

  • ๐Ÿงฎ Calculating probabilities using formulas or tables

  • ๐Ÿ“ˆ Mean and variance interpretations

  • ๐ŸŽฏ Maximum likelihood estimation methodology

  • ๐Ÿ’ผ Applications to finance, economics, or your research

05:00