Mathematical Statistics
ADA University, School of Business
Information Communication Technologies Agency, Statistics Unit
2025-10-18
🤔 Think (1 minute): In your own words, what does “statistics” mean to you? How do you use it in daily life?
👥 Pair (2 minutes): Share your definition with a neighbor. Find common themes.
🗣️ Share: Let’s hear some definitions before we explore formal ones!
Webster’s New Collegiate Dictionary
“A branch of mathematics dealing with the collection, analysis, interpretation, and presentation of masses of numerical data.”
Stuart and Ord (1991)
“The branch of the scientific method which deals with the data obtained by counting or measuring the properties of populations.”
Rice (1995)
“Essentially concerned with procedures for analyzing data, especially data that in some vague sense have a random character.”
Key Insight
All definitions emphasize data and uncertainty! 🎲
Freund and Walpole (1987)
Statistics encompasses “the science of basing inferences on observed data and the entire problem of making decisions in the face of uncertainty.”
Mood, Graybill, and Boes (1974)
Statistics is “the technology of the scientific method” and is concerned with:
Central Goal of Statistics
The objective of statistics is to make an inference about a population based on information contained in a sample from that population and to provide an associated measure of goodness for the inference.
Definition 1: Random Variable
A random variable is a real-valued function for which the domain is a sample space.
In simple terms: A random variable assigns a numerical value to each outcome in a sample space.
\[Y: S \rightarrow \mathbb{R}\]
Economic Applications:
Financial Applications:
Problem Setup
An investor evaluates two stocks and decides to invest in one. Based on market analysis: - Probability both perform well: 0.25 - Probability exactly one performs well: 0.50 - Probability neither performs well: 0.25
Let \(Y\) equal the number of stocks that perform well.
Tasks:
Step 1: Sample Space
\[S = \{BB, BG, GB, GG\}\] where B = bad performance, G = good performance
Step 2: Probability Distribution
| \(Y\) | Sample Points | \(P(Y = y)\) |
|---|---|---|
| 0 | {BB} | 0.25 |
| 1 | {BG, GB} | 0.50 |
| 2 | {GG} | 0.25 |
Problem Setup
Consider an experiment as tossing two coins and observing the results. Let \(Y\) equal the number of heads obtained.
Tasks:
Step 1: Sample Space
\(S = \{HH, HT, TH, TT\}\)
Step 2: Assign Values
| Sample Point | \(Y\) (Number of Heads) |
|---|---|
| HH | 2 |
| HT | 1 |
| TH | 1 |
| TT | 0 |
Step 3: Values of Random Variable
| \(Y\) | Sample Points | \(P(Y = y)\) |
|---|---|---|
| 0 | {TT} | 1/4 |
| 1 | {HT, TH} | 2/4 = 1/2 |
| 2 | {HH} | 1/4 |
Visualization:
Remark 1
\(P(Y = y)\) is the sum of the probabilities of the sample points that are assigned the value \(y\).
Example:
\[P(Y = 1) = P(\{HT\}) + P(\{TH\}) = \frac{1}{4} + \frac{1}{4} = \frac{1}{2}\]
If we toss three coins, how many possible values can the random variable “number of heads” take?
Definition 2: Random Sample
Let \(N\) and \(n\) represent the numbers of elements in the population and sample, respectively.
If the sampling is conducted in such a way that each of the \(\binom{N}{n}\) samples has an equal probability of being selected, the sampling is said to be random, and the result is said to be a random sample.
Remark 2
The method of sampling, known as the design of an experiment, affects both:
Hence, every sampling procedure must be clearly described if we wish to make valid inferences from sample to population.
🤔 Think (1 minute): You need to survey 100 students from a university of 10,000 students. What methods could you use? Which would give a random sample?
👥 Pair (2 minutes): Discuss potential biases in different sampling methods.
🗣️ Share: What are the pros and cons of each method?
Definition 3: Discrete Random Variable
A random variable \(Y\) is said to be discrete if it can assume only a finite or countably infinite number of distinct values.
Examples in Finance/Economics:
Standard Notation
Definition 4: Probability Function
The probability that \(Y\) takes on the value \(y\), \(P(Y = y)\), is defined as the sum of the probabilities of all sample points in \(S\) that are assigned the value \(y\).
We will sometimes denote \(P(Y = y)\) by \(p(y)\), which is called probability function for \(Y\).
\[p(y) = P(Y = y) = \sum_{\omega \in S: Y(\omega) = y} P(\{\omega\})\]
Definition 5: Probability Distribution
The probability distribution for a discrete variable \(Y\) can be represented by a formula, a table, or a graph that provides \(p(y) = P(Y = y)\) for all \(y\).
Three Representations:
Problem Setup
A supervisor in a manufacturing plant has three men and three women working for him. He wants to choose two workers for a special job.
Not wishing to show any biases in his selection, he decides to select the two workers at random.
Let \(Y\) denote the number of women in his selection.
Task: Find the probability distribution for \(Y\).
Step 1: Identify Possible Values
\(Y\) can take values: 0, 1, or 2 (number of women selected)
Step 2: Total Number of Ways to Select 2 Workers
Total workers = 6 (3 men + 3 women)
\[\text{Total ways} = \binom{6}{2} = \frac{6!}{2!(6-2)!} = \frac{6 \times 5}{2 \times 1} = 15\]
Case 1: \(Y = 0\) (no women, 2 men)
\[P(Y = 0) = \frac{\binom{3}{2} \times \binom{3}{0}}{\binom{6}{2}} = \frac{3 \times 1}{15} = \frac{3}{15} = \frac{1}{5}\]
Case 2: \(Y = 1\) (1 woman, 1 man)
\[P(Y = 1) = \frac{\binom{3}{1} \times \binom{3}{1}}{\binom{6}{2}} = \frac{3 \times 3}{15} = \frac{9}{15} = \frac{3}{5}\]
Case 3: \(Y = 2\) (2 women, no men)
\[P(Y = 2) = \frac{\binom{3}{2} \times \binom{3}{0}}{\binom{6}{2}} = \frac{3 \times 1}{15} = \frac{3}{15} = \frac{1}{5}\]
Table Representation:
| \(y\) | \(p(y) = P(Y = y)\) |
|---|---|
| 0 | 1/5 = 0.20 |
| 1 | 3/5 = 0.60 |
| 2 | 1/5 = 0.20 |
| Sum | 1.00 |
Verification: \(\frac{1}{5} + \frac{3}{5} + \frac{1}{5} = \frac{5}{5} = 1\) ✅
Theorem 1: Fundamental Properties
For any discrete probability distribution, the following must be true:
\(0 \leq p(y) \leq 1\) for all \(y\)
\(\sum_{y} p(y) = 1\)
where the summation is over all values of \(y\) with nonzero probability.
Why are these important?
Activity (3 minutes)
In small groups, verify that Example 2 (worker selection) satisfies both properties of Theorem 1.
Check:
Solution:
Your Turn!
An investment portfolio contains 4 high-risk and 2 low-risk assets. A portfolio manager randomly selects 3 assets for quarterly review.
Let \(Y\) = number of high-risk assets selected.
Tasks:
Step 1: Possible Values
\(Y\) can be: 1, 2, or 3 (need at least 1 high-risk since only 2 low-risk assets)
Step 2: Total Ways to Select 3 from 6
\[\binom{6}{3} = \frac{6!}{3!3!} = 20\]
Case 1: \(Y = 1\) (1 high-risk, 2 low-risk)
\[P(Y = 1) = \frac{\binom{4}{1} \times \binom{2}{2}}{\binom{6}{3}} = \frac{4 \times 1}{20} = \frac{4}{20} = 0.20\]
Case 2: \(Y = 2\) (2 high-risk, 1 low-risk)
\[P(Y = 2) = \frac{\binom{4}{2} \times \binom{2}{1}}{\binom{6}{3}} = \frac{6 \times 2}{20} = \frac{12}{20} = 0.60\]
Case 3: \(Y = 3\) (3 high-risk, 0 low-risk)
\[P(Y = 3) = \frac{\binom{4}{3} \times \binom{2}{0}}{\binom{6}{3}} = \frac{4 \times 1}{20} = \frac{4}{20} = 0.20\]
| \(y\) | \(p(y)\) |
|---|---|
| 1 | 0.20 |
| 2 | 0.60 |
| 3 | 0.20 |
| Sum | 1.00 |
Verification:
Which of the following could be a valid probability distribution for a discrete random variable \(Y\)?
Why Discrete Random Variables Matter
Economics:
Finance:
Summary
Statistics is about making inferences from samples to populations
Random variables map sample space outcomes to numerical values
Discrete random variables take finite or countably infinite values
Probability distributions must satisfy:
Applications span economics, finance, and business decision-making
Problem 1: Exchange Rate Analysis
A currency can move Up, Down, or Unchanged daily. Historical data shows: P(Up)=0.4, P(Down)=0.3, P(Unchanged)=0.3.
Let \(Y\) = number of “Up” days in 2 trading days.
Problem 2: Loan Portfolio
A bank’s portfolio has 6 loans: 4 are performing well, 2 are underperforming. A regulator randomly audits 3 loans.
Let \(Y\) = number of underperforming loans in the audit.
Problem 3: Financial Transaction Model
Consider the probability function:
\[p(y) = \begin{cases} cy^2 & \text{for } y = 1, 2, 3 \\ 0 & \text{otherwise} \end{cases}\]
Problem 1 Hint:
List all possible outcomes for 2 days: (U,U), (U,D), (U,N), etc. Calculate probability for each outcome using independence.
Problem 2 Hint:
Use hypergeometric probability: \(P(Y=y) = \frac{\binom{2}{y}\binom{4}{3-y}}{\binom{6}{3}}\)
Problem 3 Hint:
Use property \(\sum_{y=1}^{3} cy^2 = 1\) to solve for \(c\). Then \(c(1^2 + 2^2 + 3^2) = c(14) = 1\).
Coming Up Next
Preparation: Review the concept of weighted averages and summation notation.
Thank you!
Office Hours: By appointment via email
Contact: sorujov@ada.edu.az

Mathematical Statistics - Discrete Random Variables