Wald's equation
Encyclopedia
In probability theory
, Wald's equation, Wald's identity or Wald's lemma is an important identity
that simplifies the calculation of the expected value
of the sum of a random number of random quantities. In its simplest form, it relates the expectation of a sum of randomly many finite-mean, identically distributed random variable
s to the expected number of terms in the sum and the random variables' common expectation under the condition that the number of terms in the sum is independent of the summands. The equation is named after the mathematician
Abraham Wald
. An identity for the second moment is given by the Blackwell–Girshick equation.
} be an infinite sequence
of real-valued, finite-mean random variables and let N be a nonnegative integer-valued random variable. Assume thatNEWLINE
below for the necessity. Assumption (4) is of more technical nature, implying absolute convergence
and therefore allowing arbitrary rearrangement of an infinite series in the proof. Assumption (4) can be strengthened to the simpler condition NEWLINE[ Proof] , which is finite by assumption (1). Therefore, (1) and (5) imply assumption (4).
Assume in addition to (1) and (2) thatNEWLINE
when considering the total claim amount within a certain time period, say one year, arising from a random number N of individual insurance claims, whose sizes are described by the random variables (Xn)n∈ℕ. Under the above assumptions, Wald's equation can be used to calculate the expected total claim amount when information about the average claim number per year and the average claim size is available. Under stronger assumptions and with more information about the underlying distributions, Panjer's recursion
can be used to calculate the distribution of S.[ Z] = 0. Define Xn = (–1)nZ for all n ∈ ℕ. Then assumptions (1), (2), (6), and (7) with C := E[ |Z|] are satisfied, hence also (3) and (5), and Wald's equation applies. If the distribution of Z is not symmetric, then (8) does not hold. Note that, when Z is not almost surely equal to the zero random variable, then (10) and (11) cannot hold simultaneously for any filtration (Fn)n∈ℕ, because Z cannot be independent of itself as E[ Z 2] = (E[ Z] )2 = 0 is impossible.
[ N] < ∞ by the ratio test. The assumptions (1), (8), hence (2) and (7) with C = 1, (9), (10), and (11) hold, hence also (3), and (5) and Wald's equation applies. However, (6) does not hold, because N is defined in terms of the sequence (Xn)n∈ℕ. Intuitively, one might expect to have E[ S] > 0 in this example, because the summation stops right after a one, thereby apparently creating a positive bias. However, Wald's equation shows that this intuition is misleading.
[ S] = 0, but E[ X1] = ½ and E[ N] = ½ and therefore Wald's equation does not hold. Indeed, the assumptions (1), (2) and (4) are satisfied, however, the equation in assumption (3) holds for all n ∈ ℕ except for n = 1.
[ Xn] = 0 for all n ∈ ℕ, assumption (2) holds. However, since S = 1 almost surely, Wald's equation cannot hold.
Since N is a stopping time with respect to the filtration generated by (Xn)n∈ℕ, assumption (3) holds, see above. Therefore, only assumption (4) can fail, and indeed, since
and therefore P(N ≥ n) = 1/2n–1 for every n ∈ ℕ, it follows that
Probability theory
Probability theory is the branch of mathematics concerned with analysis of random phenomena. The central objects of probability theory are random variables, stochastic processes, and events: mathematical abstractions of non-deterministic events or measured quantities that may either be single...
, Wald's equation, Wald's identity or Wald's lemma is an important identity
Identity (mathematics)
In mathematics, the term identity has several different important meanings:*An identity is a relation which is tautologically true. This means that whatever the number or value may be, the answer stays the same. For example, algebraically, this occurs if an equation is satisfied for all values of...
that simplifies the calculation of the expected value
Expected value
In probability theory, the expected value of a random variable is the weighted average of all possible values that this random variable can take on...
of the sum of a random number of random quantities. In its simplest form, it relates the expectation of a sum of randomly many finite-mean, identically distributed random variable
Random variable
In probability and statistics, a random variable or stochastic variable is, roughly speaking, a variable whose value results from a measurement on some type of random process. Formally, it is a function from a probability space, typically to the real numbers, which is measurable functionmeasurable...
s to the expected number of terms in the sum and the random variables' common expectation under the condition that the number of terms in the sum is independent of the summands. The equation is named after the mathematician
Mathematician
A mathematician is a person whose primary area of study is the field of mathematics. Mathematicians are concerned with quantity, structure, space, and change....
Abraham Wald
Abraham Wald
- See also :* Sequential probability ratio test * Wald distribution* Wald–Wolfowitz runs test...
. An identity for the second moment is given by the Blackwell–Girshick equation.
Statement
Let {Xn ; n ∈ ℕNatural number
In mathematics, the natural numbers are the ordinary whole numbers used for counting and ordering . These purposes are related to the linguistic notions of cardinal and ordinal numbers, respectively...
} be an infinite sequence
Sequence
In mathematics, a sequence is an ordered list of objects . Like a set, it contains members , and the number of terms is called the length of the sequence. Unlike a set, order matters, and exactly the same elements can appear multiple times at different positions in the sequence...
of real-valued, finite-mean random variables and let N be a nonnegative integer-valued random variable. Assume thatNEWLINE
- NEWLINE
- N has finite expectation, NEWLINE
- {Xn ; n∈ℕ} all have the same expectation, NEWLINE
- E
[ Xn1{N ≥ n}] = E[ Xn] P(N ≥ n) for every natural numberNatural numberIn mathematics, the natural numbers are the ordinary whole numbers used for counting and ordering . These purposes are related to the linguistic notions of cardinal and ordinal numbers, respectively...
n, and NEWLINE - the infinite series satisfies
- NEWLINE
- NEWLINE
- NEWLINE
Discussion of assumptions
Clearly, assumptions (1) and (2) are needed to formulate Wald's equation. Assumption (3) controls the amount of dependence allowed between the sequence (Xn)n∈ℕ and the number N of terms, see the counterexampleCounterexample
In logic, and especially in its applications to mathematics and philosophy, a counterexample is an exception to a proposed general rule. For example, consider the proposition "all students are lazy"....
below for the necessity. Assumption (4) is of more technical nature, implying absolute convergence
Absolute convergence
In mathematics, a series of numbers is said to converge absolutely if the sum of the absolute value of the summand or integrand is finite...
and therefore allowing arbitrary rearrangement of an infinite series in the proof. Assumption (4) can be strengthened to the simpler condition NEWLINE
- NEWLINE
- 5. there exists a constant C such that E
[ |Xn|1{N ≥ n}] ≤ C P(N ≥ n) for all natural numbers n.
- NEWLINE
- 6. N is independent of the sequence (Xn)n∈N and NEWLINE
- 7. there exists a constant C such that E
[ |Xn|] ≤ C for all natural numbers n.
- NEWLINE
- 8. the random variables (Xn)n∈ℕ all have the same distribution.
- NEWLINE
- 9. N is a stopping time with respect to the filtration, and NEWLINE
- 10. Xn and Fn–1 are independent for every n ∈ ℕ.
- NEWLINE
- 11. the sequence (Xn)n∈ℕ is adaptedAdapted processIn the study of stochastic processes, an adapted process is one that cannot "see into the future". An informal interpretation is that X is adapted if and only if, for every realisation and every n, Xn is known at time n...
to the filtration (Fn)n∈ℕ, meaning the Xn is Fn-measurable for every n ∈ ℕ.
Application
An application is in actuarial scienceActuarial science
Actuarial science is the discipline that applies mathematical and statistical methods to assess risk in the insurance and finance industries. Actuaries are professionals who are qualified in this field through education and experience...
when considering the total claim amount within a certain time period, say one year, arising from a random number N of individual insurance claims, whose sizes are described by the random variables (Xn)n∈ℕ. Under the above assumptions, Wald's equation can be used to calculate the expected total claim amount when information about the average claim number per year and the average claim size is available. Under stronger assumptions and with more information about the underlying distributions, Panjer's recursion
Panjer recursion
The Panjer recursion is an algorithm to compute the probability distribution of a compound random variablewhere both N\, and X_i\, are random variables and of special types. In more general cases the distribution of S is a compound distribution. The recursion for the special cases considered was...
can be used to calculate the distribution of S.