Problems

1990 Paper 2 Q16

D: 1600.0 B: 1494.9

conditional probability Bayes' theorem combinatorics probability equally likely outcomes law of total probability

Each day, I choose at random between my brown trousers, my grey trousers and my expensive but fashionable designer jeans. Also in my wardrobe, I have a black silk tie, a rather smart brown and fawn polka-dot tie, my regimental tie, and an elegant powder-blue cravat which I was given for Christmas. With my brown or grey trousers, I choose ties (including the cravat) at random, except of course that I don\textquoteright t wear the cravat with the brown trousers or the polka-dot tie with the grey trousers. With the jeans, the choice depends on whether it is Sunday or one of the six weekdays: on weekdays, half the time I wear a cream-coloured sweat-shirt with \(E=mc{}^{2}\) on the front and no tie; otherwise, and on Sundays (when naturally I always wear a tie), I just pick at random from my four ties. This morning, I received through the post a compromising photograph of myself. I often receive such photographs and they are equally likely to have been taken on any day of the week. However, in this particular photograph, I am wearing my black silk tie. Show that, on the basis of this information, the probability that the photograph was taken on Sunday is \(11/68\). I should have mentioned that on Mondays I lecture on calculus and I therefore always wear my jeans (to make the lectures seem easier to understand). Find, on the basis of the complete information, the probability that the photograph was taken on Sunday. [The phrase `at random' means `with equal probability'.]

View

1990 Paper 3 Q15

D: 1700.0 B: 1482.6

probability geometric distribution marginal distribution independence joint distribution expected value summation discrete random variables

An unbiased twelve-sided die has its faces marked \(A,A,A,B,B,B,B,B,B,B,B,B.\) In a series of throws of the die the first \(M\) throws show \(A,\) the next \(N\) throws show \(B\) and the \((M+N+1)\)th throw shows \(A\). Write down the probability that \(M=m\) and \(N=n\), where \(m\geqslant0\) and \(n\geqslant1.\) Find

the marginal distributions of \(M\) and \(N\),
the mean values of \(M\) and \(N\).

Investigate whether \(M\) and \(N\) are independent. Find the probability that \(N\) is greater than a given integer \(k\), where \(k\geqslant1,\) and find \(\mathrm{P}(N > M).\) Find also \(\mathrm{P}(N=M)\) and show that \(\mathrm{P}(N < M)=\frac{1}{52}.\)

View

1990 Paper 3 Q16

D: 1700.0 B: 1484.0

probability conditional probability geometric probability triangle inequality obtuse triangle uniform distribution bivariate distribution integration

A rod of unit length is cut into pieces of length \(X\) and \(1-X\); the latter is then cut in half. The random variable \(X\) is uniformly distributed over \([0,1].\) For some values of \(X\) a triangle can be formed from the three pieces of the rod. Show that the conditional probability that, if a triangle can be formed, it will be obtuse-angled is \(3-2\sqrt{2.}\)
The bivariate distribution of the random variables \(X\) and \(Y\) is uniform over the triangle with vertices \((1,0),(1,1)\) and \((0,1).\) A pair of values \(x,y\) is chosen at random from this distribution and a (perhaps degenerate) triangle \(ABC\) is constructed with \(BC=x\) and \(CA=y\) and \(AB=2-x-y.\) Show that the construction is always possible and that \(\angle ABC\) is obtuse if and only if \[ y>\frac{x^{2}-2x+2}{2-x}. \] Deduce that the probability that \(\angle ABC\) is obtuse is \(3-4\ln2.\)

View

1989 Paper 1 Q14

D: 1516.0 B: 1453.5

probability continuous probability distributions probability density function conditional distribution hypothesis testing expected value binomial distribution significance test

The prevailing winds blow in a constant southerly direction from an enchanted castle. Each year, according to an ancient tradition, a princess releases 96 magic seeds from the castle, which are carried south by the wind before falling to rest. South of the castle lies one league of grassy parkland, then one league of lake, then one league of farmland, and finally the sea. If a seed falls on land it will immediately grow into a fever tree. (Fever trees do not grow in water). Seeds are blown independently of each other. The random variable \(L\) is the distance in leagues south of the castle at which a seed falls to rest (either on land or water). It is known that the probability density function \(\mathrm{f}\) of \(L\) is given by \[ \mathrm{f}(x)=\begin{cases} \frac{1}{2}-\frac{1}{8}x & \mbox{ for }0\leqslant x\leqslant4,\\ 0 & \mbox{ otherwise.} \end{cases} \] What is the mean number of fever trees which begin to grow each year?

The random variable \(Y\) is defined as the distance in leagues south of the castle at which a new fever tree grows from a seed carried by the wind. Sketch the probability density function of \(Y\), and find the mean of \(Y\).
One year messengers bring the king the news that 23 new fever trees have grown in the farmland. The wind never varies, and so the king suspects that the ancient tradition have not been followed properly. Is he justified in his suspicions?

View

1989 Paper 1 Q15

D: 1500.0 B: 1516.0

Poisson distribution uniform distribution binomial distribution probability conditional Bayes' theorem expectation optimisation route choice

I can choose one of three routes to cycle to school. Via Angle Avenue the distance is 5\(\,\)km, and I am held up at a level crossing for \(A\) minutes, where \(A\) is a continuous random variable uniformly distributed between \(0\) and 10. Via Bend Boulevard the distance is 4\(\,\)km, and I am delayed, by talking to each of \(B\) friends for 3\(\,\)minutes, for a total of \(3B\) minutes, where \(B\) is a random variable whose distribution is Poisson with mean 4. Via Detour Drive the distance should be only 2\(\,\)km, but in addition, due to never-ending road works, there are five places at each of which, with probability \(\frac{4}{5},\) I have to make a detour that increases the distance by 1\(\,\)km. Except when delayed by talking to friends or at the level crossing, I cycle at a steady 12\(\,\)km\(\,\)h\(^{-1}\). For each of the three routs, calculate the probability that a journey lasts at least 27 minutes. Each day I choose one of the three routes at random, and I am equally likely to choose any of the three alternatives. One day I arrive at school after a journey of at least 27 minutes. What is the probability that I came via Bend Boulevard? Which route should I use all the time: \begin{questionparts} \item if I wish my average journey time to be as small as possible; \item if I wish my journey time to be less than 32 minutes as often as possible? \end{questionpart} Justify your answers.

View

1989 Paper 1 Q16

D: 1516.0 B: 1470.2

probability conditional game theory expected value optimisation strategy linear programming

A and B play a guessing game. Each simultaneously names one of the numbers \(1,2,3.\) If the numbers differ by 2, whoever guessed the smaller pays the opponent £\(2\). If the numbers differ by 1, whoever guessed the larger pays the opponent £\(1.\) Otherwise no money changes hands. Many rounds of the game are played.

If A says he will always guess the same number \(N\), explain (for each value of \(N\)) how B can maximise his winnings.
In an attempt to improve his play, A announces that he will guess each number at random with probability \(\frac{1}{3},\) guesses on different rounds being independent. To counter this, B secretly decides to guess \(j\) with probability \(b_{j}\) (\(j=1,2,3,\, b_{1}+b_{2}+b_{3}=1\)), guesses on different rounds being independent. Derive an expression for B's expected winnings on any round. How should the probabilities \(b_{j}\) be chosen so as to maximize this expression?
A now announces that he will guess \(j\) with probability \(a_{j}\) (\(j=1,2,3,\, a_{1}+a_{2}+a_{3}=1\)). If B guesses \(j\) with probability \(b_{j}\) (\(j=1,2,3,\, b_{1}+b_{2}+b_{3}=1\)), obtain an expression for his expected winnings in the form \[ Xa_{1}+Ya_{2}+Za_{3}. \] Show that he can choose \(b_{1},b_{2}\) and \(b_{3}\) such that \(X,Y\) and \(Z\) are all non-negative. Deduce that, whatever values for \(a_{j}\) are chosen by A, B can ensure that in the long run he loses no money.

View

1989 Paper 2 Q16

D: 1600.0 B: 1484.0

probability binomial distribution quality control conditional probability expected value combinatorics verification

Widgets are manufactured in batches of size \((n+N)\). Any widget has a probability \(p\) of being faulty, independent of faults in other widgets. The batches go through a quality control procedure in which a sample of size \(n\), where \(n\geqslant2\), is taken from each batch and tested. If two or more widgets in the sample are found to be faulty, all widgets in the batch are tested and all faults corrected. If fewer than two widgets in the sample are found to be faulty, the sample is replaced in the batch and no faults are corrected. Show that the probability that the batch contains exactly \(k\), where \(k\leqslant N\), faulty widgets after quality control is \[ \frac{\left[N+1+k\left(n-1\right)\right]N!}{\left(N-k+1\right)!k!}p^{k}\left(1-p\right)^{N+n-k}, \] and verify that this formula also gives the correct answer for \(k=N+1\). Show that the expected number of faulty widgets in a batch after quality control is \[ \left[N+n+pN(n-1)\right]p(1-p)^{n-1}. \]

View

1989 Paper 3 Q15

D: 1700.0 B: 1503.8

probability cumulative distribution function uniform distribution order statistics median probability density function integration expectation variance

The continuous random variable \(X\) is uniformly distributed over the interval \([-c,c].\) Write down expressions for the probabilities that:

\(n\) independently selected values of \(X\) are all greater than \(k\),
\(n\) independently selected values of \(X\) are all less than \(k\),

where \(k\) lies in \([-c,c]\). A sample of \(2n+1\) values of \(X\) is selected at random and \(Z\) is the median of the sample. Show that \(Z\) is distributed over \([-c,c]\) with probability density function \[ \frac{(2n+1)!}{(n!)^{2}(2c)^{2n+1}}(c^{2}-z^{2})^{n}. \] Deduce the value of \({\displaystyle \int_{-c}^{c}(c^{2}-z^{2})^{n}\,\mathrm{d}z.}\) Evaluate \(\mathrm{E}(Z)\) and \(\mathrm{var}(Z).\)

Solution:

\begin{align*} \mathbb{P}(n\text{ independent values of }X > k) &= \prod_{i=1}^n \mathbb{P}(X > k) \\ &= \left ( \frac{c-k}{2c}\right)^n \end{align*}
\begin{align*} \mathbb{P}(n\text{ independent values of }X < k) &= \prod_{i=1}^n \mathbb{P}(X < k) \\ &= \left ( \frac{k+c}{2c}\right)^n \end{align*}

\begin{align*} &&\mathbb{P}(\text{median} < z+\delta \text{ and median} > z - \delta) &= \mathbb{P}(n\text{ values } < z - \delta \text{ and } n \text{ values} > z + \delta) \\ &&&= \binom{2n+1}{n,n,1} \left ( \frac{c-(z+\delta)}{2c}\right)^n\left ( \frac{(z-\delta)+c}{2c}\right)^n \frac{2 \delta}{2 c} \\ &&&= \frac{(2n+1)!}{n! n!} \frac{((c-(z+\delta))(c+(z-\delta)))^n 2\delta}{2^n c^n} \\ &&&= \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}((c-(z+\delta))(c+(z-\delta)))^n 2\delta \\ \Rightarrow && \lim_{\delta \to 0} \frac{\mathbb{P}(\text{median} < z+\delta \text{ and median} > z - \delta)}{2 \delta} &= \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}((c-z)(c+z))^n \\ &&&= \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}(c^2-z^2) \\ \end{align*} \begin{align*} && 1 &= \int_{-c}^c \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}(c^2-z^2)^n \d z \\ \Rightarrow && \frac{(n!)^2 (2c)^{2n+1}}{(2n+1)!} &= \int_{-c}^c (c^2-z^2)^n \d z \end{align*} \begin{align*} \mathbb{E}(Z) &= \int_{-c}^c z \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}(c^2-z^2)^n \d z \\ &=\frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}} \int_{-c}^c z (c^2-z^2)^n \d z \\ &= 0 \end{align*} \begin{align*} \mathrm{Var}(Z) &= \mathbb{E}(Z^2) - \mathbb{E}(Z)^2 \\ &= \mathbb{E}(Z^2) \\ &= \int_{-c}^c z^2 \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}}(c^2-z^2)^n \d z \\ &=\frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}} \int_{-c}^c z^2 (c^2-z^2)^n \d z \\ &=\frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}} \left ( \left [ -\frac{1}{2(n+1)}z(c^2-z^2)^{n+1} \right]_{-c}^c + \frac{1}{2(n+1)}\int_{-c}^c (c^2-z^2)^{n+1} \d z \right) \\ &= \frac{(2n+1)!}{(n!)^2 (2c)^{2n+1}} \frac{1}{2(n+1)} \frac{((n+1)!)^2 (2c)^{2n+3}}{(2n+3)!} \\ &= \frac{(n+1)^2(2c)^2}{(n+1)(2n+2)(2n+3)} \\ &= \frac{2c^2}{2n+3} \end{align*}

View

1989 Paper 3 Q16

D: 1700.0 B: 1484.0

probability conditional probability approximating binomial to normal distribution normal approximation hypothesis testing population model joint probability table continuity correction

It is believed that the population of Ruritania can be described as follows:

\(25\%\) are fair-haired and the rest are dark-haired;
\(20\%\) are green-eyed and the rest hazel-eyed;
the population can also be divided into narrow-headed and broad-headed;
no narrow-headed person has green eyes and fair hair;
those who are green-eyed are as likely to be narrow-headed as broad-headed;
those who are green-eyed and broad-headed are as likely to be fair-headed as dark-haired;
half of the population is broad-headed and dark-haired;
a hazel-eyed person is as likely to be fair-haired and broad-headed as dark-haired and narrow-headed.

Find the proportion believed to be narrow-headed. I am acquainted with only six Ruritanians, all of whom are broad-headed. Comment on this observation as evidence for or against the given model. A random sample of 200 Ruritanians is taken and is found to contain 50 narrow-heads. On the basis of the given model, calculate (to a reasonable approximation) the probability of getting 50 or fewer narrow-heads. Comment on the result.

View

1988 Paper 1 Q15

D: 1500.0 B: 1484.0

probability discrete distribution hypothesis testing significance test binomial distribution expected value sports modelling conditional probability

In Fridge football, each team scores two points for a goal and one point for a foul committed by the opposing team. In each game, for each team, the probability that the team scores \(n\) goals is \(\left(3-\left|2-n\right|\right)/9\) for \(0\leqslant n\leqslant4\) and zero otherwise, while the number of fouls committed against it will with equal probability be one of the numbers from \(0\) to \(9\) inclusive. The numbers of goals and fouls of each team are mutually independent. What is the probability that in some game a particular team gains more than half its points from fouls? In response to criticisms that the game is boring and violent, the ruling body increases the number of penalty points awarded for a foul, in the hope that this will cause large numbers of fouls to be less probable. During the season following the rule change, 150 games are played and on 12 occasions (out of 300) a team committed 9 fouls. Is this good evidence of a change in the probability distribution of the number of fouls? Justify your answer.

View

1988 Paper 1 Q16

D: 1500.0 B: 1498.6

probability continuous probability distribution probability density function conditional probability Bayes' theorem integration trigonometric substitution rust/decay model

Wondergoo is applied to all new cars. It protects them completely against rust for three years, but thereafter the probability density of the time of onset of rust is proportional to \(t^{2}/(1+t^{2})^{2}\) for a car of age \(3+t\) years \((t\geqslant0)\). Find the probability that a car becomes rusty before it is \(3+t\) years old. Every car is tested for rust annually on the anniversary of its manufacture. If a car is not rusty, it will certainly pass; if it is rusty, it will pass with probability \(\frac{1}{2}.\) Cars which do not pass are immediately taken off the road and destroyed. What is the probability that a randomly selected new car subsequently fails a test taken on the fifth anniversary of its manufacture? Find also the probability that a car which was destroyed immediately after its fifth anniversary test was rusty when it passed its fourth anniversary test.

Solution: Given the probability density after \(3\) years is proportional to \(\frac{t^2}{(1+t^2)^2}\) then we must have that: \begin{align*} && 1 &= A \int_0^{\infty} \frac{t^2}{(1+t^2)^2} \, \d t \\ &&&= A \left [ -\frac12 \frac{t}{1+t^2} \right]_0^{\infty} + \frac{A}2 \int_0^{\infty} \frac{1}{1+t^2} \d t \\ &&&= \frac{A}{2} \frac{\pi}{2} \\ \Rightarrow && A &= \frac{4}{\pi} \end{align*} In order to fail a test on the fifth anniversary, there are two possibilities for when we went faulty. We could have gone faulty before \(4\) years, got lucky once and then failed the second test, or gone faulty in the next year and then failed the first test. \begin{align*} \P(\text{rusty before } 4 \text{ years}) &=\frac{4}{\pi} \int_0^1 \frac{t^2}{(1+t^2)^2} \d t \\ &= \frac{4}{\pi} \left [ -\frac12 \frac{t}{1+t^2} \right]_0^{1} + \frac{2}{\pi} \int_0^{1} \frac{1}{1+t^2} \d t \\ &= -\frac{1}{\pi} + \frac{2}{\pi} \frac{\pi}{4} \\ &= \frac12 - \frac{1}{\pi} \\ &\approx 0.181690\cdots \\ \\ \P(\text{rusty before } 5 \text{ years}) &=\frac{4}{\pi} \int_0^1 \frac{t^2}{(1+t^2)^2} \d t \\ &= \frac{4}{\pi} \left [ -\frac12 \frac{t}{1+t^2} \right]_0^{2} + \frac{2}{\pi} \int_0^{2} \frac{1}{1+t^2} \d t \\ &= -\frac{4}{5\pi} + \frac{2}{\pi} \tan^{-1} 2 \\ &\approx 0.450184\cdots \\ \end{align*} Therefore: \begin{align*} \P(\text{fails 5th anniversary}) &= \P(\text{rusty before } 4 \text{ years}) \P(\text{pass one, fail other}) + \\ & \quad \quad + \P(\text{rusty between 4 and 5 years}) \P(\text{fail}) \\ &= 0.181690\cdots \cdot \frac{1}{4} + \frac{1}{2} ( 0.450184\cdots- 0.181690\cdots) \\ &= \frac{1}{2} 0.450184\cdots - \frac{1}{4} 0.181690\cdots \\ &= 0.1796688\cdots \\ &= 18.0\%\,\, (3\text{ s.f.}) \end{align*} We also must have that: \begin{align*} \P(\text{rusty at 4 years}|\text{destroyed at 5}) &= \frac{\P(\text{rusty at 4 years and destroyed at 5})}{\P(\text{destroyed at 5})} \\ &= \frac{0.181690\cdots \cdot \frac{1}{4}}{\frac{1}{2} 0.450184\cdots - \frac{1}{4} 0.181690\cdots} \\ &= 0.252811\cdots \\ &= 25.3\%\,\,(3\text{ s.f.}) \end{align*}

View

1988 Paper 2 Q15

D: 1600.0 B: 1516.0

probability uniform distribution discrete random variables convolution summation counting arguments symmetry

An examination consists of several papers, which are marked independently. The mark given for each paper can be an integer from \(0\) to \(m\) inclusive, and the total mark for the examination is the sum of the marks on the individual papers. In order to make the examination completely fair, the examiners decide to allocate the mark for each paper at random, so that the probability that any given candidate will be allocated \(k\) marks \((0\leqslant k\leqslant m)\) for a given paper is \((m+1)^{-1}\). If there are just two papers, show that the probability that a given candidate will receive a total of \(n\) marks is \[ \frac{2m-n+1}{\left(m+1\right)^{2}} \] for \(m< n\leqslant2m\), and find the corresponding result for \(0\leqslant n\leqslant m\). If the examination consists of three papers, show that the probability that a given candidate will receive a total of \(n\) marks is \[ \frac{6mn-4m^{2}-2n^{2}+3m+2}{2\left(m+1\right)^{2}} \] in the case \(m< n\leqslant2m\). Find the corresponding result for \(0\leqslant n\leqslant m\), and deduce the result for \(2m< n\leqslant3m\).

View

1988 Paper 2 Q16

D: 1600.0 B: 1570.7

normal distribution probability quadratic equation real roots conditional probability expected value discriminant Vieta's formulas

Find the probability that the quadratic equation \[ X^{2}+2BX+1=0 \] has real roots when \(B\) is normally distributed with zero mean and unit variance. Given that the two roots \(X_{1}\) and \(X_{2}\) are real, find:

the probability that both \(X_{1}\) and \(X_{2}\) are greater than \(\frac{1}{5}\);
the expected value of \(\left|X_{1}+X_{2}\right|\);

giving your answers to three significant figures.

View

1988 Paper 3 Q15

D: 1700.0 B: 1486.2

sequences recurrence relation probability expectation particular integral linear recurrence combinatorics

Each day, books returned to a library are placed on a shelf in order of arrival, and left there. When a book arrives for which there is no room on the shelf, that book and all books subsequently returned are put on a trolley. At the end of each day, the shelf and trolley are cleared. There are just two-sizes of book: thick, requiring two units of shelf space; and thin, requiring one unit. The probability that a returned book is thick is \(p\), and the probability that it is thin is \(q=1-p.\) Let \(M(n)\) be the expected number of books that will be put on the shelf, when the length of the shelf is \(n\) units and \(n\) is an integer, on the assumption that more books will be returned each day than can be placed on the shelf. Show, giving reasoning, that

\(M(0)=0;\)
\(M(1)=q;\)
\(M(n)-qM(n-1)-pM(n-2)=1,\) for \(n\geqslant2.\)

Verify that a possible solution to these equations is \[ M(n)=A(-p)^{n}+B+Cn, \] where \(A,B\) and \(C\) are numbers independent of \(n\) which you should express in terms of \(p\).

View

1988 Paper 3 Q16

D: 1700.0 B: 1610.5

hypergeometric distribution probability sampling without replacement indicator random variables expectation variance covariance combinatorics

Balls are chosen at random without replacement from an urn originally containing \(m\) red balls and \(M-m\) green balls. Find the probability that exactly \(k\) red balls will be chosen in \(n\) choices \((0\leqslant k\leqslant m,0\leqslant n\leqslant M).\) The random variables \(X_{i}\) \((i=1,2,\ldots,n)\) are defined for \(n\leqslant M\) by \[ X_{i}=\begin{cases} 0 & \mbox{ if the \(i\)th ball chosen is green}\\ 1 & \mbox{ if the \(i\)th ball chosen is red. } \end{cases} \] Show that

\(\mathrm{P}(X_{i}=1)=\dfrac{m}{M}.\)
\(\mathrm{P}(X_{i}=1\mbox{ and }X_{j}=1)=\dfrac{m(m-1)}{M(M-1)}\), for \(i\neq j\).

Find the mean and variance of the random variable \(X\) defined by \[ X=\sum_{i=1}^{n}X_{i}. \]

Solution: There are \(\displaystyle \binom{m}{k} \binom{M-m}{n-k}\) ways to choose \(k\) red and and \(n-k\) green balls out of a total \(\displaystyle \binom{M}{n}\) ways to choose balls. Therefore the probability is: \[ \mathbb{P}(\text{exactly }k\text{ red balls in }n\text{ choices}) = \frac{\binom{m}{k} \binom{M-m}{n-k}}{ \binom{M}{n}}\]

Note that there is nothing special about the \(i\)th ball chosen. (We could consider all draws look at the \(i\)th ball, or consider all draws apply a permutation to make the \(i\)th ball the first ball, and both would look like identical sequences). Therefore \(\mathbb{P}(X_i = 1) = \mathbb{P}(X_1 = 1) = \frac{m}{M}\).
Similarly we could apply a permutation to all sequences which takes the \(i\)th ball to the first ball and the \(j\)th ball to the second ball, therefore: \begin{align*} \mathbb{P}(X_i = 1, X_j = 1) &= \mathbb{P}(X_1 = 1, X_2 = 1) \\ &= \mathbb{P}(X_1 = 1) \cdot \mathbb{P}(X_2 = 1 | X_1 = 1) \\ &= \frac{m}{M} \cdot \frac{m-1}{M-1} \\ &= \frac{m(m-1)}{M(M-1)} \end{align*}

So: \begin{align*} \mathbb{E}(X) &= \mathbb{E}(\sum_{i=1}^{n}X_{i}) \\ &= \sum_{i=1}^{n}\mathbb{E}(X_{i}) \\ &= \sum_{i=1}^{n} 1\cdot\mathbb{P}(X_i = 1) \\ &= \sum_{i=1}^{n} \frac{m}{M} \\ &= \frac{mn}{M} \end{align*} and \begin{align*} \mathbb{E}(X^2) &= \mathbb{E}\left[\left(\sum_{i=1}^{n}X_{i} \right)^2 \right] \\ &= \mathbb{E}\left[\sum_{i=1}^n X_i^2 + 2 \sum_{i < j} X_i X_j \right] \\ &= \sum_{i=1}^n \mathbb{E}(X_i^2) + 2 \sum_{i < j} \mathbb{E}(X_i X_j) \\ &= \frac{nm}{M} + n(n-1) \frac{m(m-1)}{M(M-1)} \\ \textrm{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}(X))^2 \\ &= \frac{nm}{M} + n(n-1) \frac{m(m-1)}{M(M-1)} - \frac{n^2m^2}{M^2} \\ &= \frac{nm}{M} \left (1-\frac{nm}{M}+(n-1)\frac{m-1}{M-1} \right) \\ &= \frac{nm}{M} \left ( \frac{M(M-1)-(M-1)nm+(n-1)(m-1)M}{M(M-1)} \right) \\ &= \frac{nm}{M} \frac{(M-m)(M-n)}{M(M-1)} \\ &= n \frac{m}{M} \frac{M-m}{M} \frac{M-n}{M-1} \end{align*} Note: This is a very nice way of deriving the mean and variance of the hypergeometric distribution

View

Problems

Filters