183 problems found
Let \(\F(x)\) be the cumulative distribution function of a random variable \(X\), which satisfies \(\F(a)=0\) and \(\F(b)=1\), where \(a>0\). Let \[ \G(y) = \frac{\F(y)}{2-\F(y)}\;. \] Show that \(\G(a)=0\,\), \(\G(b)=1\,\) and that \(\G'(y)\ge0\,\). Show also that \[ \frac12 \le \frac2{(2-\F(y))^2} \le 2\;. \] The random variable \(Y\) has cumulative distribution function \(\G(y)\,\). Show that \[ { \tfrac12} \,\E(X) \le \E(Y) \le 2 \E(X) \;, \] and that \[ \var(Y) \le 2\var(X) +\tfrac 74 \big(\E(X)\big)^2\;. \]
Solution: \begin{align*} && G(a) &= \frac{F(a)}{2-F(a)}\\ &&&= 0 \tag{\(F(a)= 0\)}\\ \\ && G(b) &= \frac{F(b)}{2-F(b)} \\ &&&= \frac{1}{2-1} = 1 \tag{\(F(b)=1\)}\\ \\ && G'(y) &= \frac{F'(y)(2-F(y))+F(y)F'(y)}{(2-F(y))^2} \\ &&&= \frac{2F'(y)}{(2-F(y))^2} \geq 0 \tag{\(F'(y) \geq 0\)} \end{align*} \begin{align*} && 0 \leq F(y)\leq1\\ \Leftrightarrow&& 1\leq 2-F(y) \leq 2\\ \Leftrightarrow &&1 \leq (2-F(y))^2 \leq 4\\ \Leftrightarrow && 1 \geq \frac{1}{(2-F(y))^2} \geq \frac14 \\ \Leftrightarrow && 2 \geq \frac{2}{(2-F(y))^2} \geq\frac12 \end{align*} \begin{align*} && \mathbb{E}(Y) &= \int_a^b y G'(y) \d y \\ &&&= \int_a^b y F'(y) \underbrace{\frac{2}{(2-F(y))^2}}_{\in [\frac12, 2]} \d y \\ &&&\leq 2 \E[X] \\ &&&\geq \frac12 \E[X]\\ \\ && \E[Y^2] &\leq 2\E[X^2] \\ && \E[Y^2] &\geq \frac12\E[X^2] \\ \\ \Rightarrow && \var[Y] &= \E[Y^2]-\E[Y]^2 \\ &&& \leq 2 \E[X^2] - (\tfrac12\E[X])^2 \\ &&&= 2 \var[X] + \tfrac74(\E[X])^2 \end{align*}
A densely populated circular island is divided into \(N\) concentric regions \(R_1\), \(R_2\), \(\ldots\,\), \(R_N\), such that the inner and outer radii of \(R_n\) are \(n-1\) km and \(n\) km, respectively. The average number of road accidents that occur in any one day in \(R_n\) is \(2-n/N\,\), independently of the number of accidents in any other region. Each day an observer selects a region at random, with a probability that is proportional to the area of the region, and records the number of road accidents, \(X\), that occur in it. Show that, in the long term, the average number of recorded accidents per day will be \[ 2-\frac16\left(1+\frac1N\right)\left(4-\frac1N\right)\;. \] [Note: \(\sum\limits_{n=1}^N n^2 = \frac16 N(N+1)(2N+1) \;\).] Show also that \[ \P(X=k) = \frac{\e^{-2}N^{-k-2}}{k!}\sum_{n=1}^N (2n-1)(2N-n)^k\e^{n/N} \;. \] Suppose now that \(N=3\) and that, on a particular day, two accidents were recorded. Show that the probability that \(R_2\) had been selected is \[ \frac{48}{48 + 45\e^{1/3} +25 \e^{-1/3}}\;. \]
Solution: The area of \(R_n\) is \(\pi(n^2 - (n-1)^2) = (2n-1)\pi\). The area of the whole region is \(\pi N^2\). \begin{align*} && \E[X] &= \E[\E[X | \text{choose region }n]] \\ &&&= \sum_{n=1}^N \left (2 - \frac{n}{N} \right) \cdot \frac{(2n-1)\pi}{N^2 \pi} \\ &&&= \sum_{n=1}^N \left (2\cdot \frac{(2n-1)\pi}{N^2 \pi} - \frac{n}{N}\cdot \frac{(2n-1)\pi}{N^2 \pi} \right) \\ &&&= 2 - \frac{1}{N^3} \sum_{n=1}^N (2n^2-n) \\ &&&= 2 - \frac{1}{N^3} \left (\frac{2N(N+1)(2N+1)}{6} - \frac{N(N+1)}{2} \right) \\ &&&= 2 - \frac{N+1}{6N^2} \left (2(2N+1)-3 \right) \\ &&&= 2 - \frac{N+1}{6N^2} (4N - 1) \\ &&&= 2 - \frac16 \left (1 + \frac1N \right) \left (4 - \frac1N \right) \end{align*} Modelling each region as \(Po(2 - n/N)\) we have \begin{align*} \mathbb{P}(X = k ) &= \sum_{n=1}^N \exp(-2 + n/N) \frac{(2-n/N)^k}{k!} \frac{2n-1}{N^2} \\ &= \frac{e^{-2}N^{-k-2}}{k!} \sum_{n=1}^N e^{n/N} (2N-n)^k(2n-1) \end{align*} as desired. Supposing \(N=3\) and two accidents then \begin{align*} \mathbb{P}(R_2 | X = 2) &= \frac{\frac{3}{9} e^{-4/3}\frac{(\frac43)^2}{2!}}{\mathbb{P}(X=2)} \\ &= \frac{\frac{3}{9} e^{-4/3} \frac{(\frac43)^2}{2!}}{\frac{1}{9} e^{-5/3} \frac{(\frac53)^2}{2!} + \frac{3}{9} e^{-4/3} \frac{(\frac43)^2}{2!} + \frac{5}{9} e^{-2/3} \frac{(\frac33)^2}{2!}} \\ &= \frac{3 \cdot 16}{25e^{-1/3} + 3 \cdot 16 + 5 \cdot 9e^{1/3}} \\ &= \frac{48}{25e^{-1/3} + 48 + 45e^{1/3}} \end{align*} as required.
In a game, a player tosses a biased coin repeatedly until two successive tails occur, when the game terminates. For each head which occurs the player wins \(\pounds 1\). If \(E\) is the expected number of tosses of the coin in the course of a game, and \(p\) is the probability of a head, explain why \[ E = p \l 1 + E \r + \l 1 - p \r p \l 2 + E \r + 2 \l 1 - p \r ^2\,, \] and hence determine \(E\) in terms of \(p\). Find also, in terms of \(p\), the expected winnings in the course of a game. A second game is played, with the same rules, except that the player continues to toss the coin until \(r\) successive tails occur. Show that the expected number of tosses in the course of a game is given by the expression \(\displaystyle {1 - q^r \over p q^r}\,\), where \(q = 1 - p\).
A continuous random variable is said to have an exponential distribution with parameter \(\lambda\) if its density function is \(\f(t) = \lambda \e ^{- \lambda t} \; \l 0 \le t < \infty \r\,\). If \(X_1\) and \(X_2\), which are independent random variables, have exponential distributions with parameters \(\lambda_1\) and \(\lambda_2\) respectively, find an expression for the probability that either \(X_1\) or \(X_2\) (or both) is less than \(x\). Prove that if \(X\) is the random variable whose value is the lesser of the values of \(X_1\) and \(X_2\), then \(X\) also has an exponential distribution. Route A and Route B buses run from my house to my college. The time between buses on each route has an exponential distribution and the mean time between buses is 15 minutes for Route A and 30 minutes for Route B. The timings of the buses on the two routes are independent. If I emerge from my house one day to see a Route A bus and a Route B bus just leaving the stop, show that the median wait for the next bus to my college will be approximately 7 minutes.
Prove that, for any two discrete random variables \(X\) and \(Y\), \[ \mathrm{Var} \left(X + Y \right) = \mathrm{Var}(X) + \mathrm{Var}(Y) + 2 \, \mathrm{Cov}(X,Y), \] where \(\mathrm{Var}(X)\) is the variance of \(X\) and \(\mathrm{Cov}(X,Y)\) is the covariance of \(X\) and \(Y\). When a Grandmaster plays a sequence of \(m\) games of chess, she is, independently, equally likely to win, lose or draw each game. If the values of the random variables \(W\), \(L\) and \(D\) are the numbers of her wins, losses and draws respectively, justify briefly the following claims:
Solution: \begin{align*} && \var[X+Y] &= \E\left [(X+Y-\E[X+Y])^2 \right] \\ &&&= \E \left [ (X - \E[X] + Y - \E[Y])^2 \right] \\ &&&= \E \left [(X - \E[X])^2 + (Y-\E[Y])^2 + 2(X-\E[X])(Y-\E[Y]) \right] \\ &&&= \E \left [(X - \E[X])^2 \right]+\E \left [(Y-\E[Y])^2 \right]+\E \left [2(X-\E[X])(Y-\E[Y]) \right] \\ &&&= \var[X] + \var[Y] + 2 \mathrm{Cov}(X,Y) \end{align*}
Four students, Arthur, Bertha, Chandra and Delilah, exchange gossip. When Arthur hears a rumour, he tells it to one of the other three without saying who told it to him. He decides whom to tell by choosing at random amongst the other three, omitting the ones that he knows have already heard the rumour. When Bertha, Chandra or Delilah hear a rumour, they behave in exactly the same way (even if they have already heard it themselves). The rumour stops being passed round when it is heard by a student who knows that the other three have already heard it. Arthur starts a rumour and tells it to Chandra. By means of a tree diagram, or otherwise, show that the probability that Arthur rehears it is \(3/4\). Find also the probability that Bertha hears it twice and the probability that Chandra hears it twice.
Solution: Without loss of generality, \(C\) will tell \(B\) about the rumour. If \(B\) tells \(D\) then \(D\) can either tell \(A\) or \(C\) at which point either \(A\) is told or the rumour stops spreading.
Four students, one of whom is a mathematician, take turns at washing up over a long period of time. The number of plates broken by any student in this time obeys a Poisson distribution, the probability of any given student breaking \(n\) plates being \(\e^{-\lambda} \lambda^n/n!\) for some fixed constant \(\lambda\), independent of the number of breakages by other students. Given that five plates are broken, find the probability that three or more were broken by the mathematician.
Solution: Let \(X\) be the number of plates broken by the mathematician and \(Y\) by the other student. Then \(X \sim Po(\lambda), Y \sim Po(3\lambda)\) and \(X+Y \sim Po(4\lambda)\) \begin{align*} && \mathbb{P}(X = k | X+Y = n) &= \frac{\mathbb{P}(X = k, Y = n-k)}{\mathbb{P}(X+Y=n)} \\ &&&= \frac{e^{-\lambda} \lambda^k/k! \cdot e^{-3\lambda} (4\lambda)^{n-k}/(n-k)!}{e^{-4\lambda}(4\lambda)^n/n!} \\ &&&= \binom{n}{k} \left ( \frac{1}{4} \right)^k \left ( \frac{3}{5} \right)^{n-k} \end{align*} Therefore \(X | X+Y = n \sim Binomial(n, \tfrac14)\) \begin{align*} \mathbb{P}(X \geq 3 | X + Y = n) &= \binom{5}{3} \frac{3^2}{4^5} + \binom{5}{4} \frac{3}{4^5} + \binom{5}{5} \frac{1}{4^5} \\ &= \frac{1}{4^5} \left ( 90+ 15 + 1 \right) \\ &= \frac{106}{4^5} = \frac{53}{512} \approx \frac1{10} \end{align*}
On the basis of an interview, the \(N\) candidates for admission to a college are ranked in order according to their mathematical potential. The candidates are interviewed in random order (that is, each possible order is equally likely).
Solution:
The national lottery of Ruritania is based on the positive integers from \(1\) to \(N\), where \(N\) is very large and fixed. Tickets cost \(\pounds1\) each. For each ticket purchased, the punter (i.e. the purchaser) chooses a number from \(1\) to \(N\). The winning number is chosen at random, and the jackpot is shared equally amongst those punters who chose the winning number. A syndicate decides to buy \(N\) tickets, choosing every number once to be sure of winning a share of the jackpot. The total number of tickets purchased in this draw is \(3.8N\) and the jackpot is \(\pounds W\). Assuming that the non-syndicate punters choose their numbers independently and at random, find the most probable number of winning tickets and show that the expected net loss of the syndicate is approximately \[ N\; - \; %\textstyle{ \frac{5 \big(1- e^{-2.8}\big)}{14} \;W\;. \]
Two coins \(A\) and \(B\) are tossed together. \(A\) has probability \(p\) of showing a head, and \(B\) has probability \(2p\), independent of \(A\), of showing a head, where \(0 < p < \frac12\). The random variable \(X\) takes the value 1 if \(A\) shows a head and it takes the value \(0\) if \(A\) shows a tail. The random variable \(Y\) takes the value 1 if \(B\) shows a head and it takes the value \(0\) if \(B\) shows a tail. The random variable \(T\) is defined by \[ T= \lambda X + {\textstyle\frac12} (1-\lambda)Y. \] Show that \(\E(T)=p\) and find an expression for \(\var(T)\) in terms of \(p\) and \(\lambda\). Show that as \(\lambda\) varies, the minimum of \(\var(T)\) occurs when \[ \lambda =\frac{1-2p}{3-4p}\;. \] The two coins are tossed \(n\) times, where \(n>30\), and \(\overline{T}\) is the mean value of \(T\). Let \(b\) be a fixed positive number. Show that the maximum value of \(\P\big(\vert \overline{T}-p\vert < b\big)\) as \(\lambda\) varies is approximately \(2\Phi(b/s)-1\), where \(\Phi\) is the cumulative distribution function of a standard normal variate and \[ s^2= \frac{p(1-p)(1-2p)}{(3-4p)n}\;. \]
Solution: \begin{align*} && \E[T] &= \E[\lambda X + \tfrac12(1-\lambda)Y] \\ &&&= \lambda \E[X] + \tfrac12(1-\lambda) \E[Y] \\ &&&= \lambda p + \tfrac12 (1-\lambda) 2p \\ &&&= p \\ \\ && \var[T] &= \var[\lambda X + \tfrac12(1-\lambda)Y] \\ &&&= \lambda^2 \var[X] + \tfrac14(1-\lambda)^2 \var[Y] \\ &&&= \lambda^2 p(1-p) + \tfrac14(1-\lambda)^22p(1-2p) \\ &&&= p(\lambda^2 + \tfrac12(1-\lambda)^2) -p^2(\lambda^2+(1-\lambda)^2)\\ &&&= p(\tfrac32\lambda^2 - \lambda + \tfrac12) -p^2(2\lambda^2 -2\lambda + 2) \end{align*} Differentiating \(\var[T]\) with respect to \(\lambda\), and noting it is a quadratic with positive leading coefficient, we get \begin{align*} && \frac{\d \var[T]}{\d \lambda} &= p(2\lambda -(1-\lambda)) - p^2(2 \lambda -2(1-\lambda)) \\ &&&= p(3\lambda - 1)-p^2(4\lambda - 2) \\ \Rightarrow && \lambda(4p-3) &= 2p-1 \\ \Rightarrow && \lambda &= \frac{1-2p}{3-4p} \end{align*} By the central limit theorem \(\overline{T} \sim N(p, \frac{\sigma^2}{n})\) in particular, \(\mathbb{P}(|\overline{T} - p| < b) = \mathbb{P}(\left \lvert |\frac{\overline{T}-p}{\frac{\sigma}{\sqrt{n}}} \right \lvert < \frac{b}{\frac{\sigma}{\sqrt{n}}}) = \mathbb{P}(|Z| < \frac{b\sqrt{n}}{\sigma}) = 2\Phi(b/s) - 1\) where \(s = \frac{\sigma}{\sqrt{n}}\) so \begin{align*} && s^2 &= \frac1n \sigma^2 \\ &&&= \frac1n \left ( \left (\left ( \frac{1-2p}{3-4p} \right)^2 + \tfrac12 \left (1-\frac{1-2p}{3-4p} \right)^2 \right)p - \left ( \left ( \frac{1-2p}{3-4p} \right)^2 + \left (1-\frac{1-2p}{3-4p} \right)^2\right)p^2 \right) \\ &&&= \frac1n \left ( \left (\left ( \frac{1-2p}{3-4p} \right)^2 + \tfrac12 \left (\frac{2-2p}{3-4p} \right)^2 \right)p - \left ( \left ( \frac{1-2p}{3-4p} \right)^2 + \left (\frac{2-2p}{3-4p} \right)^2\right)p^2 \right) \\ &&&= \frac{p}{n(3-4p)^2} \left ( (1 -4p + 4p^2 + 2-4p+2p^2) - (1-4p+4p^2+4-8p+4p^2)p \right) \\ &&&= \frac{p}{n(3-4p)^2} \left (3-13p+18p^2-8p^3 \right) \\ &&&= \frac{p}{n(3-4p)^2} (3-4p)(1-2p)(1-p) \\ &&&= \frac{p(1-p)(1-2p)}{(3-4p)n} \end{align*}
A bag contains \(b\) black balls and \(w\) white balls. Balls are drawn at random from the bag and when a white ball is drawn it is put aside.
In a game for two players, a fair coin is tossed repeatedly. Each player is assigned a sequence of heads and tails and the player whose sequence appears first wins. Four players, \(A\), \(B\), \(C\) and \(D\) take turns to play the game. Each time they play, \(A\) is assigned the sequence TTH (i.e.~Tail then Tail then Head), \(B\) is assigned THH, \(C\) is assigned HHT and \(D\) is assigned~HTT.
I have \(k\) different keys on my key ring. When I come home at night I try one key after another until I find the key that fits my front door. What is the probability that I find the correct key in exactly \(n\) attempts in each of the following three cases?
Every person carries two genes which can each be either of type \(A\) or of type \(B\). It is known that \(81\%\) of the population are \(AA\) (i.e. both genes are of type \(A\)), \(18\%\) are \(AB\) (i.e. there is one gene of type \(A\) and one of type \(B\)) and \(1\%\) are \(BB\). A child inherits one gene from each of its parents. If one parent is \(AA\), the child inherits a gene of type \(A\) from that parent; if the parent is \(BB\), the child inherits a gene of type \(B\) from that parent; if the parent is \(AB\), the inherited gene is equally likely to be \(A\) or \(B\).
The random variable \(X\) is uniformly distributed on the interval \([-1,1]\). Find \(\E(X^2)\) and \(\var (X^2)\). A second random variable \(Y\), independent of \(X\), is also uniformly distributed on \([-1,1]\), and \(Z=Y-X\). Find \(\E(Z^2)\) and show that \(\var (Z^2) = 7 \var (X^2)\).
Solution: \(X \sim U(-1,1)\) \begin{align*} \E[X^2] &= \int_{-1}^1 \frac12 x^2 \, dx \\ &= \frac{1}{6} \left [ x^3 \right]_{-1}^1 \\ &= \frac{1}{3} \end{align*} \begin{align*} \E[X^4] &= \int_{-1}^1 \frac12 x^4 \, dx \\ &= \frac{1}{10} \left [ x^5 \right]_{-1}^1 \\ &= \frac{1}{5} \end{align*} \begin{align*} \var[X^2] &=\E[X^4] - \E[X^2]^2 \\ &= \frac{1}{5} - \frac{1}{9} \\ &= \frac{4}{45} \end{align*} \begin{align*} \E(Z^2) &= \E(Y^2 - 2XY+Z^2) \\ &= \E(Y^2) - 2\E(X)\E(Y)+\E(Z^2) \\ &= \frac{1}{3} - 0 + \frac{1}{3} \\ &= \frac{2}{3} \end{align*} \begin{align*} \E[Z^4] &= \E[Y^4 -4Y^3X+6Y^2X^2-4YX^3+X^4] \\ &= \E[Y^4]-4\E[Y^3]\E[X]+6\E[Y^2]\E[X^2]-4\E[Y]\E[X^3]+\E[X^4] \\ &= \frac{1}{5}+6 \frac{1}{3} \frac13 + \frac{1}{5} \\ &= \frac{2}{5} + \frac{2}{3} \\ &= \frac{16}{15} \end{align*} \begin{align*} \var(Z^2) &= \E(Z^4) - \E(Z^2) \\ &= \frac{16}{15} - \frac{4}{9} \\ &= \frac{28}{45} \\ &= 7 \var(X^2) \end{align*}