Year 13 course of Further Statistics
Discrete random variables, including joint distributions and covariance
I have a Penny Black stamp which I want to sell to my friend Jim, but we cannot agree a price. So I put the stamp under one of two cups, jumble them up, and let Jim guess which one it is under. If he guesses correctly, I add a third cup, jumble them up, and let Jim guess correctly, adding another cup each time. The price he pays for the stamp is \(\pounds N,\) where \(N\) is the number of cups present when Jim fails to guess correctly. Find \(\mathrm{P}(N=k)\). Show that \(\mathrm{E}(N)=\mathrm{e}\) and calculate \(\mathrm{Var}(N).\)
Show SolutionLet \(X\) be a Poisson random variable with mean \(\lambda\) and let \(p_r = P(X = r)\), for \(r = 0, 1, 2, \ldots\). Neither \(\lambda\) nor \(\lambda + \frac{1}{2} + \sqrt{\lambda + \frac{1}{4}}\) is an integer.
The number of customers arriving at a builders' merchants each day follows a Poisson distribution with mean \(\lambda\). Each customer is offered some free sand. The probability of any given customer taking the free sand is \(p\).
Adam and Eve are catching fish. The number of fish, \(X\), that Adam catches in any time interval is Poisson distributed with parameter \(\lambda t\), where \(\lambda\) is a constant and \(t\) is the length of the time interval. The number of fish, \(Y\), that Eve catches in any time interval is Poisson distributed with parameter \(\mu t\), where \(\mu\) is a constant and \(t\) is the length of the time interval The two Poisson variables are independent. You may assume that that expected time between Adam catching a fish and Adam catching his next fish is \(\lambda^{-1}\), and similarly for Eve.
Let \(X\) be a random variable with mean \(\mu\) and standard deviation \(\sigma\). Chebyshev's inequality, which you may use without proof, is \[ \P\left(\vert X-\mu\vert > k\sigma\right) \le \frac 1 {k^2} \,, \] where \(k\) is any positive number.
The number \(X\) of casualties arriving at a hospital each day follows a Poisson distribution with mean 8; that is, \[ \P(X=n) = \frac{ \e^{-8}8^n}{n!}\,, \ \ \ \ n=0, \ 1, \ 2, \ \ldots \ . \] Casualties require surgery with probability \(\frac14\). The number of casualties arriving on any given day is independent of the number arriving on any other day and the casualties require surgery independently of one another.
The random variable \(U\) has a Poisson distribution with parameter \(\lambda\). The random variables \(X\) and \(Y\) are defined as follows. \begin{align*} X&= \begin{cases} U & \text{ if \(U\) is 1, 3, 5, 7, \(\ldots\,\)} \\ 0 & \text{ otherwise} \end{cases} \\ Y&= \begin{cases} U & \text{ if \(U\) is 2, 4, 6, 8, \(\ldots\,\) } \\ 0 & \text{ otherwise} \end{cases} \end{align*}
In this question, you may assume that \(\displaystyle \int_0^\infty \!\!\! \e^{-x^2/2} \d x = \sqrt{\tfrac12 \pi}\,\). The number of supermarkets situated in any given region can be modelled by a Poisson random variable, where the mean is \(k\) times the area of the given region. Find the probability that there are no supermarkets within a circle of radius \(y\). The random variable \(Y\) denotes the distance between a randomly chosen point in the region and the nearest supermarket. Write down \(\P(Y < y)\) and hence show that the probability density function of \(Y\) is \(\displaystyle 2\pi y k \e^{-\pi k y^2}\) for \(y\ge0\). Find \(\E(Y)\) and show that \(\var(Y) = \dfrac{4-\pi}{4\pi k}\).
Show SolutionThe number of texts that George receives on his mobile phone can be modelled by a Poisson random variable with mean \(\lambda\) texts per hour. Given that the probability George waits between 1 and 2 hours in the morning before he receives his first text is \(p\), show that \[ p\e^{2\lambda}-\e^{\lambda}+1=0. \] Given that \(4p<1\), show that there are two positive values of \(\lambda\) that satisfy this equation. The number of texts that Mildred receives on each of her two mobile phones can be modelled by independent Poisson random variables with different means \(\lambda_{1}\) and \(\lambda_{2}\) texts per hour. Given that, for each phone, the probability that Mildred waits between 1 and 2 hours in the morning before she receives her first text is also \(p\), find an expression for \(\lambda_{1}+\lambda_{2}\) in terms of~\(p\). Find the probability, in terms of \(p\), that she waits between 1 and 2 hours in the morning to receive her first text.
The discrete random variable \(X\) has a Poisson distribution with mean \(\lambda\).
A very generous shop-owner is hiding small diamonds in chocolate bars. Each diamond is hidden independently of any other diamond, and on average there is one diamond per kilogram of chocolate.
The number of printing errors on any page of a large book of \(N\) pages is modelled by a Poisson variate with parameter \(\lambda\) and is statistically independent of the number of printing errors on any other page. The number of pages in a random sample of \(n\) pages (where \(n\) is much smaller than \(N\) and \(n\ge2\)) which contain fewer than two errors is denoted by \(Y\). Show that \(\P(Y=k) = \binom n k p^kq^{n-k}\) where \(p=(1+\lambda)e^{-\lambda}\) and \(q=1-p\,\). Show also that, if \(\lambda\) is sufficiently small,
Brief interruptions to my work occur on average every ten minutes and the number of interruptions in any given time period has a Poisson distribution. Given that an interruption has just occurred, find the probability that I will have less than \(t\) minutes to work before the next interruption. If the random variable \(T\) is the time I have to work before the next interruption, find the probability density function of \(T\,\). I need an uninterrupted half hour to finish an important paper. Show that the expected number of interruptions before my first uninterrupted period of half an hour or more is \(\e^3-1\). Find also the expected length of time between interruptions that are less than half an hour apart. Hence write down the expected wait before my first uninterrupted period of half an hour or more.
Jane goes out with any of her friends who call, except that she never goes out with more than two friends in a day. The number of her friends who call on a given day follows a Poisson distribution with parameter \(2\). Show that the average number of friends she sees in a day is~\(2-4\e^{-2}\,\). Now Jane has a new friend who calls on any given day with probability \(p\). Her old friends call as before, independently of the new friend. She never goes out with more than two friends in a day. Find the average number of friends she now sees in a day.
Four students, one of whom is a mathematician, take turns at washing up over a long period of time. The number of plates broken by any student in this time obeys a Poisson distribution, the probability of any given student breaking \(n\) plates being \(\e^{-\lambda} \lambda^n/n!\) for some fixed constant \(\lambda\), independent of the number of breakages by other students. Given that five plates are broken, find the probability that three or more were broken by the mathematician.
Show SolutionIn a lottery, any one of \(N\) numbers, where \(N\) is large, is chosen at random and independently for each player by machine. Each week there are \(2N\) players and one winning number is drawn. Write down an exact expression for the probability that there are three or fewer winners in a week, given that you hold a winning ticket that week. Using the fact that $$ {\biggl( 1 - {a \over n} \biggr) ^n \approx \e^{-a}}$$ for \(n\) much larger than \(a\), or otherwise, show that this probability is approximately \({2 \over 3}\) . Discuss briefly whether this probability would increase or decrease if the numbers were chosen by the players. Show that the expected number of winners in a week, given that you hold a winning ticket that week, is \( 3-N^{-1}\).
In the basic version of Horizons (H1) the player has a maximum of \(n\) turns, where \(n \ge 1\). At each turn, she has a probability \(p\) of success, where \(0 < p < 1\). If her first success is at the \(r\)th turn, where \(1 \le r \le n\), she collects \(r\) pounds and then withdraws from the game. Otherwise, her winnings are nil. Show that in H1, her expected winnings are $$ p^{-1}\left[1+nq^{n+1}-(n+1)q^n\right]\quad\hbox{pounds}, $$ where \(q=1-p\). The rules of H2 are the same as those of H1, except that \(n\) is randomly selected from a Poisson distribution with parameter \(\lambda\). If \(n=0\) her winnings are nil. Otherwise she plays H1 with the selected \(n\). Show that in H2, her expected winnings are $$ {1 \over p}{\left(1-{\e^{-{\lambda}p}}\right)} -{{\lambda}q}{\e^{-{\lambda}p}} \quad\hbox{pounds}. $$
The staff of Catastrophe College are paid a salary of \(A\) pounds per year. With a Teaching Assessment Exercise impending it is decided to try to lower the student failure rate by offering each lecturer an alternative salary of \(B/(1+X)\) pounds, where \(X\) is the number of his or her students who fail the end of year examination. Dr Doom has \(N\) students, each with independent probability \(p\) of failure. Show that she should accept the new salary scheme if $$A(N+1)p < B(1-(1-p)^{N+1}).$$ Under what circumstances could \(X\), for Dr Doom, be modelled by a Poisson random variable? What would Dr Doom's expected salary be under this model?
Traffic enters a tunnel which is 9600 metres long, and in which overtaking is impossible. The number of vehicles which enter in any given time is governed by the Poisson distribution with mean 6 cars per minute. All vehicles travel at a constant speed until forced to slow down on catching up with a slower vehicle ahead. I enter the tunnel travelling at 30 m\(\,\)s\(^{-1}\) and all the other traffic is travelling at 32 m\(\,\)s\(^{-1}\). What is the expected number of vehicles in the queue behind me when I leave the tunnel? Assuming again that I travel at 30 m\(\,\)s\(^{-1}\), but that all the other vehicles are independently equally likely to be travelling at 30 m\(\,\)s\(^{-1}\) or 32 m\(\,\)s\(^{-1}\), find the probability that exactly two vehicles enter the tunnel within 20 seconds of my doing so and catch me up before I leave it. Find also the probability that there are exactly two vehicles queuing behind me when I leave the tunnel. \noindent [Ignore the lengths of the vehicles.]
Whenever I go cycling I start with my bike in good working order. However if all is well at time \(t\), the probability that I get a puncture in the small interval \((t,t+\delta t)\) is \(\alpha\,\delta t.\) How many punctures can I expect to get on a journey during which my total cycling time is \(T\)? When I get a puncture I stop immediately to repair it and the probability that, if I am repairing it at time \(t\), the repair will be completed in time \((t,t+\delta t)\) is \(\beta\,\delta t.\) If \(p(t)\) is the probability that I am repairing a puncture at time \(t\), write down an equation relating \(p(t)\) to \(p(t+\delta t)\), and derive from this a differential equation relating \(p'(t)\) and \(p(t).\) Show that \[ p(t)=\frac{\alpha}{\alpha+\beta}(1-\mathrm{e}^{-(\alpha+\beta)t}) \] satisfies this differential equation with the appropriate initial condition. Find an expression, involving \(\alpha,\beta\) and \(T\), for the time expected to be spent mending punctures during a journey of total time \(T\). Hence, or otherwise, show that, the fraction of the journey expected to be spent mending punctures is given approximately by \[ \quad\frac{\alpha T}{2}\quad\ \mbox{ if }(\alpha+\beta)T\text{ is small, } \] and by \[ \frac{\alpha}{\alpha+\beta}\quad\mbox{ if }(\alpha+\beta)T\text{ is large.} \]
Fly By Night Airlines run jumbo jets which seat \(N\) passengers. From long experience they know that a very small proportion \(\epsilon\) of their passengers fail to turn up. They decide to sell \(N+k\) tickets for each flight. If \(k\) is very small compared with \(N\) explain why they might expect \[ \mathrm{P}(r\mbox{ passengers fail to turn up})=\frac{\lambda^{r}}{r!}\mathrm{e}^{-\lambda} \] approximately, with \(\lambda=N\epsilon.\) For the rest of the question you may assume that the formula holds exactly. Each ticket sold represents \(\pounds A\) profit, but the airline must pay each passenger that it cannot fly \(\pounds B\) where \(B>A>0.\) Explain why, if \(r\) passengers fail to turn up, its profit, in pounds, is \[ A(N+k)-B\max(0,k-r), \] where \(\max(0,k-r)\) is the larger of \(0\) and \(k-r.\) Write down the expected profit \(u_{k}\) when \(k=0,1,2\) and \(3.\) Find \(v_{k}=u_{k+1}-u_{k}\) for general \(k\) and show that \(v_{k}>v_{k+1}.\) Show also that \[ v_{k}\rightarrow A-B \] as \(k\rightarrow\infty.\) Advise Fly By Night on how to choose \(k\) to maximise its expected profit \(u_{k}.\)
A scientist is checking a sequence of microscope slides for cancerous cells, marking each cancerous cell that she detects with a red dye. The number of cancerous cells on a slide is random and has a Poisson distribution with mean \(\mu.\) The probability that the scientist spots any one cancerous cell is \(p\), and is independent of the probability that she spots any other one.
At the terminus of a bus route, passengers arrive at an average rate of 4 per minute according to a Poisson process. Each minute, on the minute, one bus arrives with probability \(\frac{1}{4},\) independently of the arrival of passengers or previous buses. Just after eight o'clock there is no-one at the bus stop.
The probability that there are exactly \(n\) misprints in an issue of a newspaper is \(\mathrm{e}^{-\lambda}\lambda^{n}/n!\) where \(\lambda\) is a positive constant. The probability that I spot a particular misprint is \(p\), independent of what happens for other misprints, and \(0 < p < 1.\)
I can choose one of three routes to cycle to school. Via Angle Avenue the distance is 5\(\,\)km, and I am held up at a level crossing for \(A\) minutes, where \(A\) is a continuous random variable uniformly distributed between \(0\) and 10. Via Bend Boulevard the distance is 4\(\,\)km, and I am delayed, by talking to each of \(B\) friends for 3\(\,\)minutes, for a total of \(3B\) minutes, where \(B\) is a random variable whose distribution is Poisson with mean 4. Via Detour Drive the distance should be only 2\(\,\)km, but in addition, due to never-ending road works, there are five places at each of which, with probability \(\frac{4}{5},\) I have to make a detour that increases the distance by 1\(\,\)km. Except when delayed by talking to friends or at the level crossing, I cycle at a steady 12\(\,\)km\(\,\)h\(^{-1}\). For each of the three routs, calculate the probability that a journey lasts at least 27 minutes. Each day I choose one of the three routes at random, and I am equally likely to choose any of the three alternatives. One day I arrive at school after a journey of at least 27 minutes. What is the probability that I came via Bend Boulevard? Which route should I use all the time: \begin{questionparts} \item if I wish my average journey time to be as small as possible; \item if I wish my journey time to be less than 32 minutes as often as possible? \end{questionpart} Justify your answers.
Show SolutionThe random variable \(X\) takes the values \(k=1\), \(2\), \(3\), \(\dotsc\), and has probability distribution $$ \P(X=k)= A{{{\lambda}^k\e^{-{\lambda}}} \over {k!}}\,, $$ where \(\lambda \) is a positive constant. Show that \(A = (1-\e^{-\lambda})^{-1}\,\). Find the mean \({\mu}\) in terms of \({\lambda}\) and show that $$ \var(X) = {\mu}(1-{\mu}+{\lambda})\;. $$ Deduce that \({\lambda} < {\mu} < 1+{\lambda}\,\). Use a normal approximation to find the value of \(P(X={\lambda})\) in the case where \({\lambda}=100\,\), giving your answer to 2 decimal places.
The national lottery of Ruritania is based on the positive integers from \(1\) to \(N\), where \(N\) is very large and fixed. Tickets cost \(\pounds1\) each. For each ticket purchased, the punter (i.e. the purchaser) chooses a number from \(1\) to \(N\). The winning number is chosen at random, and the jackpot is shared equally amongst those punters who chose the winning number. A syndicate decides to buy \(N\) tickets, choosing every number once to be sure of winning a share of the jackpot. The total number of tickets purchased in this draw is \(3.8N\) and the jackpot is \(\pounds W\). Assuming that the non-syndicate punters choose their numbers independently and at random, find the most probable number of winning tickets and show that the expected net loss of the syndicate is approximately \[ N\; - \; %\textstyle{ \frac{5 \big(1- e^{-2.8}\big)}{14} \;W\;. \]
The random variable \(X\) takes only non-negative integer values and has probability generating function \(\G(t)\). Show that \[ \P(X = 0 \text{ or } 2 \text{ or } 4 \text { or } 6 \ \ldots ) = \frac{1}{2}\big(\G\left(1\right)+\G\left(-1\right)\big). \] You are now given that \(X\) has a Poisson distribution with mean \(\lambda\). Show that \[ \G(t) = \e^{-\lambda(1-t)} \,. \]
A 6-sided fair die has the numbers 1, 2, 3, 4, 5, 6 on its faces. The die is thrown \(n\) times, the outcome (the number on the top face) of each throw being independent of the outcome of any other throw. The random variable \(S_n\) is the sum of the outcomes.
I play a game which has repeated rounds. Before the first round, my score is \(0\). Each round can have three outcomes: \begin{compactenum}[1.] \item my score is unchanged and the game ends; \item my score is unchanged and I continue to the next round; \item my score is increased by one and I continue to the next round. \end{compactenum} The probabilities of these outcomes are \(a\), \(b\) and~\(c\), respectively (the same in each round), where \(a+b+c=1\) and \(abc\ne0\). The random variable \(N\) represents my score at the end of a randomly chosen game. Let \(\.G(t)\) be the probability generating function of \(N\).
The random variable \(N\) takes positive integer values and has pgf (probability generating function) \(\G(t)\). The random variables \(X_i\), where \(i=1\), \(2\), \(3\), \(\ldots,\) are independently and identically distributed, each with pgf \({\H}(t)\). The random variables \(X_i\) are also independent of \(N\). The random variable \(Y\) is defined by \[ Y= \sum_{i=1}^N X_i \;. \] Given that the pgf of \(Y\) is \(\G(\H(t))\), show that \[ \E(Y) = \E(N)\E(X_i) \text{ \ \ \ \ and \ \ \ \ } \var(Y) = \var(N)\big(\E(X_i)\big)^2 + \E(N) \var(X_i) \,.\] A fair coin is tossed until a head occurs. The total number of tosses is \(N\). The coin is then tossed a further \(N\) times and the total number of heads in these \(N\) tosses is \(Y\). Find in this particular case the pgf of \(Y\), \(\E(Y)\), \(\var(Y)\) and \(\P(Y=r)\).
Write down the probability generating function for the score on a standard, fair six-faced die whose faces are labelled \(1, 2, 3, 4, 5, 6\). Hence show that the probability generating function for the sum of the scores on two standard, fair six-faced dice, rolled independently, can be written as \[ \frac1{36} t^2 \l 1 + t \r^2 \l 1 - t + t^2 \r^2 \l 1 + t + t^2 \r^2 \;. \] Write down, in factorised form, the probability generating functions for the scores on two fair six-faced dice whose faces are labelled with the numbers \(1, 2, 2, 3, 3, 4\) and \(1, 3, 4, 5, 6, 8,\) and hence show that when these dice are rolled independently, the probability of any given sum of the scores is the same as for the two standard fair six-faced dice. Standard, fair four-faced dice are tetrahedra whose faces are labelled \(1, 2, 3, 4,\) the score being taken from the face which is not visible after throwing, and each score being equally likely. Find all the ways in which two fair four-faced dice can have their faces labelled with positive integers if the probability of any given sum of the scores is to be the same as for the two standard fair four-faced dice.
A set of \(n\) dice is rolled repeatedly. For each die the probability of showing a six is \(p\). Show that the probability that the first of the dice to show a six does so on the \(r\)th roll is $$q^{n r } ( q^{-n} - 1 )$$ where \(q = 1 - p\). Determine, and simplify, an expression for the probability generating function for this distribution, in terms of \(q\) and \(n\). The first of the dice to show a six does so on the \(R\)th roll. Find the expected value of \(R\) and show that, in the case \(n = 2\), \(p=1/6\), this value is \(36/11\). Show that the probability that the last of the dice to show a six does so on the \(r\)th roll is \[ \big(1-q^r\big)^n-\big(1-q^{r-1}\big)^n. \] Find, for the case \(n = 2\), the probability generating function. The last of the dice to show a six does so on the \(S\)th roll. Find the expected value of \(S\) and evaluate this when \(p=1/6\).
The random variable \(X\) has probability density function \(\f(x)\) (which you may assume is differentiable) and cumulative distribution function \(\F(x)\) where $-\infty < x < \infty \(. The random variable \)Y\( is defined by \)Y= \e^X$. You may assume throughout this question that \(X\) and \(Y\) have unique modes.
The lifetime of a fly (measured in hours) is given by the continuous random variable~\(T\) with probability density function \(\.f(t)\) and cumulative distribution function \(\.F(t)\). The \emph{hazard function}, \(\.h(t)\), is defined, for \(\.F(t)<1\), by \[ \.h(t) = \frac{\.f(t)}{1-\.F(t)}\,. \]
What property of a distribution is measured by its {\em skewness}?
The continuous random variable \(X\) has probability density function \(\f(x)\), where \[ \f(x) = \begin{cases} a & \text {for } 0\le x < k \\ b & \text{for } k \le x \le 1\\ 0 & \text{otherwise}, \end{cases} \] where \(a > b > 0\) and \(0 < k < 1\). Show that \(a > 1\) and \(b < 1\).
The random variable \(X\) has a continuous probability density function \(\f(x)\) given by \begin{equation*} \f(x) = \begin{cases} 0 & \text{for } x \le 1 \\ \ln x & \text{for } 1\le x \le k\\ \ln k & \text{for } k\le x \le 2k\\ a-bx & \text{for } 2k \le x \le 4k \\ 0 & \text{for } x\ge 4k \end{cases} \end{equation*} where \(k\), \(a\) and \(b\) are constants.
In this question, you may use the result \[ \displaystyle \int_0^\infty \frac{t^m}{(t+k)^{n+2}} \; \mathrm{d}t =\frac{m!\, (n-m)!}{(n+1)! \, k^{n-m+1}}\;, \] where \(m\) and \(n\) are positive integers with \(n\ge m\,\), and where \(k>0\,\). The random variable \(V\) has density function \[ \f(x) = \frac{C \, k^{a+1} \, x^a}{(x+k)^{2a+2}} \quad \quad (0 \le x < \infty) \;, \] where \(a\) is a positive integer. Show that \(\displaystyle C = \frac{(2a+1)!}{a! \, a!}\;\). Show, by means of a suitable substitution, that \[ \int_0^v \frac{x^a}{(x+k)^{2a+2}} \; \mathrm{d}x = \int_{\frac{k^2}{v}}^\infty \frac{u^a}{(u+k)^{2a+2}} \; \mathrm{d}u \] and deduce that the median value of \(V\) is \(k\). Find the expected value of \(V\). The random variable \(V\) represents the speed of a randomly chosen gas molecule. The time taken for such a particle to travel a fixed distance \(s\) is given by the random variable \(\ds T=\frac{s}{V}\). Show that \begin{equation} \mathbb{P}( T < t) = \ds \int_{\frac{s}{t}}^\infty \frac{C \, k^{a+1} \, x^a}{(x+k)^{2a+2}}\; \mathrm{d}x \tag{\( *\)} \end{equation} and hence find the density function of \(T\). You may find it helpful to make the substitution \(\ds u = \frac{s}{x}\) in the integral \((*)\). Hence show that the product of the median time and the median speed is equal to the distance \(s\), but that the product of the expected time and the expected speed is greater than \(s\).
Show SolutionSketch the graph, for \(x \ge 0\,\), of $$ y = kx\e^{-ax^2} \;, $$ where \(a\) and \(k\) are positive constants. The random variable \(X\) has probability density function \(\f(x)\) given by \begin{equation*} \f(x)= \begin{cases} kx\e^{-ax^2} & \text{for \(0 \le x \le 1\)}\\[3pt] 0 & \text{otherwise}. \end{cases} \end{equation*} Show that \(\ds k=\frac{2a}{1-\e^{-a}}\) and find the mode \(m\) in terms of \(a\,\), distinguishing between the cases \(a < \frac12\) and \(a > \frac12\,\). Find the median \(h\) in terms of \(a\)\, and show that \(h > m\) if \(a > -\ln\left(2\e^{-1/2} - 1\right).\) Show that, \(-\ln\left(2\e^{-1/2}-1\right)> \frac12 \,\). Show also that, if \(a > -\ln\left(2\e^{-1/2} - 1\right) \,\), then $$ P(X > m \;\vert\; X < h) = {{2\e^{-1/2}-\e^{-a}-1} \over 1-\e^{-a}}\;. $$
Let \(\F(x)\) be the cumulative distribution function of a random variable \(X\), which satisfies \(\F(a)=0\) and \(\F(b)=1\), where \(a>0\). Let \[ \G(y) = \frac{\F(y)}{2-\F(y)}\;. \] Show that \(\G(a)=0\,\), \(\G(b)=1\,\) and that \(\G'(y)\ge0\,\). Show also that \[ \frac12 \le \frac2{(2-\F(y))^2} \le 2\;. \] The random variable \(Y\) has cumulative distribution function \(\G(y)\,\). Show that \[ {\ts \frac12} \,\E(X) \le \E(Y) \le 2 \E(X) \;, \] and that \[ \var(Y) \le 2\var(X) +{\ts \frac 74} \big(\E(X)\big)^2\;. \]
A goat \(G\) lies in a square field \(OABC\) of side \(a\). It wanders randomly round its field, so that at any time the probability of its being in any given region is proportional to the area of this region. Write down the probability that its distance, \(R\), from \(O\) is less than \(r\) if \(0 < r\leqslant a,\) and show that if \(r\geqslant a\) the probability is \[ \left(\frac{r^{2}}{a^{2}}-1\right)^{\frac{1}{2}}+\frac{\pi r^{2}}{4a^{2}}-\frac{r^{2}}{a^{2}}\cos^{-1}\left(\frac{a}{r}\right). \] Find the median of \(R\) and probability density function of \(R\). The goat is then tethered to the corner \(O\) by a chain of length \(a\). Find the conditional probability that its distance from the fence \(OC\) is more than \(a/2\).
A target consists of a disc of unit radius and centre \(O\). A certain marksman never misses the target, and the probability of any given shot hitting the target within a distance \(t\) from \(O\) it \(t^{2}\), where \(0\leqslant t\leqslant1\). The marksman fires \(n\) shots independently. The random variable \(Y\) is the radius of the smallest circle, with centre \(O\), which encloses all the shots. Show that the probability density function of \(Y\) is \(2ny^{2n-1}\) and find the expected area of the circle. The shot which is furthest from \(O\) is rejected. Show that the expected area of the smallest circle, with centre \(O\), which encloses the remaining \((n-1)\) shots is \[ \left(\frac{n-1}{n+1}\right)\pi. \]
Show SolutionThe continuous random variable \(X\) is uniformly distributed over the interval \([-c,c].\) Write down expressions for the probabilities that:
An internet tester sends \(n\) e-mails simultaneously at time \(t=0\). Their arrival times at their destinations are independent random variables each having probability density function \(\lambda \e^{-\lambda t}\) (\(0\le t<\infty\), \( \lambda >0\)).
The maximum height \(X\) of flood water each year on a certain river is a random variable with probability density function \(\f\) given by \[ \f(x) = \begin{cases} \lambda \e^{-\lambda x} & \text{for \(x\ge0\)}\,, \\ 0 & \text{otherwise,} \end{cases} \] where \(\lambda\) is a positive constant. It costs \(ky\) pounds each year to prepare for flood water of height \(y\) or less, where \(k\) is a positive constant and \(y\ge0\). If \(X \le y\) no further costs are incurred but if \(X> y\) the additional cost of flood damage is \(a(X - y )\) pounds where \(a\) is a positive constant.
A continuous random variable is said to have an exponential distribution with parameter \(\lambda\) if its density function is \(\f(t) = \lambda \e ^{- \lambda t} \; \l 0 \le t < \infty \r\,\). If \(X_1\) and \(X_2\), which are independent random variables, have exponential distributions with parameters \(\lambda_1\) and \(\lambda_2\) respectively, find an expression for the probability that either \(X_1\) or \(X_2\) (or both) is less than \(x\). Prove that if \(X\) is the random variable whose value is the lesser of the values of \(X_1\) and \(X_2\), then \(X\) also has an exponential distribution. Route A and Route B buses run from my house to my college. The time between buses on each route has an exponential distribution and the mean time between buses is 15 minutes for Route A and 30 minutes for Route B. The timings of the buses on the two routes are independent. If I emerge from my house one day to see a Route A bus and a Route B bus just leaving the stop, show that the median wait for the next bus to my college will be approximately 7 minutes.
In order to get money from a cash dispenser I have to punch in an identification number. I have forgotten my identification number, but I do know that it is equally likely to be any one of the integers \(1\), \(2\), \ldots , \(n\). I plan to punch in integers in order until I get the right one. I can do this at the rate of \(r\) integers per minute. As soon as I punch in the first wrong number, the police will be alerted. The probability that they will arrive within a time \(t\) minutes is \(1-\e^{-\lambda t}\), where \(\lambda\) is a positive constant. If I follow my plan, show that the probability of the police arriving before I get my money is \[ \sum_{k=1}^n \frac{1-\e^{-\lambda(k-1)/r}}n\;. \] Simplify the sum. On past experience, I know that I will be so flustered that I will just punch in possible integers at random, without noticing which I have already tried. Show that the probability of the police arriving before I get my money is \[ 1-\frac1{n-(n-1)\e^{-\lambda/r}} \;. \]
A random variable \(X\) has the probability density function \[ \mathrm{f}(x)=\begin{cases} \lambda\mathrm{e}^{-\lambda x} & x\geqslant0,\\ 0 & x<0. \end{cases} \] Show that $${\rm P}(X>s+t\,\vert X>t) = {\rm P}(X>s).$$ The time it takes an assistant to serve a customer in a certain shop is a random variable with the above distribution and the times for different customers are independent. If, when I enter the shop, the only two assistants are serving one customer each, what is the probability that these customers are both still being served at time \(t\) after I arrive? One of the assistants finishes serving his customer and immediately starts serving me. What is the probability that I am still being served when the other customer has finished being served?
The maximum height \(X\) of flood water each year on a certain river is a random variable with density function \begin{equation*} {\mathrm f}(x)= \begin{cases} \exp(-x)&\text{if \(x\geqslant 0\),}\\ 0&\text{otherwise}. \end{cases} \end{equation*} It costs \(y\) megadollars each year to prepare for flood water of height \(y\) or less. If \(X\leqslant y\) no further costs are incurred but if \(X\geqslant y\) the cost of flood damage is \(r+s(X-y)\) megadollars where \(r,s>0\). The total cost \(T\) megadollars is thus given by \begin{equation*} T= \begin{cases} y&\text{if \(X\leqslant y\)},\\ y+r+s(X-y)&\text{if \(X>y\)}. \end{cases} \end{equation*} Show that we can minimise the expected total cost by taking \[y=\ln(r+s).\]
During his performance a trapeze artist is supported by two identical ropes, either of which can bear his weight. Each rope is such that the time, in hours of performance, before it fails is exponentially distributed, independently of the other, with probability density function \(\lambda\exp(-\lambda t)\) for \(t\geqslant0\) (and 0 for \(t<0\)), for some \(\lambda>0.\) A particular rope has already been in use for \(t_{0}\) hours of performance. Find the distribution for the length of time the artist can continue to use it before it fails. Interpret and comment upon your result. Before going on tour the artist insists that the management purchase two new ropes of the above type. Show that the probability density function of the time until both ropes fail is \[ \mathrm{f}(t)=\begin{cases} 2\lambda\mathrm{e}^{-\lambda t}(1-\mathrm{e}^{-\lambda t}) & \text{ if }t\geqslant0,\\ 0 & \text{ otherwise.} \end{cases} \] If each performance lasts for \(h\) hours, find the probability that both ropes fail during the \(n\)th performance. Show that the probability that both ropes fail during the same performance is \(\tanh(\lambda h/2)\).
A random process generates, independently, \(n\) numbers each of which is drawn from a uniform (rectangular) distribution on the interval 0 to 1. The random variable \(Y_k\) is defined to be the \(k\)th smallest number (so there are \(k-1\) smaller numbers).
The discrete random variables \(X\) and \(Y\) can each take the values \(1\), \(\ldots\,\), \(n\) (where \(n\ge2\)). Their joint probability distribution is given by \[ \P(X=x, \ Y=y) = k(x+y) \,, \] where \(k\) is a constant.
Given a random variable \(X\) with mean \(\mu\) and standard deviation \(\sigma\), we define the kurtosis, \(\kappa\), of \(X\) by \[ \kappa = \frac{ \E\big((X-\mu)^4\big)}{\sigma^4} -3 \,. \] Show that the random variable \(X-a\), where \(a\) is a constant, has the same kurtosis as \(X\).
Each of the two independent random variables \(X\) and \(Y\) is uniformly distributed on the interval~\([0,1]\).
A list consists only of letters \(A\) and \(B\) arranged in a row. In the list, there are \(a\) letter \(A\)s and \(b\) letter \(B\)s, where \(a\ge2\) and \(b\ge2\), and \(a+b=n\). Each possible ordering of the letters is equally probable. The random variable \(X_1\) is defined by \[ X_1 = \begin{cases} 1 & \text{if the first letter in the row is \(A\)};\\ 0 & \text{otherwise.} \end{cases} \] The random variables \(X_k\) (\(2 \le k \le n\)) are defined by \[ X_k = \begin{cases} 1 & \text{if the \((k-1)\)th letter is \(B\) and the \(k\)th is \(A\)};\\ 0 & \text{otherwise.} \end{cases} \] The random variable \(S\) is defined by \(S = \sum\limits_ {i=1}^n X_i\,\).
In this question, \({\rm Corr}(U,V)\) denotes the product moment correlation coefficient between the random variables \(U\) and \(V\), defined by \[ \mathrm{Corr}(U,V) \equiv \frac{\mathrm{Cov}(U,V)}{\sqrt{\var(U)\var(V)}}\,. \] The independent random variables \(Z_1\), \(Z_2\) and \(Z_3\) each have expectation 0 and variance 1. What is the value of \(\mathrm{Corr} (Z_1,Z_2)\)? Let \(Y_1 = Z_1\) and let \[ Y_2 = \rho _{12} Z_1 + (1 - {\rho_{12}^2})^{ \frac12} Z_ 2\,, \] where \(\rho_{12}\) is a given constant with $-1<\rho _{12}<1$. Find \(\E(Y_2)\), \(\var(Y_2)\) and \(\mathrm{Corr}(Y_1, Y_2)\). Now let \(Y_3 = aZ_1 + bZ_2 + cZ_3\), where \(a\), \(b\) and \(c\) are real constants and \(c\ge0\). Given that \(\E(Y_3) = 0\), \(\var(Y_3) = 1\), \( \mathrm{Corr}(Y_1, Y_3) =\rho^{{2}}_{13} \) and \( \mathrm{Corr}(Y_2, Y_3)= \rho^{{2}} _{23}\), express \(a\), \(b\) and \(c\) in terms of \(\rho^{2} _{23}\), \(\rho^{2}_{13}\) and \(\rho^{2} _{12}\). Given constants \(\mu_i\) and \(\sigma_i\), for \(i=1\), \(2\) and \(3\), give expressions in terms of the \(Y_i\) for random variables \(X_i\) such that \(\E(X_i) = \mu_i\), \(\var(X_i) = \sigma_ i^2\) and \(\mathrm{Corr}(X_i,X_j) = \rho_{ij}\).
Show SolutionI choose a number from the integers \(1, 2, \ldots\), \((2n-1)\) and the outcome is the random variable~\(N\). Calculate \( \E(N)\) and \(\E(N^2)\). I then repeat a certain experiment \(N\) times, the outcome of the \(i\)th experiment being the random variable \(X_i\) ($1\le i \le N\(). For each \)i$, the random variable \(X_i\) has mean \(\mu\) and variance~\(\sigma^2\), and \(X_i\) is independent of \(X_j\) for \(i\ne j\) and also independent of \(N\). The random variable \(Y\) is defined by \(Y= \sum\limits_{i=1}^NX_i\). Show that \(\E(Y)=n\mu\) and that \(\mathrm{Cov}(Y,N) = \frac13n(n-1)\mu\). Find \(\var(Y) \) in terms of \(n\), \(\sigma^2\) and \(\mu\).
For any random variables \(X_1\) and \(X_2\), state the relationship between \(\E(aX_1+bX_2)\) and \(\E(X_1)\) and \(\E(X_2)\), where \(a\) and \(b\) are constants. If \(X_1\) and \(X_2\) are independent, state the relationship between \(\E(X_1X_2)\) and \(\E(X_1)\) and \(\E(X_2)\). An industrial process produces rectangular plates. The length and the breadth of the plates are modelled by independent random variables \(X_1\) and \(X_2\) with non-zero means \(\mu_1\) and \(\mu_2\) and non-zero standard deviations \(\sigma_1\) and \(\sigma_2\), respectively. Using the results in the paragraph above, and without quoting a formula for \(\var(aX_1+bX_2)\), find the means and standard deviations of the perimeter \(P\) and area \(A\) of the plates. Show that \(P\) and \(A\) are not independent. The random variable \(Z\) is defined by \(Z=P-\alpha A\), where \(\alpha \) is a constant. Show that \(Z\) and \(A\) are not independent if \[ \alpha \ne \dfrac{2(\mu_1^{\vphantom2} \sigma_2^2 +\mu_2^{\vphantom2}\sigma_1^2)} { \mu_1^2 \sigma_2^2 +\mu_2^2\sigma_1^2 + \sigma_1^2\sigma_2^2 } \;. \] Given that \(X_1\) and \(X_2\) can each take values 1 and 3 only, and that they each take these values with probability \(\frac 12\), show that \(Z\) and \(A\) are not independent for any value of \(\alpha\).
Show SolutionFive independent timers time a runner as she runs four laps of a track. Four of the timers measure the individual lap times, the results of the measurements being the random variables \(T_1\) to \(T_4\), each of which has variance \(\sigma^2\) and expectation equal to the true time for the lap. The fifth timer measures the total time for the race, the result of the measurement being the random variable \(T\) which has variance \(\sigma^2\) and expectation equal to the true race time (which is equal to the sum of the four true lap times). Find a random variable \(X\) of the form \(aT+b(T_1+T_2+T_3+T_4)\), where \(a\) and \(b\) are constants independent of the true lap times, with the two properties: (1) \ whatever the true lap times, the expectation of \(X\) is equal to the true race time; (2) \ the variance of \(X\) is as small as possible. Find also a random variable \(Y\) of the form \(cT+d(T_1+T_2+T_3+T_4)\), where \(c\) and \(d\) are constants independent of the true lap times, with the property that, whatever the true lap times, the expectation of \(Y^2\) is equal to \(\sigma^2\). In one particular race, \(T\) takes the value 220 seconds and \((T_1 + T_2 + T_3 + T_4)\) takes the value \(220.5\) seconds. Use the random variables \(X\) and \(Y\) to estimate an interval in which the true race time lies.
A team of \(m\) players, numbered from \(1\) to \(m\), puts on a set of a \(m\) shirts, similarly numbered from \(1\) to \(m\). The players change in a hurry, so that the shirts are assigned to them randomly, one to each player. Let \(C_i\) be the random variable that takes the value \(1\) if player \(i\) is wearing shirt \(i\), and 0 otherwise. Show that \(\mathrm{E}\left(C_1\right)={1 \over m}\) and find \(\var \left(C_1\right)\) and \(\mathrm{Cov}\left(C_1 \, , \; C_2 \right) \,\). Let \(\, N = C_1 + C_2 + \cdots + C_m \,\) be the random variable whose value is the number of players who are wearing the correct shirt. Show that \(\mathrm{E}\left(N\right)= \var \left(N\right) = 1 \,\). Explain why a Normal approximation to \(N\) is not likely to be appropriate for any \(m\), but that a Poisson approximation might be reasonable. In the case \(m = 4\), find, by listing equally likely possibilities or otherwise, the probability that no player is wearing the correct shirt and verify that an appropriate Poisson approximation to \(N\) gives this probability with a relative error of about \(2\%\). [Use \(\e \approx 2\frac{72}{100} \,\).]
Show SolutionProve that, for any two discrete random variables \(X\) and \(Y\), \[ \mathrm{Var} \left(X + Y \right) = \mathrm{Var}(X) + \mathrm{Var}(Y) + 2 \, \mathrm{Cov}(X,Y), \] where \(\mathrm{Var}(X)\) is the variance of \(X\) and \(\mathrm{Cov}(X,Y)\) is the covariance of \(X\) and \(Y\). When a Grandmaster plays a sequence of \(m\) games of chess, she is, independently, equally likely to win, lose or draw each game. If the values of the random variables \(W\), \(L\) and \(D\) are the numbers of her wins, losses and draws respectively, justify briefly the following claims:
The random variable \(X\) takes only the values \(x_1\) and \(x_2\) (where \( x_1 \not= x_2 \)), and the random variable \(Y\) takes only the values \(y_1\) and \(y_2\) (where \(y_1 \not= y_2\)). Their joint distribution is given by $$ \P ( X = x_1 , Y = y_1 ) = a \ ; \ \ \P ( X = x_1 , Y = y_2 ) = q - a \ ; \ \ \P ( X = x_2 , Y = y_1 ) = p - a \ . $$ Show that if \(\E(X Y) = \E(X)\E(Y)\) then $$ (a - p q ) ( x_1 - x_2 ) ( y_1 - y_2 ) = 0 . $$ Hence show that two random variables each taking only two distinct values are independent if \(\E(X Y) = \E(X) \E(Y)\). Give a joint distribution for two random variables \(A\) and \(B\), each taking the three values \(- 1\), \(0\) and \(1\) with probability \({1 \over 3}\), which have \(\E(A B) = \E( A)\E (B)\), but which are not independent.
Show SolutionAn industrial process produces rectangular plates of mean length \(\mu_{1}\) and mean breadth \(\mu_{2}\). The length and breadth vary independently with non-zero standard deviations \(\sigma_{1}\) and \(\sigma_{2}\) respectively. Find the means and standard deviations of the perimeter and of the area of the plates. Show that the perimeter and area are not independent.
The random variables \(X\) and \(Y\) are independently normally distributed with means 0 and variances 1. Show that the joint probability density function for \((X,Y)\) is \[ \mathrm{f}(x,y)=\frac{1}{2\pi}\mathrm{e}^{-\frac{1}{2}(x^{2}+y^{2})}\qquad-\infty < x < \infty,-\infty < y < \infty. \] If \((x,y)\) are the coordinates, referred to rectangular axes, of a point in the plane, explain what is meant by saying that this density is radially symmetrical. The random variables \(U\) and \(V\) have a joint probability density function which is radially symmetrical (in the above sense). By considering the straight line with equation \(U=kV,\) or otherwise, show that \[ \mathrm{P}\left(\frac{U}{V} < k\right)=2\mathrm{P}(U < kV,V > 0). \] Hence, or otherwise, show that the probability density function of \(U/V\) is \[ \mathrm{g}(k)=\frac{1}{\pi(1+k^{2})}\qquad-\infty < k < \infty. \]
The random variables \(X\) and \(Y\) take integer values \(x\) and \(y\) respectively which are restricted by \(x\geqslant1,\) \(y\geqslant1\) and \(2x+y\leqslant2a\) where \(a\) is an integer greater than 1. The joint probability is given by \[ \mathrm{P}(X=x,Y=y)=c(2x+y), \] where \(c\) is a positive constant, within this region and zero elsewhere. Obtain, in terms of \(x,c\) and \(a,\) the marginal probability \(\mathrm{P}(X=x)\) and show that \[ c=\frac{6}{a(a-1)(8a+5)}. \] Show that when \(y\) is an even number the marginal probability \(\mathrm{P}(Y=y)\) is \[ \frac{3(2a-y)(2a+2+y)}{2a(a-1)(8a+5)} \] and find the corresponding expression when \(y\) is off. Evaluate \(\mathrm{E}(Y)\) in terms of \(a\).
No problems in this section yet.
Let \(X\) be a random variable with a Laplace distribution, so that its probability density function is given by \[ \f(x) = \frac12 \e^{-\vert x \vert }\;, \text{ \ \ \ \(-\infty < x < \infty \)\ }. \tag{\(*\)} \] Sketch \(\f(x)\). Show that its moment generating function \({\rm M}_X(\theta)\) is given by ${\rm M}_X(\theta) = (1-\theta^2)^{-1}\( and hence find the variance of \)X$. A frog is jumping up and down, attempting to land on the same spot each time. In fact, in each of \(n\) successive jumps he always lands on a fixed straight line but when he lands from the \(i\)th jump (\(i=1\,,2\,,\ldots\,,n\)) his displacement from the point from which he jumped is \(X_i\,\)cm, where \(X_i\) has the distribution \((*)\). His displacement from his starting point after \(n\)~jumps is \(Y\,\)cm (so that \(Y=\sum\limits_{i=1}^n X_i\)). Each jump is independent of the others. Obtain the moment generating function for \(Y/ \sqrt {2n}\) and, by considering its logarithm, show that this moment generating function tends to \(\exp(\frac12\theta^2)\) as \(n\to\infty\). Given that \(\exp(\frac12\theta^2)\) is the moment generating function of the standard Normal random variable, estimate the least number of jumps such that there is a \(5\%\) chance that the frog lands 25~cm or more from his starting point.
In the game of endless cricket the scores \(X\) and \(Y\) of the two sides are such that \[ \P (X=j,\ Y=k)=\e^{-1}\frac{(j+k)\lambda^{j+k}}{j!k!},\] for some positive constant \(\lambda\), where \(j,k = 0\), \(1\), \(2\), \(\ldots\).
Let \(X\) and \(Y\) be independent standard normal random variables: the probability density function, \(\f\), of each is therefore given by \[ \f(x)=\left(2\pi\right)^{-\frac{1}{2}}\e^{-\frac{1}{2}x^{2}}. \]
A candidate finishes examination questions in time \(T\), where \(T\) has probability density function \[ \mathrm{f}(t)=t\mathrm{e}^{-t}\qquad t\geqslant0, \] the probabilities for the various questions being independent. Find the moment generating function of \(T\) and hence find the moment generating function for the total time \(U\) taken to finish two such questions. Show that the probability density function for \(U\) is \[ \mathrm{g}(u)=\frac{1}{6}u^{3}\mathrm{e}^{-u}\qquad u\geqslant0. \] Find the probability density function for the total time taken to answer \(n\) such questions.
Show SolutionNo problems in this section yet.
A random variable \(X\) is distributed uniformly on \([\, 0\, , \, a\,]\). Show that the variance of \(X\) is\phantom{ }\({1 \over 12} a^2\). A sample, \(X_1\) and \(X_2\), of two independent values of the random variable is drawn, and the variance \(V\) of the sample is determined. Show that \(V = {1 \over 4} \l X_1 -X_2 \r ^2\), and hence prove that \(2 V\) is an unbiased estimator of the variance of X. Find an exact expression for the probability that the value of \(V\) is less than \({1 \over 12} a^2\) and estimate the value of this probability correct to one significant figure.
Two coins \(A\) and \(B\) are tossed together. \(A\) has probability \(p\) of showing a head, and \(B\) has probability \(2p\), independent of \(A\), of showing a head, where \(0 < p < \frac12\). The random variable \(X\) takes the value 1 if \(A\) shows a head and it takes the value \(0\) if \(A\) shows a tail. The random variable \(Y\) takes the value 1 if \(B\) shows a head and it takes the value \(0\) if \(B\) shows a tail. The random variable \(T\) is defined by \[ T= \lambda X + {\textstyle\frac12} (1-\lambda)Y. \] Show that \(\E(T)=p\) and find an expression for \(\var(T)\) in terms of \(p\) and \(\lambda\). Show that as \(\lambda\) varies, the minimum of \(\var(T)\) occurs when \[ \lambda =\frac{1-2p}{3-4p}\;. \] The two coins are tossed \(n\) times, where \(n>30\), and \(\overline{T}\) is the mean value of \(T\). Let \(b\) be a fixed positive number. Show that the maximum value of \(\P\big(\vert \overline{T}-p\vert < b\big)\) as \(\lambda\) varies is approximately \(2\Phi(b/s)-1\), where \(\Phi\) is the cumulative distribution function of a standard normal variate and \[ s^2= \frac{p(1-p)(1-2p)}{(3-4p)n}\;. \]
The random variables \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\) are independently and uniformly distributed on the interval \(0 \le x \le 1\). The random variable \(Y\) is defined to be the median of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\). Given that the probability density function of \(Y\) is \(\g(y)\), where \[ \mathrm{g}(y)=\begin{cases} ky^{n}(1-y)^{n} & \mbox{ if }0\leqslant y\leqslant1\\ 0 & \mbox{ otherwise} \end{cases} \] use the result $$ \int_0^1 {y^{r}}{{(1-y)}^{s}}\,\d y = \frac{r!s!}{(r+s+1)!} $$ to show that \(k={(2n+1)!}/{{(n!)}^2}\), and evaluate \(\E(Y)\) and \({\rm Var}\,(Y)\). Hence show that, for any given positive number \(d\), the inequality $$ {\P\left({\vert {Y - 1/2} \vert} < {d/{\sqrt {n}}} \right)} < {\P\left({\vert {{\bar X} - 1/2} \vert} < {d/{\sqrt {n}}} \right)} $$ holds provided \(n\) is large enough, where \({\bar X}\) is the mean of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\). [You may assume that \(Y\) and \(\bar X\) are normally distributed for large \(n\).]
A hostile naval power possesses a large, unknown number \(N\) of submarines. Interception of radio signals yields a small number \(n\) of their identification numbers \(X_i\) (\(i=1,2,...,n\)), which are taken to be independent and uniformly distributed over the continuous range from \(0\) to \(N\). Show that \(Z_1\) and \(Z_2\), defined by $$ Z_1 = {n+1\over n} {\max}\{X_1,X_2,...,X_n\} \hspace{0.3in} {\rm and} \hspace{0.3in} Z_2 = {2\over n} \sum_{i=1}^n X_i \;, $$ both have means equal to \(N\). Calculate the variance of \(Z_1\) and of \(Z_2\). Which estimator do you prefer, and why?
An experiment produces a random number \(T\) uniformly distributed on \([0,1]\). Let \(X\) be the larger root of the equation \[x^{2}+2x+T=0.\] What is the probability that \(X>-1/3\)? Find \(\mathrm{E}(X)\) and show that \(\mathrm{Var}(X)=1/18\). The experiment is repeated independently 800 times generating the larger roots \(X_{1}\), \(X_{2}\), \dots, \(X_{800}\). If \[Y=X_{1}+X_{2}+\dots+X_{800}.\] find an approximate value for \(K\) such that \[\mathrm{P}(Y\leqslant K)=0.08.\]
The random variable \(X\) is uniformly distributed on \([0,1]\). A new random variable \(Y\) is defined by the rule \[ Y=\begin{cases} 1/4 & \mbox{ if }X\leqslant1/4,\\ X & \mbox{ if }1/4\leqslant X\leqslant3/4\\ 3/4 & \mbox{ if }X\geqslant3/4. \end{cases} \] Find \({\mathrm E}(Y^{n})\) for all integers \(n\geqslant 1\). Show that \({\mathrm E}(Y)={\mathrm E}(X)\) and that \[{\mathrm E}(X^{2})-{\mathrm E}(Y^{2})=\frac{1}{24}.\] By using the fact that \(4^{n}=(3+1)^{n}\), or otherwise, show that \({\mathrm E}(X^{n}) > {\mathrm E}(Y^{n})\) for \(n\geqslant 2\). Suppose that \(Y_{1}\), \(Y_{2}\), \dots are independent random variables each having the same distribution as \(Y\). Find, to a good approximation, \(K\) such that \[{\rm P}(Y_{1}+Y_{2}+\cdots+Y_{240000} < K)=3/4.\]
The average number of pedestrians killed annually in road accidents in Poldavia during the period 1974-1989 was 1080 and the average number killed annually in commercial flight accidents during the same period was 180. Discuss the following newspaper headlines which appeared in 1991. (The percentage figures in square brackets give a rough indication of the weight of marks attached to each discussion.)
A fair coin is thrown \(n\) times. On each throw, 1 point is scored for a head and 1 point is lost for a tail. Let \(S_{n}\) be the points total for the series of \(n\) throws, i.e. \(S_{n}=X_{1}+X_{2}+\cdots+X_{n},\) where \[ X_{j}=\begin{cases} 1 & \text{ if the }j \text{ th throw is a head}\\ -1 & \text{ if the }j\text{ th throw is a tail.} \end{cases} \]
\textit{In this question, \(\Phi(z)\) is the cumulative distribution function of a standard normal random variable.} A random variable is known to have a Normal distribution with mean \(\mu\) and standard deviation either \(\sigma_0\) or \(\sigma_1\), where \(\sigma_0 < \sigma_1\,\). The mean, \(\overline{X}\), of a random sample of \(n\) values of \(X\) is to be used to test the hypothesis \(\mathrm{H}_0: \sigma = \sigma_0\) against the alternative \(\mathrm{H}_1: \sigma = \sigma_1\,\). Explain carefully why it is appropriate to use a two sided test of the form: accept \(\mathrm{H}_0\) if \phantom{} \(\mu - c < \overline{X} < \mu+c\,\), otherwise accept \(\mathrm{H}_1\). Given that the probability of accepting \(\mathrm{H}_1\) when \(\mathrm{H}_0\) is true is \(\alpha\), determine \(c\) in terms of \(n\), \(\sigma_0\) and \(z_{\alpha}\), where \(z_\alpha \) is defined by \(\ds\Phi(z_{\alpha}) = 1 - \tfrac{1}{2}\alpha\). The probability of accepting \(\mathrm{H}_0\) when \(\mathrm{H}_1\) is true is denoted by \(\beta\). Show that \(\beta\) is independent of \(n\). Given that \(\Phi(1.960)\approx 0.975\) and that \(\Phi(0.063) \approx 0.525\,\), determine, approximately, the minimum value of \(\ds \frac{\sigma_1}{\sigma_0}\) if \(\alpha\) and \(\beta\) are both to be less than \(0.05\,\).
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.
No problems in this section yet.