The point \(P\) lies on the circumference of a circle of unit radius and centre \(O\). The angle, \(\theta\), between \(OP\) and the positive \(x\)-axis is a random variable, uniformly distributed on the interval \(0\le\theta<2\pi\).
The cartesian coordinates of \(P\) with respect to \(O\) are \((X,Y)\).
Find the probability density function for \(X\), and calculate \(\var (X)\).
Show that \(X\) and \(Y\) are uncorrelated and discuss briefly whether they are independent.
The points \(P_i\) (\(i=1\), \(2\), \(\ldots\) , \(n\)) are chosen independently on the circumference of the circle, as in part (i), and have cartesian coordinates \((X_i, Y_i)\).
The point \(\overline P\) has coordinates \((\overline X, \overline Y)\), where \(\overline X =\dfrac1n \sum\limits _{i=1}^n X_i\) and \(\overline Y =\dfrac1n \sum\limits _{i=1}^n Y_i\).
Show that \(\overline X\) and \(\overline Y\) are uncorrelated.
Show that, for large \(n\), \(\displaystyle \P\left(\vert \overline X \vert \le \sqrt{\frac2n}\right)\approx 0.95\,\).
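A quick Monte Carlo check of the last result (a sketch; the sample size \(n\) and trial count below are arbitrary choices): since \(X=\cos\theta\) has mean \(0\) and variance \(\frac12\), \(\overline X\) is approximately \(N(0, 1/(2n))\), and \(\sqrt{2/n}\) is two standard deviations.

```python
import math
import random

random.seed(0)

n = 200          # points per sample (arbitrary, large enough for the CLT)
trials = 5000    # Monte Carlo repetitions (arbitrary)

hits = 0
for _ in range(trials):
    # mean of n independent X_i = cos(theta_i), theta_i uniform on [0, 2*pi)
    xbar = sum(math.cos(random.uniform(0.0, 2.0 * math.pi)) for _ in range(n)) / n
    if abs(xbar) <= math.sqrt(2.0 / n):
        hits += 1

prob = hits / trials
print(prob)   # close to 0.95 (the two-sigma probability 0.9545)
```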
Two coins \(A\) and \(B\) are tossed together. \(A\) has
probability \(p\) of showing a head, and \(B\) has probability \(2p\), independent of \(A\),
of showing a head,
where \(0 < p < \frac12\).
The random variable \(X\) takes the value 1 if \(A\)
shows a head and it takes the value \(0\) if \(A\) shows a tail.
The random variable \(Y\) takes the value 1 if \(B\)
shows a head and it takes the value \(0\) if \(B\) shows a tail.
The random variable \(T\) is defined by
\[
T= \lambda X + {\textstyle\frac12} (1-\lambda)Y.
\]
Show that \(\E(T)=p\) and find an expression for \(\var(T)\) in terms of \(p\) and \(\lambda\).
Show that as \(\lambda\) varies, the minimum of \(\var(T)\) occurs when
\[
\lambda =\frac{1-2p}{3-4p}\;.
\]
The two coins are tossed \(n\) times, where \(n>30\), and \(\overline{T}\) is the mean value of \(T\).
Let \(b\) be a fixed positive number. Show that the maximum value of
\(\P\big(\vert \overline{T}-p\vert < b\big)\) as \(\lambda\) varies is approximately \(2\Phi(b/s)-1\),
where \(\Phi\) is the cumulative distribution function of a standard normal variate and
\[
s^2= \frac{p(1-p)(1-2p)}{(3-4p)n}\;.
\]
A random variable \(X\) is distributed uniformly on \([\, 0\, , \, a\,]\).
Show that the variance of \(X\) is \({1 \over 12} a^2\).
A sample, \(X_1\) and \(X_2\), of two independent values of the random variable is drawn, and the variance \(V\) of the sample is determined. Show that \(V = {1 \over 4} \left( X_1 -X_2 \right)^2\), and hence prove that \(2V\) is an unbiased estimator of the variance of \(X\).
Find an exact expression for the probability that the value of \(V\) is less than \({1 \over 12} a^2\) and estimate the value of this probability correct to one significant figure.
We need \(V < {1 \over 12} a^2\), that is \({1 \over 4}(X_1-X_2)^2 < {1 \over 12} a^2\), i.e. \(\vert X_1 - X_2 \vert < \frac{a}{\sqrt{3}}\).
Treating \((X_1, X_2)\) as a point uniformly distributed over the square \([0,a]^2\), we want the area of the band \(\vert x_1 - x_2 \vert < a/\sqrt{3}\). The two excluded corner triangles together have area \(a^2\left(1-\frac{1}{\sqrt{3}}\right)^2\), so the band has area \(a^2 - a^2\left(1- \frac{1}{\sqrt{3}}\right)^2 = a^2 \left( \frac{2}{\sqrt{3}} - \frac13 \right)\), i.e. the probability is \(\frac{2\sqrt{3}-1}{3} \approx 0.8\).
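A simulation confirming this value (a sketch; \(a\) and the trial count are arbitrary, and the probability does not depend on \(a\)):

```python
import math
import random

random.seed(1)

a = 1.0
trials = 100000

hits = 0
for _ in range(trials):
    x1, x2 = random.uniform(0, a), random.uniform(0, a)
    v = 0.25 * (x1 - x2)**2        # sample variance of the pair
    if v < a**2 / 12:
        hits += 1

estimate = hits / trials
exact = (2 * math.sqrt(3) - 1) / 3
print(estimate, exact)             # both about 0.82
```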
The random variables \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\) are
independently and uniformly distributed on the interval
\(0 \le x \le 1\). The random variable \(Y\) is defined to be the
median of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\).
Given that the probability density function of \(Y\) is \(\mathrm{g}(y)\), where
\[
\mathrm{g}(y)=\begin{cases}
ky^{n}(1-y)^{n} & \mbox{ if }0\leqslant y\leqslant1\\
0 & \mbox{ otherwise}
\end{cases}
\]
use the result
$$
\int_0^1 {y^{r}}{{(1-y)}^{s}}\,\d y =
\frac{r!s!}{(r+s+1)!}
$$
to show that \(k={(2n+1)!}/{{(n!)}^2}\), and evaluate
\(\E(Y)\) and \({\rm Var}\,(Y)\).
Hence show that,
for any given positive number \(d\), the inequality
$$
{\P\left({\vert {Y - 1/2} \vert} < {d/{\sqrt {n}}} \right)} <
{\P\left({\vert {{\bar X} - 1/2} \vert} < {d/{\sqrt {n}}} \right)}
$$
holds provided \(n\) is large enough, where
\({\bar X}\) is the mean of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\).
[You may assume that \(Y\) and \(\bar X\) are normally distributed
for large \(n\).]
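A simulation comparing the two estimators of \(\frac12\) (a sketch; \(n\) and the trial count are arbitrary choices). The given integral yields \(\E(Y)=\frac12\) and \(\var(Y)=\frac{1}{4(2n+3)}\), while \(\var(\bar X)=\frac{1}{12(2n+1)}\), so for large \(n\) the median has roughly three times the variance of the mean:

```python
import random
import statistics

random.seed(2)

n = 10                 # each sample has 2n + 1 = 21 values (arbitrary)
trials = 20000

medians, means = [], []
for _ in range(trials):
    xs = [random.random() for _ in range(2 * n + 1)]
    medians.append(statistics.median(xs))
    means.append(statistics.fmean(xs))

var_median = statistics.pvariance(medians)
var_mean = statistics.pvariance(means)
print(var_median, var_mean)   # about 1/92 and 1/252: the mean is the sharper estimator
```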
A hostile naval power possesses a large, unknown number \(N\) of
submarines. Interception of radio signals yields a small number \(n\)
of their identification numbers \(X_i\) (\(i=1,2,...,n\)), which are taken
to be independent and uniformly distributed over the continuous range
from \(0\) to \(N\). Show that \(Z_1\) and \(Z_2\), defined by
$$
Z_1 = {n+1\over n} {\max}\{X_1,X_2,...,X_n\}
\hspace{0.3in} {\rm and} \hspace{0.3in}
Z_2 = {2\over n} \sum_{i=1}^n X_i \;,
$$
both have means equal to \(N\).
Calculate the variance of \(Z_1\) and of \(Z_2\). Which estimator
do you prefer, and why?
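A simulation of the two estimators (a sketch; \(N\), \(n\) and the trial count are arbitrary choices). The calculation gives \(\var(Z_1)=\frac{N^2}{n(n+2)}\) and \(\var(Z_2)=\frac{N^2}{3n}\), so for \(n \ge 2\) the estimator \(Z_1\) has the smaller variance:

```python
import random
import statistics

random.seed(3)

N = 1000.0     # true number of submarines (for the simulation only)
n = 5          # intercepted identification numbers (arbitrary)
trials = 50000

z1s, z2s = [], []
for _ in range(trials):
    xs = [random.uniform(0, N) for _ in range(n)]
    z1s.append((n + 1) / n * max(xs))
    z2s.append(2 / n * sum(xs))

mean1, mean2 = statistics.fmean(z1s), statistics.fmean(z2s)
var1, var2 = statistics.pvariance(z1s), statistics.pvariance(z2s)
print(mean1, mean2)   # both close to N = 1000: both estimators are unbiased
print(var1, var2)     # roughly N^2/(n(n+2)) and N^2/(3n): Z_1 is more precise
```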
An experiment produces a random number \(T\) uniformly distributed on \([0,1]\). Let \(X\) be the larger root of the equation
\[x^{2}+2x+T=0.\]
What is the probability that \(X>-1/3\)? Find \(\mathbb{E}(X)\) and show that \(\mathrm{Var}(X)=1/18\). The experiment is repeated independently 800 times generating the larger roots \(X_{1}, X_{2}, \dots, X_{800}\). If
\[Y=X_{1}+X_{2}+\dots+X_{800},\]
find an approximate value for \(K\) such that
\[\mathrm{P}(Y\leqslant K)=0.08.\]
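A numerical check of the final part (a sketch, using \(\E(X)=-\frac13\) and \(\var(X)=\frac{1}{18}\) together with the normal approximation to \(Y\)):

```python
from statistics import NormalDist

n = 800
mu = -n / 3            # E(Y) = 800 * E(X) = -800/3
var = n / 18           # Var(Y) = 800 * Var(X) = 800/18
K = NormalDist(mu, var**0.5).inv_cdf(0.08)
print(round(K, 1))     # about -276
```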
The random variable \(X\) is
uniformly distributed on \([0,1]\). A new random variable
\(Y\) is defined by the rule
\[
Y=\begin{cases}
1/4 & \mbox{ if }X\leqslant1/4,\\
X & \mbox{ if }1/4\leqslant X\leqslant3/4,\\
3/4 & \mbox{ if }X\geqslant3/4.
\end{cases}
\]
Find \({\mathrm E}(Y^{n})\) for all integers \(n\geqslant 1\).
Show that \({\mathrm E}(Y)={\mathrm E}(X)\) and that
\[{\mathrm E}(X^{2})-{\mathrm E}(Y^{2})=\frac{1}{24}.\]
By using the fact that \(4^{n}=(3+1)^{n}\), or otherwise,
show that \({\mathrm E}(X^{n}) > {\mathrm E}(Y^{n})\) for \(n\geqslant 2\).
Suppose that \(Y_{1}\), \(Y_{2}\), \dots are independent random variables
each having the same distribution as \(Y\).
Find, to a good approximation, \(K\) such that
\[{\rm P}(Y_{1}+Y_{2}+\cdots+Y_{240000} < K)=3/4.\]
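The last part numerically (a sketch): \(\E(Y)=\frac12\) and \(\var(Y)=\E(Y^2)-\frac14=\frac{7}{24}-\frac{6}{24}=\frac{1}{24}\), so the sum of \(240\,000\) copies is approximately \(N(120\,000,\, 10\,000)\) and \(K\) is its upper quartile.

```python
from statistics import NormalDist

n = 240000
mean = n / 2           # E(Y) = 1/2
var = n / 24           # Var(Y) = 1/24, so the sum has variance 10000 (sd 100)
K = NormalDist(mean, var**0.5).inv_cdf(0.75)
print(round(K))        # about 120067
```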
Suppose \(X\) is a random variable with probability density
\[
\mathrm{f}(x)=Ax^{2}\exp(-x^{2}/2)
\]
for \(-\infty < x < \infty.\) Find \(A\).
You belong to a group of scientists who believe that the outcome of a certain experiment is a random variable with the probability density just given, while other scientists believe that the probability density is the same except with different mean (i.e. the probability density is \(\mathrm{f}(x-\mu)\) with \(\mu\neq0\)). In each of the following two cases decide whether the result given would shake your faith in your hypothesis, and justify your answer.
A single trial produces the result 87.3.
1000 independent trials produce results having a mean value \(0.23.\)
[Great weight will be placed on clear statements of your reasons and none on the mere repetition of standard tests, however sophisticated, if unsupported by argument. There are several possible approaches to this question. For some of them it is useful to know that if \(Z\) is normal with mean 0 and variance 1 then \(\mathrm{E}(Z^{4})=3.\)]
Solution
Let \(Z \sim N(0,1)\), with pdf \(\phi(x) = \frac{1}{\sqrt{2\pi}} \exp(-x^2/2)\). Then
\begin{align*}
&& 1 &= \int_{-\infty}^\infty Ax^2 \exp(-x^2/2) \d x \\
&&&= A\sqrt{2\pi} \int_{-\infty}^\infty x^2 \frac{1}{\sqrt{2\pi}} \exp(-x^2/2) \d x \\
&&&= A\sqrt{2\pi} \E[Z^2] = A\sqrt{2\pi} \\
\Rightarrow && A &= \frac{1}{\sqrt{2\pi}}
\end{align*}
The probability of seeing a result as extreme as \(87.3\) is \begin{align*}
\mathbb{P}(X > 87.3) &= \frac{1}{\sqrt{2\pi}}\int_{87.3}^{\infty} x^2 \exp(-x^2/2) \d x \\
&= \left [ -\frac{1}{\sqrt{2\pi}}x \exp(-x^2/2)\right]_{87.3}^{\infty}+\int_{87.3}^{\infty}\frac{1}{\sqrt{2\pi}} \exp(-x^2/2) \d x \\
&\approx 0 +(1- \Phi(87.3)) \\
&\approx 0
\end{align*}
It is very unlikely this data point has come from our distribution rather than one with a higher mean, therefore our faith is very shaken.
With 1000 independent trials, by the central limit theorem the sample mean \(S\) is approximately normal. Each observation has mean \(0\) and variance \(\E[X^2] = \int_{-\infty}^\infty x^4 \frac{1}{\sqrt{2\pi}} \exp(-x^2/2) \d x = \E[Z^4] = 3\), so \(S\) is approximately \(N(0, 3/1000)\). The probability of the sample mean exceeding \(0.23\) is
\begin{align*}
&& \mathbb{P}(S > 0.23) &= \mathbb{P}\left (Z > \frac{0.23}{\sqrt{3/1000}} \right) \\
&&&= \mathbb{P}\left (Z > \frac{0.23}{\sqrt{30}/100} \right) \\
&&&\approx \mathbb{P}\left (Z > \frac{0.23}{0.055} \right) \\
&&& \approx 0
\end{align*}
Again, our faith should be shaken: a sample mean of \(0.23\) is over four standard deviations from zero.
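Putting numbers on that conclusion (a sketch of the same calculation):

```python
from statistics import NormalDist

# Under the group's hypothesis the sample mean of 1000 trials is
# approximately N(0, 3/1000); how surprising is a mean of 0.23?
sd = (3 / 1000)**0.5                    # about 0.055
z = 0.23 / sd                           # about 4.2 standard deviations
p_two_sided = 2 * (1 - NormalDist().cdf(z))
print(z, p_two_sided)                   # a tail probability of a few in 100000
```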
The average number of pedestrians killed annually in road accidents in Poldavia during the period 1974-1989 was 1080 and the average number killed annually in commercial flight accidents during the same period was 180. Discuss the following newspaper headlines which appeared in 1991. (The percentage figures in square brackets give a rough indication of the weight of marks attached to each discussion.)
[\(10\%\)] Six Times Safer To Fly Than To Walk. 1974-1989 Figures Prove It.
[\(10\%\)] Our Skies Are Safer. Only 125 People Killed In Air Accidents In 1990.
[\(30\%\)] Road Carnage Increasing. 7 People Killed On Tuesday.
[\(50\%\)] Alarming Rise In Pedestrian Casualties. 1350 Pedestrians Killed In Road Accidents During 1990.
We cannot conclude this: the figures are counts, not rates, and we do not know how many people were flying or walking (or for how long) each year.
This is difficult to assess without knowing the variance of the annual count. Air-accident deaths are likely to have a heavily skewed distribution (one big crash causes many deaths, infrequently), so a single year below the 1974-1989 average of 180 is weak evidence that the skies are safer, although the figure is substantially lower.
With 1080 deaths annually we should expect about 3 deaths per day. A day with \(7\) deaths might seem unlikely, but over the course of a year such a day is very likely to occur (perhaps the weather was bad). It is also probably a case of selective reporting: we are seeing this data point because it is notable, not because it is statistically significant.
This is certainly the most alarming: a roughly \(25\%\) increase is very unlikely by chance alone. (We would expect the annual count to be approximately \(\mathrm{Po}(1080)\), hence approximately \(N(1080, 1080)\), and \(1350\) is many standard deviations above the mean.) However, other factors could drive this: more walking, a larger population, a change in reporting, and so on.
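Making the \(N(1080,1080)\) check explicit (a sketch):

```python
from statistics import NormalDist

mean = 1080
sd = mean**0.5                 # Poisson: variance equals the mean
z = (1350 - mean) / sd         # about 8.2 standard deviations
p = NormalDist().cdf(-z)       # upper-tail probability by symmetry
print(z, p)                    # the tail probability is essentially zero
```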
A fair coin is thrown \(n\) times. On each throw, 1 point is scored for a head and 1 point is lost for a tail. Let \(S_{n}\) be the points total for the series of \(n\) throws, i.e. \(S_{n}=X_{1}+X_{2}+\cdots+X_{n},\)
where
\[
X_{j}=\begin{cases}
1 & \text{ if the }j \text{ th throw is a head}\\
-1 & \text{ if the }j\text{ th throw is a tail.}
\end{cases}
\]
If \(n=10\,000,\) find an approximate value for the probability that
\(S_{n}>100.\)
Find an approximate value for the least \(n\) for which \(\mathrm{P}(S_{n}>0.01n)<0.01.\)
Suppose that instead no points are scored for the first throw, but that on each successive throw, 2 points are scored if both it and the first throw are heads, two points are deducted if both are tails, and no points are scored or lost if the throws differ. Let \(Y_{k}\) be the score on the \(k\)th throw, where \(2\leqslant k\leqslant n.\)
Show that \(Y_{k}=X_{1}+X_{k}.\)
Calculate the mean and variance of each \(Y_{k}\) and determine whether it is true that
\[
\mathrm{P}(Y_{2}+Y_{3}+\cdots+Y_{n}>0.01(n-1))\rightarrow0\quad\mbox{ as }n\rightarrow\infty.
\]
Solution
Notice that \(\mathbb{E}(X_i) = 0, \mathbb{E}(X_i^2) = 1\) and so \(\mathbb{E}(S_n) =0, \textrm{Var}(S_n) = n\).
Then by the central limit theorem (or alternatively the normal approximation to the binomial),
\begin{align*}
&& \mathbb{P}(S_n > 100) &\underbrace{\approx}_{\text{CLT}} \mathbb{P} \left (Z > \frac{100}{\sqrt{10\, 000}} \right) \\
&&&= \mathbb{P}(Z > 1) \\
&&&= 1-\Phi(1) \\
&&&\approx 15.9\%
\end{align*}