Problems


2025 Paper 3 Q11
D: 1500.0 B: 1500.0

  1. Let \(\lambda > 0\). The independent random variables \(X_1, X_2, \ldots, X_n\) all have probability density function $$f(t) = \begin{cases} \lambda e^{-\lambda t} & t \geq 0 \\ 0 & t < 0 \end{cases}$$ and cumulative distribution function \(F(x)\). The value of random variable \(Y\) is the largest of the values \(X_1, X_2, \ldots, X_n\). Show that the cumulative distribution function of \(Y\) is given, for \(y \geq 0\), by $$G(y) = (1 - e^{-\lambda y})^n$$
  2. The values \(L(\alpha)\) and \(U(\alpha)\), where \(0 < \alpha \leq \frac{1}{2}\), are such that $$P(Y < L(\alpha)) = \alpha \text{ and } P(Y > U(\alpha)) = \alpha$$ Show that $$L(\alpha) = -\frac{1}{\lambda}\ln(1 - \alpha^{1/n})$$ and write down a similar expression for \(U(\alpha)\).
  3. Use the approximation \(e^t \approx 1 + t\), for \(|t|\) small, to show that, for sufficiently large \(n\), $$\lambda L(\alpha) \approx \ln(n) - \ln\left(\ln\left(\frac{1}{\alpha}\right)\right)$$
  4. Hence show that the median of \(Y\) tends to infinity as \(n\) increases, but that the width of the interval \(U(\alpha) - L(\alpha)\) tends to a value which is independent of \(n\).
  5. You are given that, for \(|t|\) small, \(\ln(1 + t) \approx t\) and that \(e^3 \approx 20\). Show that, for sufficiently large \(n\), there is an interval of width approximately \(4\lambda^{-1}\) in which \(Y\) lies with probability \(0.9\).


Solution:

  1. Note that \(\displaystyle F(y) = \mathbb{P}(X_i < y) = \int_0^y \lambda e^{-\lambda t} \d t = 1-e^{-\lambda y}\). Notice also that \begin{align*} G(y) &= \mathbb{P}(Y < y) \\ &= \mathbb{P}(\max_i(X_i) < y) \\ &= \mathbb{P}(X_i < y \text{ for all }i) \\ &= \prod_{i=1}^n \mathbb{P}(X_i < y) \\ &= \prod_{i=1}^n (1-e^{-\lambda y})\\ &= (1-e^{-\lambda y})^n \end{align*} as required.
  2. \begin{align*} && \mathbb{P}(Y < L(\alpha)) &= \alpha \\ \Rightarrow && (1-e^{-\lambda L(\alpha)})^n &= \alpha \\ \Rightarrow && 1-e^{-\lambda L(\alpha)} &= \alpha^{\tfrac1n} \\ \Rightarrow && L(\alpha) &= -\frac{1}{\lambda}\ln \left (1-\alpha^{\tfrac1n} \right) \end{align*} Notice also: \begin{align*} && \mathbb{P}(Y > U(\alpha)) &= \alpha \\ \Rightarrow && 1 - (1-e^{-\lambda U(\alpha)})^n &= \alpha \\ \Rightarrow && U(\alpha) &= -\frac{1}{\lambda}\ln \left ( 1-(1-\alpha)^{\tfrac1n} \right) \end{align*}
  3. \begin{align*} \lambda L(\alpha) &= -\ln \left (1-\alpha^{\tfrac1n} \right) \\ &= -\ln \left (1-e^{\tfrac1n \ln \alpha} \right) \\ &\approx - \ln \left ( 1 - 1 - \frac1n \ln \alpha\right) \tag{\(e^t \approx 1 + t\)} \\ &= -\ln \left ( \frac{1}{n} \ln \frac{1}\alpha \right) \\ &= - \ln \frac{1}{n} - \ln \left ( \ln \frac{1}{\alpha} \right )\\ &= \ln n - \ln \left ( \ln \left ( \frac{1}{\alpha} \right ) \right) \end{align*} since if \(n\) is large, \(\frac{\ln \alpha}{n}\) is small.
  4. The median is the value where \(\mathbb{P}(Y < M) = \frac12\), or in other words \(L(\frac12)\), but this is \(\approx \frac{\ln n - \ln (\ln 2)}{\lambda} \to \infty\). \begin{align*} && \lambda U(\alpha) &\approx \ln n - \ln \left ( \ln \left ( \frac{1}{1-\alpha} \right ) \right) \\ \Rightarrow && \lambda(U(\alpha) - L(\alpha)) &\approx -\ln \left ( \ln \left ( \frac{1}{1-\alpha} \right ) \right)+ \ln \left ( \ln \left ( \frac{1}{\alpha} \right ) \right) \\ \Rightarrow && U(\alpha) - L(\alpha) &\to \frac{1}{\lambda} \left ( \ln \left ( \ln \left ( \frac{1}{\alpha} \right ) \right)-\ln \left ( \ln \left ( \frac{1}{1-\alpha} \right ) \right ) \right) \end{align*} which doesn't depend on \(n\).
  5. Take \(\alpha = \frac{1}{20}\), so that \(\mathbb{P}(L(\alpha) < Y < U(\alpha)) = 1 - 2\alpha = 0.9\). Then \begin{align*} U(\alpha) - L(\alpha) &\approx \frac{1}{\lambda} \left (\ln \ln 20 - \ln \ln \frac{20}{19} \right) \\ &= \lambda^{-1} \left (\ln \ln 20 - \ln \ln \left (1 + \frac{1}{19} \right ) \right) \\ &\approx \lambda^{-1} \left (\ln 3 - \ln \frac{1}{19} \right) \tag{\(e^3 \approx 20\) and \(\ln(1+t) \approx t\)} \\ &= \lambda^{-1} \ln (3 \cdot 19) \\ &\approx \lambda^{-1} (1 + 3) \tag{\(\ln 3 \approx 1\) and \(\ln 19 \approx \ln 20 \approx 3\)} \\ &= 4\lambda^{-1} \end{align*} [Note that \(\ln \ln 20 - \ln \ln \frac{20}{19} = 4.0673\ldots\)]
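As a quick numerical cross-check of parts 2 and 5 (a sketch only; the choices \(n = 10^4\), \(\lambda = 1\) and the use of numpy are illustrative assumptions, not part of the question), we can sample \(Y\) directly via the inverse of \(G\) from part 1:

```python
# Monte Carlo sanity check: Y = max of n iid Exp(lam) variables, sampled
# via the inverse CDF: G(y) = (1 - e^{-lam y})^n  =>  Y = -ln(1 - U^(1/n))/lam.
import numpy as np

rng = np.random.default_rng(0)
lam, n, alpha, trials = 1.0, 10_000, 1 / 20, 1_000_000

Y = -np.log(1 - rng.random(trials) ** (1 / n)) / lam

L = -np.log(1 - alpha ** (1 / n)) / lam
U = -np.log(1 - (1 - alpha) ** (1 / n)) / lam

print("P(L < Y < U) ~", np.mean((Y > L) & (Y < U)))  # should be close to 0.9
print("width U - L  =", U - L)                       # close to 4.067/lam
```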

2023 Paper 2 Q12
D: 1500.0 B: 1500.0

Each of the independent random variables \(X_1, X_2, \ldots, X_n\) has the probability density function \(\mathrm{f}(x) = \frac{1}{2}\sin x\) for \(0 \leqslant x \leqslant \pi\) (and zero otherwise). Let \(Y\) be the random variable whose value is the maximum of the values of \(X_1, X_2, \ldots, X_n\).

  1. Explain why \(\mathrm{P}(Y \leqslant t) = \big[\mathrm{P}(X_1 \leqslant t)\big]^n\) and hence, or otherwise, find the probability density function of \(Y\).
Let \(m(n)\) be the median of \(Y\) and \(\mu(n)\) be the mean of \(Y\).
  2. Find an expression for \(m(n)\) in terms of \(n\). How does \(m(n)\) change as \(n\) increases?
  3. Show that \[\mu(n) = \pi - \frac{1}{2^n}\int_0^{\pi} (1-\cos x)^n\,\mathrm{d}x\,.\]
  4. Show that \(\mu(n)\) increases with \(n\).
  5. Show that \(\mu(2) < m(2)\).
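The final claim \(\mu(2) < m(2)\) is easy to check numerically (a sketch; it assumes inverse-CDF sampling via \(x = \arccos(1-2u)\), which follows from \(\mathrm{F}(x) = \frac12(1-\cos x)\) on \([0, \pi]\), and \(n = 2\) is the case the question asks about):

```python
# Simulate Y = max(X_1, ..., X_n) for f(x) = (1/2) sin x on [0, pi] and
# compare the sample mean and median of Y for n = 2.
import numpy as np

rng = np.random.default_rng(0)
n, trials = 2, 1_000_000
X = np.arccos(1 - 2 * rng.random((trials, n)))  # samples with cdf (1 - cos x)/2
Y = X.max(axis=1)

print("mu(2) ~", Y.mean())       # sample mean of the maximum
print("m(2)  ~", np.median(Y))   # sample median; expect mu(2) < m(2)
```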

2020 Paper 3 Q12
D: 1500.0 B: 1500.0

\(A\) and \(B\) both toss the same biased coin. The probability that the coin shows heads is \(p\), where \(0 < p < 1\), and the probability that it shows tails is \(q = 1 - p\). Let \(X\) be the number of times \(A\) tosses the coin until it shows heads. Let \(Y\) be the number of times \(B\) tosses the coin until it shows heads.

  1. The random variable \(S\) is defined by \(S = X + Y\) and the random variable \(T\) is the maximum of \(X\) and \(Y\). Find an expression for \(\mathrm{P}(S = s)\) and show that \[ \mathrm{P}(T = t) = pq^{t-1}(2 - q^{t-1} - q^t). \]
  2. The random variable \(U\) is defined by \(U = |X - Y|\), and the random variable \(W\) is the minimum of \(X\) and \(Y\). Find expressions for \(\mathrm{P}(U = u)\) and \(\mathrm{P}(W = w)\).
  3. Show that \(\mathrm{P}(S = 2 \text{ and } T = 3) \neq \mathrm{P}(S = 2) \times \mathrm{P}(T = 3)\).
  4. Show that \(U\) and \(W\) are independent, and show that no other pair of the four variables \(S\), \(T\), \(U\) and \(W\) are independent.
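The stated formula for \(\mathrm{P}(T = t)\) can be checked by simulation (a sketch; \(p = \frac13\) and \(t = 3\) are illustrative choices, and numpy's geometric distribution counts tosses up to and including the first head, matching the question):

```python
# X and Y are geometric (tosses until the first head, support 1, 2, ...);
# compare the simulated P(T = t) for T = max(X, Y) with the given formula.
import numpy as np

rng = np.random.default_rng(0)
p, trials = 1 / 3, 1_000_000
q = 1 - p
X = rng.geometric(p, trials)
Y = rng.geometric(p, trials)
T = np.maximum(X, Y)

t = 3
print("simulated P(T=3):", np.mean(T == t))
print("formula   P(T=3):", p * q ** (t - 1) * (2 - q ** (t - 1) - q ** t))
```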

2018 Paper 3 Q12
D: 1700.0 B: 1516.0

A random process generates, independently, \(n\) numbers each of which is drawn from a uniform (rectangular) distribution on the interval 0 to 1. The random variable \(Y_k\) is defined to be the \(k\)th smallest number (so there are \(k-1\) smaller numbers).

  1. Show that, for \(0\le y\le1\,\), \[ {\rm P}\big(Y_k\le y\big) =\sum^{n}_{m=k}\binom{n}{m}y^{m}\left(1-y\right)^{n-m} . \tag{\(*\)} \]
  2. Show that \[ m\binom n m = n \binom {n-1}{m-1} \] and obtain a similar expression for \(\displaystyle (n-m) \, \binom n m\,\). Starting from \((*)\), show that the probability density function of \(Y_k\) is \[ n\binom{ n-1}{k-1} y^{k-1}\left(1-y\right)^{ n-k} \,.\] Deduce an expression for \(\displaystyle \int_0^1 y^{k-1}(1-y)^{n-k} \, \d y \,\).
  3. Find \(\E(Y_k) \) in terms of \(n\) and \(k\).


Solution:

  1. \begin{align*} && \mathbb{P}(Y_k \leq y) &= \sum_{m=k}^n\mathbb{P}(\text{exactly }m \text{ values less than }y) \\ &&&= \sum_{m=k}^{n} \binom{n}{m} y^{m}(1-y)^{n-m} \end{align*}
  2. Both sides count the number of ways to choose, from \(n\) people, a committee of \(m\) people together with a chair from among those \(m\). First: choose the committee in \(\binom{n}{m}\) ways and then the chair in \(m\) ways, giving \(m \binom{n}{m}\). Alternatively, choose the chair in \(n\) ways and then the remaining \(m-1\) committee members in \(\binom{n-1}{m-1}\) ways. Therefore \(m \binom{n}{m} = n \binom{n-1}{m-1}\). Similarly, \begin{align*} (n-m) \binom{n}{m} &= (n-m) \binom{n}{n-m} \\ &= n \binom{n-1}{n-m-1} \\ &= n \binom{n-1}{m} \end{align*} \begin{align*} f_{Y_k}(y) &= \frac{\d }{\d y} \l \sum^{n}_{m=k}\binom{n}{m}y^{m}\left(1-y\right)^{n-m} \r \\ &= \sum^{n}_{m=k} \l \binom{n}{m}my^{m-1}\left(1-y\right)^{n-m} -\binom{n}{m}(n-m)y^{m}\left(1-y\right)^{n-m-1} \r \\ &= \sum^{n}_{m=k} \l n \binom{n-1}{m-1}y^{m-1}\left(1-y\right)^{n-m} -n \binom{n-1}{m} y^{m}\left(1-y\right)^{n-m-1} \r \\ &= n\sum^{n}_{m=k} \binom{n-1}{m-1}y^{m-1}\left(1-y\right)^{n-m} -n\sum^{n+1}_{m=k+1} \binom{n-1}{m-1} y^{m-1}\left(1-y\right)^{n-m} \\ &= n \binom{n-1}{k-1} y^{k-1}(1-y)^{n-k} \end{align*} (after reindexing the second sum, everything cancels except the \(m=k\) term of the first sum; the extra \(m=n+1\) term is zero since \(\binom{n-1}{n} = 0\)). \begin{align*} &&1 &= \int_0^1 f_{Y_k}(y) \d y \\ &&&= \int_0^1 n \binom{n-1}{k-1} y^{k-1}(1-y)^{n-k} \d y \\ &&&= n \binom{n-1}{k-1} \int_0^1 y^{k-1}(1-y)^{n-k} \d y \\ \Rightarrow && \frac{1}{n \binom{n-1}{k-1}} &= \int_0^1 y^{k-1}(1-y)^{n-k} \d y \\ \end{align*}
  3. \begin{align*} && \mathbb{E}(Y_k) &= \int_0^1 y f_{Y_k}(y) \d y \\ &&&= \int_0^1 n \binom{n-1}{k-1} y^{k}(1-y)^{n-k} \d y \\ &&&= n \binom{n-1}{k-1}\int_0^1 y^{k+1-1}(1-y)^{n+1-(k+1)} \d y \\ &&&= n \binom{n-1}{k-1} \frac{1}{(n+1) \binom{n}{k}}\\ &&&= \frac{k \binom{n}{k}}{(n+1)\binom{n}{k}} \tag{\(n\binom{n-1}{k-1} = k \binom{n}{k}\), part 2 with \(m=k\)} \\ &&&= \frac{k}{n+1} \end{align*}
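A quick simulation of part 3's result \(\mathbb{E}(Y_k) = \frac{k}{n+1}\) (a sketch; \(n = 5\), \(k = 2\) are illustrative choices):

```python
# Sort n uniforms per trial and average the k-th smallest value.
import numpy as np

rng = np.random.default_rng(0)
n, k, trials = 5, 2, 500_000
samples = np.sort(rng.random((trials, n)), axis=1)
Yk = samples[:, k - 1]  # k-th smallest value in each trial

print("simulated E(Y_k):", Yk.mean())
print("exact     E(Y_k):", k / (n + 1))  # 2/6 = 0.333...
```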

2016 Paper 1 Q13
D: 1500.0 B: 1500.0

An internet tester sends \(n\) e-mails simultaneously at time \(t=0\). Their arrival times at their destinations are independent random variables each having probability density function \(\lambda \e^{-\lambda t}\) (\(0\le t<\infty\), \( \lambda >0\)).

  1. The random variable \(T\) is the time of arrival of the e-mail that arrives first at its destination. Show that the probability density function of \(T\) is \[ n \lambda \e^{-n\lambda t}\,,\] and find the expected value of \(T\).
  2. Write down the probability that the second e-mail to arrive at its destination arrives later than time \(t\) and hence derive the density function for the time of arrival of the second e-mail. Show that the expected time of arrival of the second e-mail is \[ \frac{1}{\lambda} \left( \frac1{n-1} + \frac 1 n \right) \]


Solution:

  1. \(\,\) \begin{align*} && \mathbb{P}(T > t) &= \mathbb{P}(\text{all emails slower than }t) \\ &&&= \left ( \int_t^{\infty} \lambda e^{-\lambda x} \d x \right)^n \\ &&&= \left ( [- e^{-\lambda x}]_t^\infty\right)^n\\ &&&= e^{-n\lambda t} \\ \Rightarrow && f_T(t) &= n \lambda e^{-n\lambda t} \\ \end{align*} Therefore \(T \sim \text{Exp}(n \lambda)\) and \(\E[T] = \frac{1}{n \lambda}\)
  2. Let \(T_2\) be the time until the second email arrives. Then \begin{align*} && \P(T_2 > t) &= \P(\text{all emails} > t) + \P(\text{all but 1 emails} > t) \\ &&&= e^{-n\lambda t} + n \cdot e^{-(n-1)\lambda t}(1-e^{-\lambda t}) \\ &&&= (1-n)e^{-n\lambda t} + n \cdot e^{-(n-1)\lambda t} \\ \Rightarrow && f_{T_2}(t) &= - \left ( -n\lambda(1-n) e^{-n \lambda t} -n(n-1)\lambda e^{-(n-1)\lambda t} \right) \\ &&&= n(n-1) \lambda \left (e^{-(n-1)\lambda t} - e^{-n\lambda t} \right) \\ \Rightarrow && \E[T_2] &= \int_0^{\infty} t \cdot n(n-1) \lambda \left (e^{-(n-1)\lambda t} - e^{-n\lambda t} \right) \d t \\ &&&= \int_0^{\infty} \left (n \cdot t (n-1) \lambda e^{-(n-1)\lambda t} -(n-1)\cdot tn \lambda e^{-n\lambda t} \right) \d t \\ &&&= \frac{n}{\lambda(n-1)} - \frac{n-1}{\lambda n} \\ &&&= \frac{1}{\lambda} \left (1+\frac{1}{n-1}- \left (1 - \frac{1}{n} \right) \right) \\ &&&= \frac{1}{\lambda} \left ( \frac{1}{n-1} + \frac{1}{n} \right) \end{align*} (Alternatively, \(T_2\) is the time to the first arrival plus the further wait until the first of the remaining \(n-1\) emails arrives; by the memorylessness property of the exponential distribution that further wait is distributed as the minimum of \(n-1\) fresh exponentials, giving \(\E[T_2] = \frac{1}{n\lambda} + \frac{1}{(n-1)\lambda}\) directly.)
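Both expectations are easy to confirm by simulation (a sketch; \(n = 6\), \(\lambda = 2\) are illustrative choices):

```python
# Sort the n iid Exp(lam) arrival times; columns 0 and 1 are the first and
# second arrivals, with expectations 1/(n*lam) and (1/(n-1) + 1/n)/lam.
import numpy as np

rng = np.random.default_rng(0)
n, lam, trials = 6, 2.0, 500_000
arrivals = np.sort(rng.exponential(1 / lam, (trials, n)), axis=1)

print("E(T)   ~", arrivals[:, 0].mean(), "exact:", 1 / (n * lam))
print("E(T_2) ~", arrivals[:, 1].mean(), "exact:", (1 / (n - 1) + 1 / n) / lam)
```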

2008 Paper 1 Q12
D: 1516.0 B: 1484.0

In this question, you may use without proof the results: \[ \sum_{r=1}^n r = \tfrac12 n(n+1) \qquad\text{and}\qquad \sum_{r=1}^n r^2 = \tfrac1 6 n(n+1)(2n+1)\,. \] The independent random variables \(X_1\) and \(X_2\) each take values \(1\), \(2\), \(\ldots\), \(N\), each value being equally likely. The random variable \(X\) is defined by \[ X= \begin{cases} X_1 & \text { if } X_1\ge X_2\\ X_2 & \text { if } X_2\ge X_1\;. \end{cases} \]

  1. Show that \(\P(X=r) = \dfrac{2r-1}{N^2}\,\) for \(r=1\), \(2\), \(\ldots\), \(N\).
  2. Find an expression for the expectation, \(\mu\), of \(X\) and show that \(\mu=67.165\) in the case \(N=100\).
  3. The median, \(m\), of \(X\) is defined to be the integer such that \(\P(X\ge m) \ge \frac 12\) and \(\P(X\le m)\ge \frac12\). Find an expression for \(m\) in terms of \(N\) and give an explicit value for \(m\) in the case \(N=100\).
  4. Show that when \(N\) is very large, \[ \frac \mu m \approx \frac {2\sqrt2}3\,. \]


Solution:

  1. \begin{align*} \P(X = r) &= \P(X_1 = r, X_2 \leq r) + \P(X_2 = r, X_1 < r) \\ &= \P(X_1 = r) \P(X_2 \leq r) + \P(X_2 = r)\P( X_1 < r) \\ &= \frac{1}{N} \cdot \frac{r}{N} + \frac{1}{N} \cdot \frac{r-1}{N} \\ &= \frac{2r-1}{N^2} \end{align*}
  2. \begin{align*} \mu = \E(X) &= \sum_{r=1}^N r \P(X = r) \\ &= \sum_{r=1}^N \frac{2r^2 - r}{N^2} \\ &= \frac{1}{N^2} \l \frac{N(N+1)(2N+1)}{3} - \frac{N(N+1)}{2} \r \\ &= \frac{N+1}{N} \l \frac{4N-1}{6} \r \end{align*} When \(N = 100\), this is \(\frac{101 \cdot 399}{6 \cdot 100} = \frac{101 \cdot 133}{200} = 67.165\).
  3. \begin{align*} && \frac12 &\leq \P(X \leq m) = \sum_{r=1}^m \frac{2r-1}{N^2} = \frac{1}{N^2} \l m(m+1) - m \r = \frac{m^2}{N^2} \\ \Rightarrow && m &\geq \frac{N}{\sqrt{2}} \\ \Rightarrow && m &= \left \lceil \frac{N}{\sqrt{2}} \right \rceil \end{align*} (This choice also satisfies \(\P(X \geq m) = 1 - \frac{(m-1)^2}{N^2} \geq \frac12\), since \(m - 1 < \frac{N}{\sqrt{2}}\).) When \(N = 100\), we need \(\left \lceil 50\sqrt{2} \right \rceil\): since \(1.4 < \sqrt{2} < 1.42\), we have \(70 < 50\sqrt{2} < 71\), so \(m = 71\).
  4. \begin{align*} \frac{\mu}{m} = \frac{\frac{(N+1)(4N-1)}{6N}}{\left \lceil\frac{N}{\sqrt{2}} \right \rceil} &\approx \frac{\sqrt{2}}{3}\l \frac{4N^2 +3N - 1}{2N^2} \r \tag{the ceiling changes the denominator by less than 1, negligible for large \(N\)}\\ &= \frac{\sqrt{2}}{3}\l 2 + \frac{3}{2N} - \frac{1}{2N^2} \r \\ &\to \frac{2\sqrt{2}}{3} \quad \text{as } N \to \infty \end{align*}
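The numerical claims for \(N = 100\) can be checked directly from \(\P(X = r) = \frac{2r-1}{N^2}\) (a sketch in plain Python):

```python
# Compute mu exactly and find the median m as the smallest r with
# P(X <= r) >= 1/2; compare mu/m with 2*sqrt(2)/3.
N = 100
pmf = [(2 * r - 1) / N**2 for r in range(1, N + 1)]

mu = sum(r * p for r, p in zip(range(1, N + 1), pmf))

cum, m = 0.0, None
for r, p in zip(range(1, N + 1), pmf):
    cum += p
    if cum >= 0.5:
        m = r
        break

print("mu =", mu)                              # 67.165
print("m  =", m)                               # 71
print("mu/m =", mu / m, "vs", 2 * 2**0.5 / 3)  # 0.946 vs 0.943
```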

2007 Paper 3 Q14
D: 1700.0 B: 1500.0

  1. My favourite dartboard is a disc of unit radius and centre \(O\). I never miss the board, and the probability of my hitting any given area of the dartboard is proportional to the area. Each throw is independent of any other throw. I throw a dart \(n\) times (where \(n>1\)). Find the expected area of the smallest circle, with centre \(O\), that encloses all the \(n\) holes made by my dart. Find also the expected area of the smallest circle, with centre \(O\), that encloses all the \((n-1)\) holes nearest to \(O\).
  2. My other dartboard is a square of side 2 units, with centre \(Q\). I never miss the board, and the probability of my hitting any given area of the dartboard is proportional to the area. Each throw is independent of any other throw. I throw a dart \(n\) times (where \(n>1\)). Find the expected area of the smallest square, with centre \(Q\), that encloses all the \(n\) holes made by my dart.
  3. Determine, without detailed calculations, whether the expected area of the smallest circle, with centre \(Q\), on my square dartboard that encloses all the \(n\) holes made by my darts is larger or smaller than that for my circular dartboard.


Solution:

  1. Firstly, we consider the probability that all darts lie within a distance \(s\) of the centre, ie \begin{align*} \mathbb{P}(\text{all darts within }s) &= \prod_{k=1}^n \mathbb{P}(\text{dart within }s) \\ &= \left ( \frac{\pi s^2}{\pi} \right)^n \\ &= s^{2n} \end{align*} Therefore the pdf is \(2ns^{2n-1}\), and the expected area is \(\int_{s=0}^1 \pi s^2 \cdot 2n s^{2n-1} \d s = 2n \pi \frac{1}{2n+2} = \frac{n}{n+1} \pi\). \begin{align*} \mathbb{P}(n-1 \text{ within }s) &= \underbrace{s^{2n}}_{\text{all within }s} + \underbrace{ns^{2n-2}(1-s^2)}_{\text{all but 1 within }s}\\ &= ns^{2n-2}-(n-1)s^{2n} \end{align*} Therefore the pdf is \(n(2n-2)s^{2n-3} - 2n(n-1)s^{2n-1} = 2n(n-1)(s^{2n-3}-s^{2n-1})\) and the expected area is: \begin{align*} \int_{s=0}^1 \pi s^2 \cdot2n(n-1)(s^{2n-3}-s^{2n-1})\d s &= 2n(n-1) \pi \left ( \frac{1}{2n} - \frac{1}{2n+2} \right) \\ &= 2n(n-1)\pi \cdot \frac{1}{2n(n+1)} \\ &= \frac{n-1}{n+1} \pi \end{align*}
  2. Now consider the square with centre \(Q\) and side-length \(s\): we must have \(\mathbb{P}(\text{all darts within square}) = \left ( \frac{s^2}{4} \right)^n = \frac{s^{2n}}{4^n}\) and therefore the pdf is \(\frac{2n s^{2n-1}}{4^n}\). Therefore the expected area is \(\displaystyle \int_0^2 s^2 \cdot \frac{2n s^{2n-1}}{4^n} \d s = \frac{2n}{4^n} \cdot \frac{2^{2n+2}}{2n+2} = \frac{4n}{n+1}\)
  3. It is larger: on the square board, for \(s \leq 1\) the probability that a dart lands within distance \(s\) of \(Q\) is \(\frac{\pi s^2}{4} < s^2\), and darts can also land at distances up to \(\sqrt{2}\) from \(Q\). So the greatest distance from the centre is stochastically larger on the square board, and hence so is the expected area of the smallest enclosing circle.
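All three expected areas can be checked by simulation (a sketch; \(n = 4\) is an illustrative choice):

```python
# Circle board: the squared radius of a uniform point on the unit disc is
# itself uniform on [0, 1], so sorted uniforms give the r^2 values directly.
# Square board of side 2: the smallest centred enclosing square has side
# 2 * max|coordinate|.
import numpy as np

rng = np.random.default_rng(0)
n, trials = 4, 200_000

r2 = np.sort(rng.random((trials, n)), axis=1)  # squared distances, sorted
print("all n holes :", np.pi * r2[:, -1].mean(), "exact:", np.pi * n / (n + 1))
print("nearest n-1 :", np.pi * r2[:, -2].mean(), "exact:", np.pi * (n - 1) / (n + 1))

xy = rng.uniform(-1, 1, (trials, n, 2))
side = 2 * np.abs(xy).max(axis=(1, 2))
print("square board:", (side**2).mean(), "exact:", 4 * n / (n + 1))
```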

2004 Paper 1 Q13
D: 1500.0 B: 1458.1

  1. Three real numbers are drawn independently from the continuous rectangular distribution on \([ 0, 1 ]\,\). The random variable \(X\) is the maximum of the three numbers. Show that the probability that \(X \le 0.8\) is \(0.512\,\), and calculate the expectation of \(X\).
  2. \(N\) real numbers are drawn independently from a continuous rectangular distribution on \([ 0, a ]\,\). The random variable \(X\) is the maximum of the \(N\) numbers. A hypothesis test with a significance level of 5\% is carried out using the value, \(x\), of \(X \). The null hypothesis is that \(a=1\) and the alternative hypothesis is that \(a<1 \,\). The form of the test is such that \(H_0\) is rejected if \(x < c\,\), for some chosen number \(c\,\). Using the approximation \(2^{10} \approx 10^3\,\), determine the smallest integer value of \(N\) such that if \(x \le 0.8\) the null hypothesis will be rejected. With this value of \(N\), write down the probability that the null hypothesis is rejected if \(a = 0.8\,\), and find the probability that the null hypothesis is rejected if \(a = 0.9\,\).


Solution:

  1. \begin{align*} \P(X \leq 0.8) &= \P(X_1 \leq 0.8)\,\P(X_2 \leq 0.8)\,\P(X_3 \leq 0.8) \\ &= 0.8^3 \\ &= 0.512 \end{align*} \begin{align*} && \P(X \leq x) &= x^3 \\ \Rightarrow && f_X(x) &= 3x^2 \\ \Rightarrow && \E[X] &= \int_0^1 x \cdot 3x^2 \d x \\ && &= \left [ \frac{3}{4}x^4 \right]_0^1 \\ &&&= \frac{3}{4} \end{align*}
  2. Now \(X\) is distributed as the maximum of \(N\) numbers drawn from \([0,a]\), and we test \begin{align*} H_0 : & \ a = 1 \\ H_1 : & \ a < 1 \end{align*} The significance level fixes the critical value \(c\): \[ \P(X < c \mid a = 1) = c^N = \frac1{20} \] For \(x \leq 0.8\) to guarantee rejection we need \(c \geq 0.8\), ie \(0.8^N \leq \frac{1}{20}\), ie \(N \geq \frac{\log(20)}{\log(5/4)}\). Now \begin{align*} \frac{\log(20)}{\log(5/4)} &= \frac{\log(5)+\log(4)}{\log(5)-\log(4)} \\ &= \frac{ \frac{\log(5)}{\log(4)}+1}{\frac{\log(5)}{\log(4)} - 1} \end{align*} and from the given approximation: \begin{align*} && 2^{10} &\approx 10^{3} \\ \Rightarrow && 10\log(2) &\approx 3 (\log(5) + \log(2)) \\ \Rightarrow && 7\log(2) &\approx 3 \log(5) \\ \Rightarrow && \frac{\log(5)}{\log(4)} = \frac{\log(5)}{2\log(2)} &\approx \frac{7}{6} \end{align*} so \[ \frac{\log(20)}{\log(5/4)} \approx \frac{\frac{7}{6} + 1}{\frac{7}{6} -1} = 13 \] Since \(2^{10} > 10^3\), \(\frac{\log(5)}{\log(4)}\) is slightly less than \(\frac{7}{6}\), so the true value is a little more than \(13\); hence \(N=14\) is the value we seek. With \(N = 14\) the critical value is \(c = 20^{-1/14} \approx 0.807\). If \(a = 0.8\) then \(X \leq 0.8 < c\) always, so the probability of rejecting \(H_0\) is \(1\). If \(a = 0.9\), the probability of rejecting \(H_0\) is \(\P(X < c \mid a = 0.9) = \l \frac{c}{0.9} \r^{14} = \frac{1}{20 \times 0.9^{14}} \approx 0.22\)
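A direct computation confirms both \(N\) and the rejection probabilities (a sketch; it uses the exact critical value \(c = 20^{-1/14}\) rather than the \(2^{10} \approx 10^3\) estimate):

```python
import math

N = math.ceil(math.log(20) / math.log(1 / 0.8))
print("N =", N)                     # log(20)/log(1.25) = 13.42..., so N = 14

c = (1 / 20) ** (1 / N)             # critical value, about 0.807
print("c =", c)
print("P(reject | a = 0.8) =", min((c / 0.8) ** N, 1.0))  # 1, since c > 0.8
print("P(reject | a = 0.9) =", (c / 0.9) ** N)            # 1/(20 * 0.9^14), ~0.22
```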

2001 Paper 2 Q13
D: 1600.0 B: 1517.3

The life times of a large batch of electric light bulbs are independently and identically distributed. The probability that the life time, \(T\) hours, of a given light bulb is greater than \(t\) hours is given by \[ \P(T>t) \; = \; \frac{1}{(1+kt)^\alpha}\;, \] where \(\alpha\) and \(k\) are constants, and \(\alpha >1\). Find the median \(M\) and the mean \(m\) of \(T\) in terms of \(\alpha\) and \(k\). Nine randomly selected bulbs are switched on simultaneously and are left until all have failed. The fifth failure occurs at 1000 hours and the mean life time of all the bulbs is found to be 2400 hours. Show that \(\alpha\approx2\) and find the approximate value of \(k\). Hence estimate the probability that, if a randomly selected bulb is found to last \(M\) hours, it will last a further \(m-M\) hours.


Solution: The median \(M\) is the value such that \begin{align*} && \frac12 &= \mathbb{P}(T > M) \\ &&&= \frac1{(1+kM)^\alpha} \\ \Rightarrow && 2 &= (1+kM)^{\alpha} \\ \Rightarrow && M &= \frac{2^{1/\alpha}-1}{k} \end{align*} The density of \(T\) is \(f_T(t) = \frac{k \alpha}{(1+kt)^{\alpha+1}}\) and so \begin{align*} && m &= \int_0^\infty t f_T(t) \d t \\ &&&= \int_0^\infty \frac{tk \alpha}{(1+kt)^{\alpha+1}} \d t \\ &&&= \int_0^\infty \frac{\alpha+tk \alpha-\alpha}{(1+kt)^{\alpha+1}} \d t \\ &&&= \alpha \int_0^\infty (1+kt)^{-\alpha} \d t - \alpha \int_0^\infty (1+kt)^{-(\alpha+1)} \d t \\ &&&= \alpha \left [ -\frac1{k(\alpha-1)}(1+kt)^{-\alpha+1}\right]_0^\infty- \alpha \left [ -\frac1{k\alpha}(1+kt)^{-\alpha}\right]_0^\infty \\ &&&= \frac{\alpha}{k(\alpha-1)} - \frac{1}{k} \\ &&&= \frac{1}{k(\alpha-1)} \end{align*} The fifth of the nine failures estimates the median, so \(M \approx 1000\), and the sample mean gives \(m \approx 2400\): \begin{align*} && \frac{2^{1/\alpha}-1}{k} &= 1000 \\ && \frac{1}{k(\alpha-1)} &= 2400 \\ \Rightarrow && \frac{1}{(\alpha-1)(2^{1/\alpha}-1)} &= 2.4 \end{align*} Trying \(\alpha = 2\): \(\frac{1}{\sqrt2-1} = \sqrt{2}+1 \approx 2.4\), so \(\alpha \approx 2\), and then the second equation gives \(k = \frac{1}{2400}\). Finally \begin{align*} && \mathbb{P}(T > m | T > M) &= \frac{\mathbb{P}(T > m)}{\mathbb{P}(T > M)} \\ &&&= \frac{2}{(1+km)^{\alpha}} \\ &&&= \frac{2}{(1 + \frac{1}{\alpha-1})^\alpha} \\ &&&\approx \frac{2}{4} =\frac12 \end{align*}
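Plugging \(\alpha = 2\), \(k = \frac{1}{2400}\) back in gives a quick numerical check (a sketch):

```python
# With alpha = 2 and k = 1/2400: M should be close to 1000, m exactly 2400,
# and the conditional probability close to 1/2.
alpha, k = 2, 1 / 2400

M = (2 ** (1 / alpha) - 1) / k
m = 1 / (k * (alpha - 1))

def surv(t):
    """P(T > t) = 1/(1 + k t)^alpha."""
    return 1 / (1 + k * t) ** alpha

print("M =", M)                                  # about 994
print("m =", m)                                  # 2400
print("P(T > m | T > M) =", surv(m) / surv(M))   # about 1/2
```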

2001 Paper 3 Q14
D: 1700.0 B: 1484.0

A random variable \(X\) is distributed uniformly on \([\, 0\, , \, a\,]\). Show that the variance of \(X\) is \({1 \over 12} a^2\). A sample, \(X_1\) and \(X_2\), of two independent values of the random variable is drawn, and the variance \(V\) of the sample is determined. Show that \(V = {1 \over 4} \l X_1 -X_2 \r ^2\), and hence prove that \(2 V\) is an unbiased estimator of the variance of X. Find an exact expression for the probability that the value of \(V\) is less than \({1 \over 12} a^2\) and estimate the value of this probability correct to one significant figure.


Solution: \begin{align*} && \E[X] &= \frac{a}{2}\tag{by symmetry} \\ &&\E[X^2] &= \int_0^a \frac{1}{a} x^2 \d x \\ &&&= \frac{a^3}{3a} = \frac{a^2}{3} \\ \Rightarrow && \var[X] &= \frac{a^2}{3} - \frac{a^2}{4} = \frac{a^2}{12} \\ \end{align*} \begin{align*} && V &=\frac{1}{2} \left ( \left ( X_1 - \frac{X_1+X_2}{2} \right )^2+\left ( X_2- \frac{X_1+X_2}{2} \right )^2 \right ) \\ &&&= \frac{1}{8} ((X_1 - X_2)^2 + (X_2 - X_1)^2 ) \\ &&&= \frac14 (X_1-X_2)^2 \\ \\ && \E[2V] &= \E \left [ \frac12 (X_1 - X_2)^2 \right] \\ &&&= \frac12 \E[X_1^2] - \E[X_1X_2] + \frac12 \E[X_2^2] \\ &&&= \frac{a^2}{3} - \frac{a^2}{4} = \frac{a^2}{12} \end{align*} Therefore \(2V\) is an unbiased estimator of the variance of \(X\).

[TikZ diagram: the square \(0 \leq X_1, X_2 \leq a\) with the band \(|X_1 - X_2| < \frac{a}{\sqrt{3}}\) around the diagonal shaded blue.]
We need \(V < \frac{1}{12}a^2\), ie \(|X_1 - X_2| < \frac{a}{\sqrt{3}}\). We are interested in the blue area: the two excluded corner triangles together form a square of side \(a \l 1 - \frac{1}{\sqrt{3}} \r\), so the blue area is \(a^2 - a^2\l 1- \frac{1}{\sqrt{3}}\r^2 = a^2 \left ( \frac{2}{\sqrt{3}} - \frac13 \right)\), ie the probability is \(\frac{2\sqrt{3}-1}{3} \approx 0.8\)
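A short simulation confirms this probability (a sketch; \(a = 1\) is an illustrative choice):

```python
# V = (X1 - X2)^2 / 4 for X1, X2 uniform on [0, a]; estimate P(V < a^2/12).
import numpy as np

rng = np.random.default_rng(0)
a, trials = 1.0, 1_000_000
X1, X2 = rng.uniform(0, a, (2, trials))
V = (X1 - X2) ** 2 / 4

print("simulated:", np.mean(V < a**2 / 12))
print("exact    :", (2 * np.sqrt(3) - 1) / 3)  # 0.8213..., about 0.8
```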

2000 Paper 2 Q14
D: 1600.0 B: 1484.0

The random variables \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\) are independently and uniformly distributed on the interval \(0 \le x \le 1\). The random variable \(Y\) is defined to be the median of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\). Given that the probability density function of \(Y\) is \(\mathrm{g}(y)\), where \[ \mathrm{g}(y)=\begin{cases} ky^{n}(1-y)^{n} & \mbox{ if }0\leqslant y\leqslant1\\ 0 & \mbox{ otherwise} \end{cases} \] use the result $$ \int_0^1 {y^{r}}{{(1-y)}^{s}}\,\d y = \frac{r!s!}{(r+s+1)!} $$ to show that \(k={(2n+1)!}/{{(n!)}^2}\), and evaluate \(\E(Y)\) and \({\rm Var}\,(Y)\). Hence show that, for any given positive number \(d\), the inequality $$ {\P\left({\vert {Y - 1/2} \vert} < {d/{\sqrt {n}}} \right)} < {\P\left({\vert {{\bar X} - 1/2} \vert} < {d/{\sqrt {n}}} \right)} $$ holds provided \(n\) is large enough, where \({\bar X}\) is the mean of \(X_1\), \(X_2\), \(\ldots\) , \(X_{2n+1}\). [You may assume that \(Y\) and \(\bar X\) are normally distributed for large \(n\).]
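The variance comparison that drives the inequality can be checked by simulation (a sketch; the closed forms \({\rm Var}(Y) = \frac{1}{4(2n+3)}\) and \({\rm Var}(\bar X) = \frac{1}{12(2n+1)}\) shown for comparison are my own working, to be checked against yours, and \(n = 10\) is illustrative):

```python
# Median vs mean of 2n+1 uniforms: the median has the larger variance,
# which is what makes the stated inequality hold for large n.
import numpy as np

rng = np.random.default_rng(0)
n, trials = 10, 400_000
X = rng.random((trials, 2 * n + 1))

print("Var(Y)    ~", np.median(X, axis=1).var(), "vs", 1 / (4 * (2 * n + 3)))
print("Var(Xbar) ~", X.mean(axis=1).var(), "vs", 1 / (12 * (2 * n + 1)))
```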

2000 Paper 3 Q13
D: 1700.0 B: 1516.0

A set of \(n\) dice is rolled repeatedly. For each die the probability of showing a six is \(p\). Show that the probability that the first of the dice to show a six does so on the \(r\)th roll is $$q^{n r } ( q^{-n} - 1 )$$ where \(q = 1 - p\). Determine, and simplify, an expression for the probability generating function for this distribution, in terms of \(q\) and \(n\). The first of the dice to show a six does so on the \(R\)th roll. Find the expected value of \(R\) and show that, in the case \(n = 2\), \(p=1/6\), this value is \(36/11\). Show that the probability that the last of the dice to show a six does so on the \(r\)th roll is \[ \big(1-q^r\big)^n-\big(1-q^{r-1}\big)^n. \] Find, for the case \(n = 2\), the probability generating function. The last of the dice to show a six does so on the \(S\)th roll. Find the expected value of \(S\) and evaluate this when \(p=1/6\).
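The quoted value \(\E(R) = 36/11\) for \(n = 2\), \(p = 1/6\) is easy to confirm by simulation (a sketch; the comparison value \(\E(S) = 96/11\) is my own working, not quoted from the question):

```python
# Each die's first six is geometric with parameter p; R and S are the
# minimum and maximum over the n = 2 dice.
import numpy as np

rng = np.random.default_rng(0)
trials = 1_000_000
rolls = rng.geometric(1 / 6, (trials, 2))

print("E(R) ~", rolls.min(axis=1).mean(), "vs 36/11 =", 36 / 11)
print("E(S) ~", rolls.max(axis=1).mean(), "vs 96/11 =", 96 / 11)
```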

1998 Paper 2 Q13
D: 1600.0 B: 1516.0

A random variable \(X\) has the probability density function \[ \mathrm{f}(x)=\begin{cases} \lambda\mathrm{e}^{-\lambda x} & x\geqslant0,\\ 0 & x<0. \end{cases} \] Show that $${\rm P}(X>s+t\,\vert X>t) = {\rm P}(X>s).$$ The time it takes an assistant to serve a customer in a certain shop is a random variable with the above distribution and the times for different customers are independent. If, when I enter the shop, the only two assistants are serving one customer each, what is the probability that these customers are both still being served at time \(t\) after I arrive? One of the assistants finishes serving his customer and immediately starts serving me. What is the probability that I am still being served when the other customer has finished being served?


Solution: \begin{align*} && \mathbb{P}(X > t) &= \int_t^{\infty} \lambda e^{-\lambda x} \d x\\ &&&= \left[ -e^{-\lambda x} \right]_t^\infty \\ &&&= e^{-\lambda t}\\ \\ && \mathbb{P}(X > s + t | X > t) &= \frac{\mathbb{P}(X > s + t)}{\mathbb{P}(X > t)} \\ &&&= \frac{e^{-(s+t)\lambda}}{e^{-t\lambda}} \\ &&&= e^{-s\lambda} = \mathbb{P}(X > s) \end{align*} The probability both are still being served (independently) at time \(t\) is \(\mathbb{P}(X > t)^2 = e^{-2\lambda t}\). The final probability is exactly \(\frac12\): the property proved in the first part of the question shows the distribution is memoryless, so when I start being served, the other customer's remaining service time has the same distribution as my service time. Therefore we are equally likely to finish first.
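The final claim is a fair race between two exponential clocks, which a simulation makes vivid (a sketch; \(\lambda = 1\) is an illustrative choice):

```python
# By memorylessness, when I start being served the other customer's
# remaining service time is again Exp(lam), independent of my service time.
import numpy as np

rng = np.random.default_rng(0)
lam, trials = 1.0, 1_000_000
mine = rng.exponential(1 / lam, trials)    # my full service time
other = rng.exponential(1 / lam, trials)   # other customer's remaining time

print("P(still being served when they finish) ~", np.mean(mine > other))  # ~1/2
```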

1998 Paper 3 Q14
D: 1700.0 B: 1500.0

A hostile naval power possesses a large, unknown number \(N\) of submarines. Interception of radio signals yields a small number \(n\) of their identification numbers \(X_i\) (\(i=1,2,...,n\)), which are taken to be independent and uniformly distributed over the continuous range from \(0\) to \(N\). Show that \(Z_1\) and \(Z_2\), defined by $$ Z_1 = {n+1\over n} {\max}\{X_1,X_2,...,X_n\} \hspace{0.3in} {\rm and} \hspace{0.3in} Z_2 = {2\over n} \sum_{i=1}^n X_i \;, $$ both have means equal to \(N\). Calculate the variance of \(Z_1\) and of \(Z_2\). Which estimator do you prefer, and why?
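A simulation shows both estimators centred on \(N\) and compares their spreads (a sketch; the closed-form variances \(\frac{N^2}{n(n+2)}\) and \(\frac{N^2}{3n}\) shown for comparison are my own working, and \(N = 100\), \(n = 5\) are illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
N, n, trials = 100.0, 5, 400_000
X = rng.uniform(0, N, (trials, n))

Z1 = (n + 1) / n * X.max(axis=1)   # scaled maximum
Z2 = 2 / n * X.sum(axis=1)         # twice the sample mean

print("E(Z1) ~", Z1.mean(), " Var(Z1) ~", Z1.var(), "vs", N**2 / (n * (n + 2)))
print("E(Z2) ~", Z2.mean(), " Var(Z2) ~", Z2.var(), "vs", N**2 / (3 * n))
```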

1996 Paper 1 Q12
D: 1484.0 B: 1485.4

An examiner has to assign a mark between 1 and \(m\) inclusive to each of \(n\) examination scripts (\(n\leqslant m\)). He does this randomly, but never assigns the same mark twice. If \(K\) is the highest mark that he assigns, explain why \[ \mathrm{P}(K=k)=\left.\binom{k-1}{n-1}\right/\binom{m}{n} \] for \(n\leqslant k\leqslant m,\) and deduce that \[ \sum_{k=n}^{m}\binom{k-1}{n-1}=\binom{m}{n}\,. \] Find the expected value of \(K\).


Solution: If the highest mark is \(k\), then there are \(n-1\) remaining marks to give, and they have to be chosen from the numbers \(1, 2, \ldots, k-1\), ie in \(\binom{k-1}{n-1}\) ways. The \(n\) marks themselves form a randomly chosen \(n\)-subset of \(\{1, 2, \ldots, m\}\), and all \(\binom{m}{n}\) such subsets are equally likely, therefore \(\displaystyle \mathbb{P}(K=k) = \left.\binom{k-1}{n-1} \right/ \binom{m}{n}\) Since \(K\) can take any of the values \(n, \ldots, m\), we must have \begin{align*} && 1 &= \sum_{k=n}^m \mathbb{P}(K=k) \\ &&&= \sum_{k=n}^m \left.\binom{k-1}{n-1} \right/ \binom{m}{n} \\ \Rightarrow && \binom{m}{n} &= \sum_{k=n}^m \binom{k-1}{n-1} \\ \\ && \mathbb{E}(K) &= \sum_{k=n}^m k \cdot \mathbb{P}(K=k) \\ &&&= \sum_{k=n}^m k \cdot \left.\binom{k-1}{n-1} \right/ \binom{m}{n} \\ &&&= n\binom{m}{n}^{-1} \sum_{k=n}^m \frac{k}{n} \cdot \binom{k-1}{n-1} \\ &&&= n\binom{m}{n}^{-1} \sum_{k=n}^m \binom{k}{n} \\ &&&= n\binom{m}{n}^{-1} \sum_{k=n+1}^{m+1} \binom{k-1}{n+1-1} \\ &&&= n\binom{m}{n}^{-1} \binom{m+1}{n+1} \\ &&&= n \cdot \frac{m+1}{n+1} \end{align*}
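The closing formula \(\mathbb{E}(K) = \frac{n(m+1)}{n+1}\) can be verified by brute-force enumeration (a sketch; \(m = 10\), \(n = 4\) are illustrative choices):

```python
# Average the maximum over all equally likely n-subsets of {1, ..., m}.
from itertools import combinations

m, n = 10, 4
subsets = list(combinations(range(1, m + 1), n))
EK = sum(max(s) for s in subsets) / len(subsets)

print("enumerated E(K):", EK)                     # 8.8
print("formula    E(K):", n * (m + 1) / (n + 1))  # 4 * 11 / 5 = 8.8
```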