1995 Paper 1 Q14

Year: 1995
Paper: 1
Question Number: 14

Course: LFM Stats And Pure
Section: Approximating Binomial to Normal Distribution

Difficulty: 1516.0 Banger: 1531.3

Problem

  1. Find the maximum value of \(\sqrt{p(1-p)}\) as \(p\) varies between \(0\) and \(1\).
  2. Suppose that a proportion \(p\) of the population is female. In order to estimate \(p\) we pick a sample of \(n\) people at random and find the proportion of them who are female. Find the value of \(n\) which ensures that the chance of our estimate of \(p\) being more than \(0.01\) in error is less than 1\%.
  3. Discuss how the required value of \(n\) would be affected if (a) \(p\) were the proportion of people in the population who are left-handed; (b) \(p\) were the proportion of people in the population who are millionaires.

Solution

  1. \(\,\) \begin{align*} && \sqrt{p(1-p)} &= \sqrt{p-p^2} \\ &&&= \sqrt{\tfrac14-(\tfrac12-p)^2} \\ &&&\leq \sqrt{\tfrac14} = \tfrac12 \end{align*} Therefore the maximum is \(\tfrac12\) when \(p=\frac12\)
  2. Notice that our estimate \(\hat{p}\) will (for large \(n\)) be follow a normal distribution \(N(p, pq/n)\) by either the normal approximation to the binomial or central limit theorem. We would like \(0.01 > \mathbb{P}\left ( |\hat{p}-p| < 0.01 \right)\) or in other words \begin{align*} && 0.01 &> \mathbb{P}\left ( |\hat{p}-p| > 0.01 \right) \\ &&&=\mathbb{P}\left ( |\sqrt{\frac{pq}{n}}Z+p-p| > 0.01 \right) \\ &&&= \mathbb{P} \left (|Z|>\frac{0.01\sqrt{n}}{\sqrt{pq}}\right) \end{align*} therefore we need \(\frac{0.01\sqrt{n}}{\sqrt{pq}}> 2.58 \Rightarrow n > 258^2 pq \approx 2^{14} \approx 16\,000\), where we are using \(pq = \frac14\) as the worst case possibility and \(258 \approx 256 = 2^8\)
  3. If we were looking at when we are looking at left handed people (maybe ~\(10\%\), we would be looking at \(pq = \frac{9}{100}\) so we need a smaller sample). If we are looking at millionaires (an even smaller again percentage), we would need an even smaller sample. This is surprising since you would expect you would need a larger sample to accurately gauge smaller proportions. However, this surprise can be resolved by considering that this is an absolute error. For smaller values the relative error is larger, but the absolute error is smaller.
Rating Information

Difficulty Rating: 1516.0

Difficulty Comparisons: 1

Banger Rating: 1531.3

Banger Comparisons: 2

Show LaTeX source
Problem source
\begin{questionparts}
\item Find the maximum value of $\sqrt{p(1-p)}$ as $p$ varies between $0$ and $1$. 
\item Suppose that a proportion $p$ of the population is female. In order to estimate $p$ we pick a sample of $n$ people at random and find the proportion of them who are female. Find the value of $n$ which ensures that the chance of our estimate of $p$ being more than $0.01$ in error is less than 1\%. 
\item Discuss how the required value of $n$ would be affected if 
(a) $p$ were the proportion of people in the population who are left-handed;
(b) $p$ were the proportion of people in the population who are millionaires. 
\end{questionparts}
Solution source
\begin{questionparts}
\item $\,$ \begin{align*}
&& \sqrt{p(1-p)} &= \sqrt{p-p^2} \\
&&&= \sqrt{\tfrac14-(\tfrac12-p)^2} \\
&&&\leq \sqrt{\tfrac14} = \tfrac12
\end{align*} Therefore the maximum is $\tfrac12$ when $p=\frac12$

\item Notice that our estimate $\hat{p}$ will (for large $n$) be follow a normal distribution $N(p, pq/n)$ by either the normal approximation to the binomial or central limit theorem.

We would like $0.01 > \mathbb{P}\left ( |\hat{p}-p| < 0.01 \right)$ or in other words

\begin{align*}
&& 0.01 &> \mathbb{P}\left ( |\hat{p}-p| > 0.01 \right) \\
&&&=\mathbb{P}\left ( |\sqrt{\frac{pq}{n}}Z+p-p| > 0.01 \right) \\
&&&= \mathbb{P} \left (|Z|>\frac{0.01\sqrt{n}}{\sqrt{pq}}\right) 
\end{align*} therefore we need $\frac{0.01\sqrt{n}}{\sqrt{pq}}> 2.58 \Rightarrow n > 258^2 pq \approx 2^{14} \approx 16\,000$, where we are using $pq = \frac14$ as the worst case possibility and $258 \approx 256 = 2^8$

\item If we were looking at when we are looking at left handed people (maybe ~$10\%$, we would be looking at $pq = \frac{9}{100}$ so we need a smaller sample). If we are looking at millionaires (an even smaller again percentage), we would need an even smaller sample. This is surprising since you would expect you would need a larger sample to accurately gauge smaller proportions. However, this surprise can be resolved by considering that this is an absolute error. For smaller values the relative error is larger, but the absolute error is smaller.

\end{questionparts}