2000 Paper 3 Q14

Year: 2000
Paper: 3
Question Number: 14

Course: UFM Statistics
Section: Bivariate data

Difficulty: 1700.0 Banger: 1500.0

probability joint distribution independence expectation covariance discrete random variables counterexample bivariate data

Problem

The random variable $X$ takes only the values $x_1$ and $x_2$ (where $ x_1 \not= x_2 $), and the random variable $Y$ takes only the values $y_1$ and $y_2$ (where $y_1 \not= y_2$). Their joint distribution is given by $$ \P ( X = x_1 , Y = y_1 ) = a \ ; \ \ \P ( X = x_1 , Y = y_2 ) = q - a \ ; \ \ \P ( X = x_2 , Y = y_1 ) = p - a \ . $$ Show that if $\E(X Y) = \E(X)\E(Y)$ then $$ (a - p q ) ( x_1 - x_2 ) ( y_1 - y_2 ) = 0 . $$ Hence show that two random variables each taking only two distinct values are independent if $\E(X Y) = \E(X) \E(Y)$. Give a joint distribution for two random variables $A$ and $B$, each taking the three values $- 1$, $0$ and $1$ with probability ${1 \over 3}$, which have $\E(A B) = \E( A)\E (B)$, but which are not independent.

Solution

\begin{align*} \mathbb{P}(X = x_1) &= a + q - a = q \\ \mathbb{P}(X = x_2) &= 1 - q \\ \mathbb{P}(Y = y_1) & = a + p - a = p \\ \mathbb{P}(Y = y_2) & = 1 - p \end{align*} \begin{align*} \mathbb{E}(X)\mathbb{E}(Y) &= \l qx_1 + (1-q)x_2 \r \l p y_1 + (1-p)y_2\r \\ &= qpx_1y_1 + q(1-p)x_1y_2 + (1-q)px_2y_1 + (1-q)(1-p)x_2y_2 \\ \mathbb{E}(XY) &= ax_1y_1 + (q-a)x_1y_2 + (p-a)x_2y_1 + (1 + a - p - q)x_2y_2 &= \end{align*} Therefore $\mathbb{E}(XY) - \mathbb{E}(X)\mathbb{E}(Y)$ is a degree 2 polynomial in the $x_i, y_i$. If $x_1 = x_2$ then we have: \begin{align*} \mathbb{E}(X)\mathbb{E}(Y) &=x_1 \l p y_1 + (1-p)y_2\r \\ \mathbb{E}(XY) &= x_1(ay_1 + (q-a)y_2 + (p-a)y_1 + (1 + a - p - q)y_2) \\ &= x_1 (py_1 + (1-p)y_2) \end{align*} Therefore $x_1 - x_2$ is a root and by symmetry $y_1 - y_2$ is a root. Therefore it remains to check the coefficient of $x_1y_1$ which is $a - pq$ to complete the factorisation. For any two random variables taking two distinct values, we can find $a, q, p$ satisfying the relations above. We also note that $X$ and $Y$ are independent if $\mathbb{P}(X = x_i, Y = y_i) = \mathbb{P}(X = x_i)\mathbb{P}(Y = y_i)$. Since $x_1 \neq x_2$ and $y_1 \neq y_2$ and $\E(A B) = \E( A)\E (B) \Rightarrow a = pq$. But if $a = pq$, we have $\mathbb{P}(X = x_1, Y = y_1) = \mathbb{P}(X = x_1)\mathbb{P}(Y = y_1)$ and all the other relations drop out similarly. Consider \begin{align*} \mathbb{P}(A = -1, B = 1) &= \frac{1}{6} \\ \mathbb{P}(A = -1, B = -1) &= \frac{1}{6} \\ \mathbb{P}(A = 0, B = 0) &= \frac{1}{3} \\ \mathbb{P}(A = 1, B = -1) &= \frac{1}{6} \\ \mathbb{P}(A = -1, B = -1) &= \frac{1}{6} \end{align*}

Rating Information

Difficulty Rating: 1700.0

Difficulty Comparisons: 0

Banger Rating: 1500.0

Banger Comparisons: 0

Show LaTeX source

Problem source

The random variable $X$ takes  only the values $x_1$ and $x_2$  (where $ x_1 \not= x_2 $),  and the random variable $Y$ takes only  the values $y_1$ and $y_2$  (where $y_1 \not= y_2$). 
Their joint distribution is given by  
$$ 
\P ( X = x_1 , Y = y_1 ) = a \ ; \ \  
\P ( X = x_1 , Y = y_2 ) = q - a \ ; \ \  
\P ( X = x_2 , Y = y_1 ) = p - a \ . 
$$  
Show that if $\E(X Y) = \E(X)\E(Y)$ then  
$$  
(a - p q ) ( x_1 - x_2 ) ( y_1 - y_2 ) = 0 . 
$$ 
Hence show that two random variables each taking only two distinct values are independent if  
$\E(X Y) = \E(X) \E(Y)$. 
 
Give a  joint distribution for two random variables  $A$ and $B$, each taking the three values $- 1$, $0$ and $1$  with probability ${1 \over 3}$,  which have $\E(A B) = \E( A)\E (B)$, but which are not independent.

Solution source

\begin{align*}
\mathbb{P}(X = x_1) &= a + q - a = q \\
\mathbb{P}(X = x_2) &= 1 - q \\
\mathbb{P}(Y = y_1) & = a + p - a = p \\
\mathbb{P}(Y = y_2) & = 1 - p
\end{align*}

\begin{align*}
\mathbb{E}(X)\mathbb{E}(Y) &= \l qx_1 + (1-q)x_2 \r \l p y_1 + (1-p)y_2\r  \\
&= qpx_1y_1 + q(1-p)x_1y_2 + (1-q)px_2y_1 + (1-q)(1-p)x_2y_2 \\
\mathbb{E}(XY) &= ax_1y_1 + (q-a)x_1y_2 + (p-a)x_2y_1 + (1 + a - p - q)x_2y_2 &= 
\end{align*}

Therefore $\mathbb{E}(XY) - \mathbb{E}(X)\mathbb{E}(Y)$ is a degree 2 polynomial in the $x_i, y_i$. 

If $x_1 = x_2$ then we have:

\begin{align*}
\mathbb{E}(X)\mathbb{E}(Y) &=x_1 \l p y_1 + (1-p)y_2\r  \\
\mathbb{E}(XY) &= x_1(ay_1 + (q-a)y_2 + (p-a)y_1 + (1 + a - p - q)y_2) \\
&= x_1 (py_1 + (1-p)y_2)
\end{align*}

Therefore $x_1 - x_2$ is a root and by symmetry $y_1 - y_2$ is a root. Therefore it remains to check the coefficient of $x_1y_1$ which is $a - pq$ to complete the factorisation.

For any two random variables taking two distinct values, we can find $a, q, p$ satisfying the relations above. We also note that $X$ and $Y$ are independent if $\mathbb{P}(X = x_i, Y = y_i) = \mathbb{P}(X = x_i)\mathbb{P}(Y = y_i)$. Since $x_1 \neq x_2$ and $y_1 \neq y_2$ and $\E(A B) = \E( A)\E (B) \Rightarrow a = pq$. But if $a = pq$, we have  $\mathbb{P}(X = x_1, Y = y_1) = \mathbb{P}(X = x_1)\mathbb{P}(Y = y_1)$ and all the other relations drop out similarly.

Consider
\begin{align*}
\mathbb{P}(A = -1, B = 1) &= \frac{1}{6} \\
\mathbb{P}(A = -1, B = -1) &= \frac{1}{6} \\
\mathbb{P}(A = 0, B = 0) &= \frac{1}{3} \\
\mathbb{P}(A = 1, B = -1) &= \frac{1}{6} \\
\mathbb{P}(A = -1, B = -1) &= \frac{1}{6}
\end{align*}

Back to List