Year: 2000
Paper: 3
Question Number: 14
Course: UFM Statistics
Section: Bivariate data
Difficulty Rating: 1700.0
Difficulty Comparisons: 0
Banger Rating: 1500.0
Banger Comparisons: 0
The random variable $X$ takes only the values $x_1$ and $x_2$ (where $ x_1 \not= x_2 $), and the random variable $Y$ takes only the values $y_1$ and $y_2$ (where $y_1 \not= y_2$).
Their joint distribution is given by
$$
\P ( X = x_1 , Y = y_1 ) = a \ ; \ \
\P ( X = x_1 , Y = y_2 ) = q - a \ ; \ \
\P ( X = x_2 , Y = y_1 ) = p - a \ .
$$
Show that if $\E(X Y) = \E(X)\E(Y)$ then
$$
(a - p q ) ( x_1 - x_2 ) ( y_1 - y_2 ) = 0 .
$$
Hence show that two random variables each taking only two distinct values are independent if
$\E(X Y) = \E(X) \E(Y)$.
Give a joint distribution for two random variables $A$ and $B$, each taking the three values $- 1$, $0$ and $1$ with probability ${1 \over 3}$, which have $\E(A B) = \E( A)\E (B)$, but which are not independent.
\begin{align*}
\mathbb{P}(X = x_1) &= a + q - a = q \\
\mathbb{P}(X = x_2) &= 1 - q \\
\mathbb{P}(Y = y_1) & = a + p - a = p \\
\mathbb{P}(Y = y_2) & = 1 - p
\end{align*}
\begin{align*}
\mathbb{E}(X)\mathbb{E}(Y) &= \l qx_1 + (1-q)x_2 \r \l p y_1 + (1-p)y_2\r \\
&= qpx_1y_1 + q(1-p)x_1y_2 + (1-q)px_2y_1 + (1-q)(1-p)x_2y_2 \\
\mathbb{E}(XY) &= ax_1y_1 + (q-a)x_1y_2 + (p-a)x_2y_1 + (1 + a - p - q)x_2y_2 &=
\end{align*}
Therefore $\mathbb{E}(XY) - \mathbb{E}(X)\mathbb{E}(Y)$ is a degree 2 polynomial in the $x_i, y_i$.
If $x_1 = x_2$ then we have:
\begin{align*}
\mathbb{E}(X)\mathbb{E}(Y) &=x_1 \l p y_1 + (1-p)y_2\r \\
\mathbb{E}(XY) &= x_1(ay_1 + (q-a)y_2 + (p-a)y_1 + (1 + a - p - q)y_2) \\
&= x_1 (py_1 + (1-p)y_2)
\end{align*}
Therefore $x_1 - x_2$ is a root and by symmetry $y_1 - y_2$ is a root. Therefore it remains to check the coefficient of $x_1y_1$ which is $a - pq$ to complete the factorisation.
For any two random variables taking two distinct values, we can find $a, q, p$ satisfying the relations above. We also note that $X$ and $Y$ are independent if $\mathbb{P}(X = x_i, Y = y_i) = \mathbb{P}(X = x_i)\mathbb{P}(Y = y_i)$. Since $x_1 \neq x_2$ and $y_1 \neq y_2$ and $\E(A B) = \E( A)\E (B) \Rightarrow a = pq$. But if $a = pq$, we have $\mathbb{P}(X = x_1, Y = y_1) = \mathbb{P}(X = x_1)\mathbb{P}(Y = y_1)$ and all the other relations drop out similarly.
Consider
\begin{align*}
\mathbb{P}(A = -1, B = 1) &= \frac{1}{6} \\
\mathbb{P}(A = -1, B = -1) &= \frac{1}{6} \\
\mathbb{P}(A = 0, B = 0) &= \frac{1}{3} \\
\mathbb{P}(A = 1, B = -1) &= \frac{1}{6} \\
\mathbb{P}(A = -1, B = -1) &= \frac{1}{6}
\end{align*}