Next: Missing value prediction based
Up: Methods for Predicting Movie
Previous: Methods for Predicting Movie
In [2],[4], it has been shown that the missing value prediction problem can be formulated as a weighted matrix approximation problem where the weights are 1's for known values and 0's for unknown ones, then we can use co-clustering for finding the best matrix approximation based on some criteria. The assumption is that the original matrix has a low parameter structure involving properties of row and column clusters. By minimizing a desired loss function, co-clustering can find that low parameter structure and this structure is used for reconstructing the approximate matrix. Let
be the random variable that takes values in the matrix,
and
be discrete random variables whose values are the row and column indices respectively, and
and
be discrete random variables which takes values on the row cluster and column cluster indices. Then [2] shows that, for a given co-clustering, there are only six distinct sets of summary statistics or co-clustering bases which can be used for approximating the matrix
:
We also call these co-clustering bases as schemes. Moreover, using the Minimum Bregman Information (MBI), we can find the best approximation matrix
for a given co-clustering, a given co-clustering basis, and a given Bregman divergence. Table 1 and 2 present the best approximation solutions of each basis for squared Euclidean distance and I-divergence, where the expectation in those formulae are interpreted as follows:
: the average value of the entire matrix
and
: row averages and column averages respectively
-
and
: row cluster averages and column cluster averages respectively
-
and
: row column cluster averages and column row cluster averages respectively. In other words, they are the average of each row in a column cluster and the average of each column in a row cluster.
-
: co-cluster averages
Table 1:
Best matrix approximation for squared Euclidean distance
Coclustering basis  |
Approximation matrix  |
 |
+
- ![$ E[Z]$](img8.png) |
 |
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
 |
-
+
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
 |
-
+
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
 |
-
+ -
![$ E[Z\vert\hat{V}]$](img12.png) |
| |
+
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
 |
+
-
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
Table 2:
Best matrix approximation I-divergence
Coclustering basis  |
Approximation matrix  |
 |
![$ \frac{E[Z\vert\hat{U}] \times E[Z\vert\hat{V}]}{E[Z]}$](img23.png) |
 |
![$ E[Z\vert\hat{U},\hat{V}]$](img15.png) |
 |
![$ \frac{E[Z\vert U] \times E[Z\vert\hat{U},\hat{V}]}{E[Z\vert\hat{U}]}$](img24.png) |
 |
![$ \frac{E[Z\vert V] \times E[Z\vert\hat{U},\hat{V}]}{E[Z\vert\hat{V}]}$](img25.png) |
 |
![$ \frac{E[Z\vert U] \times E[Z\vert V] \times E[Z\vert\hat{U},\hat{V}]}{E[Z\vert\hat{U}] \times E[Z\vert\hat{V}]}$](img26.png) |
 |
![$ \frac{E[Z\vert U,\hat{V}] \times E[Z\vert\hat{U},V]}{E[Z\vert\hat{U},\hat{V}]}$](img27.png) |
Next: Missing value prediction based
Up: Methods for Predicting Movie
Previous: Methods for Predicting Movie
Tuyen Huynh
2007-05-09