Variation of information
The variation of information (VI) is a measure of the distance between two clusterings (partitions of elements).

Definition

Suppose we have two clusterings (divisions of a set into several subsets) X and Y, where X = {X_1, ..., X_k}, p_i = |X_i| / n, and n = Σ_i |X_i|. Then the variation of information between the two clusterings is:

VI(X; Y) = H(X) + H(Y) - 2 I(X, Y)

where H(X) is the entropy of X and I(X, Y) is the mutual information between X and Y.
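
As an illustration, the definition can be evaluated directly from cluster labels. The following is a minimal sketch, not a reference implementation. It assumes each clustering is given as a list of labels, one per element and in the same element order, and that natural logarithms are used, so the result is in nats. The function name variation_of_information and this label-list representation are choices made for the example. The entropies and the mutual information are estimated from label counts and combined as H(X) + H(Y) - 2 I(X, Y).

```python
from collections import Counter
from math import log


def variation_of_information(labels_x, labels_y):
    """Return H(X) + H(Y) - 2 I(X, Y) in nats for two label lists.

    labels_x[i] and labels_y[i] are the clusters that element i belongs to
    in the first and second clustering (an assumed input format).
    """
    if len(labels_x) != len(labels_y):
        raise ValueError("both clusterings must label the same elements")
    n = len(labels_x)

    # Cluster sizes |X_i|, |Y_j| and overlap sizes |X_i ∩ Y_j|.
    count_x = Counter(labels_x)
    count_y = Counter(labels_y)
    count_xy = Counter(zip(labels_x, labels_y))

    # Entropy of each clustering, with p_i = |X_i| / n.
    h_x = -sum(c / n * log(c / n) for c in count_x.values())
    h_y = -sum(c / n * log(c / n) for c in count_y.values())

    # Mutual information: sum of r_ij * log(r_ij / (p_i * q_j)),
    # where r_ij = |X_i ∩ Y_j| / n. Empty overlaps contribute nothing.
    mi = sum(
        c / n * log(c * n / (count_x[a] * count_y[b]))
        for (a, b), c in count_xy.items()
    )
    return h_x + h_y - 2 * mi
```

Two clusterings that differ only in how the clusters are named give a value of zero, while two independent balanced two-way splits of four elements give 2 log 2, the sum of their entropies.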

This is completely equivalent to the shared information distance.
The source of this article is Wikipedia, the free encyclopedia. The text of this article is licensed under the GFDL.
 