Duplicate Detection, Record Linkage, and Identity Uncertainty: Datasets

The following datasets have been kindly provided for evaluating duplicate detection, record linkage, and identity uncertainty systems. Several of these are not yet available for downloading; please contact the authors.

If you can contribute other labeled datasets for this problem, please send them over - this would be greatly appreciated by fellow researchers!

Back to RIDDLE homepage

Last modified: August 25, 2003