Efficient Deduplication using Hadoop
Download paper