Return to Article Details Duplicate and near-duplicate documents in the web: detection by means of fuzzy-hash techniques Download Download PDF