How Search Engines Determine Duplicate Content