Probabalistic Data Matching – How mathematics helps find duplicates in data