Thanks for reading them. It would have been nice to see your longer answer.
Microsoft has (or had?) a research group on data cleaning, https://www.microsoft.com/en-us/research/project/data-cleaning/
which resulted in the fuzzy-matching function in SSIS; it is also available as an add-in for Excel. If you look at the group's research papers, they reference the papers I mentioned in my post.
The SSIS function actually works quite well. I used it in a project to match customers by name and address, and it gave good results when used iteratively and interactively.
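
To give a rough idea of what matching customers on name and address looks like, here is a minimal sketch in Python. It is not the SSIS algorithm: it uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure, and the record layout, the `normalize` and `best_match` helpers, and the 0.8 threshold are all assumptions made up for illustration.

```python
from difflib import SequenceMatcher


def normalize(text):
    # Lowercase, drop commas and periods, and collapse whitespace.
    return " ".join(text.lower().replace(",", " ").replace(".", " ").split())


def similarity(a, b):
    # Ratio in [0, 1]; 1.0 means the normalized strings are identical.
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_match(record, candidates, threshold=0.8):
    # Compare on name and address together (hypothetical record layout).
    key = record["name"] + " " + record["address"]
    best, best_score = None, 0.0
    for cand in candidates:
        score = similarity(key, cand["name"] + " " + cand["address"])
        if score > best_score:
            best, best_score = cand, score
    # Below the threshold, report no match and leave it for manual review.
    if best_score >= threshold:
        return best, best_score
    return None, best_score


reference = [
    {"id": 1, "name": "John Smith", "address": "12 Main Street"},
    {"id": 2, "name": "Mary Jones", "address": "4 Oak Ave"},
]
incoming = {"name": "Jon Smith", "address": "12 Main St."}

match, score = best_match(incoming, reference)
print(match, round(score, 2))
```

The iterative part is then mostly a matter of reviewing the rejected and borderline pairs by hand, adjusting the threshold or the normalization, and running it again.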