• For fuzzy matching tools, I've come across DataMatch by Data Ladder, which is an excellent fuzzy matching and record linkage tool used across business and would work really well for this situation. They offer a complimentary trial[/url] for new users.

    In fact, an independent verified evaluation was done of the software comparing it to major software tools by IBM and SAS. There was a study done at Curtin University Centre for Data Linkage in Australia that simulated the matching of 4.4 Million records. It identified what providers had in terms of accuracy (Number of matches found vs available. Number of false matches)

    1.DataMatch Enterprise, Highest Accuracy (>95%), Very Fast, Low Cost

    2.IBM Quality Stage , high accuracy (>90%), Very Fast, High Cost (>$100K)

    3.SAS Data Flux, Medium Accuracy (>85%), Fast, High Cost (>100K)