I have 802K records in a dataset and a reference table. I have two fields: schoolcode and schoolname. I have a reference table with schoolcode and schoolname in it with some other data I need in my original table.
The trouble is, they just don't match up exactly in all cases. Some of the differences are predictable while others aren't.
I ran fuzzy lookup and sent the results to another table to look at how they match up.
There doesn't appear to be a similarity threshold that would automatically just get me the dataset with the automatic matches.
I created a view with all of the distinct code and name matches. I'd like to add a column such as 'isAccpeted' or something like that. This way, I can look over each distinct match and put a 1 for the ones I want and a 0 for the one's I don't.
What I'm not sure about, however, is how to send those changes BACK to the original table. Because my intermediate view only shows distinct values, I would only be marking each match, not each row.
What kind of thing would I need to do to send those changes back to the original table? I've thought of perhaps using the Merge task or maybe the Join task. Any ideas?