Click here to monitor SSC
SQLServerCentral is supported by Redgate
 
Log in  ::  Register  ::  Not logged in
 
 
 


SOUNDEX


SOUNDEX

Author
Message
steve.jacobs
steve.jacobs
SSChasing Mays
SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)

Group: General Forum Members
Points: 648 Visits: 279
Comments posted to this topic are about the item SOUNDEX
Koen Verbeeck
Koen Verbeeck
SSCoach
SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)SSCoach (19K reputation)

Group: General Forum Members
Points: 19487 Visits: 13250
Interesting question, thanks. Keep it up!


How to post forum questions.
Need an answer? No, you need a question.
What’s the deal with Excel & SSIS?
My blog at SQLKover.

MCSE Business Intelligence - Microsoft Data Platform MVP
Stewart "Arturius" Campbell
Stewart "Arturius" Campbell
SSCertifiable
SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)SSCertifiable (6.6K reputation)

Group: General Forum Members
Points: 6552 Visits: 7195
Koen Verbeeck (11/26/2013)
Interesting question, thanks. Keep it up!

+1
thanks...

____________________________________________
Space, the final frontier? not any more...
All limits henceforth are self-imposed.
“libera tute vulgaris ex”
honza.mf
honza.mf
SSCommitted
SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)SSCommitted (1.8K reputation)

Group: General Forum Members
Points: 1809 Visits: 1323
Great troubles are with slavic languages, especially with transcriptions from cyrilic to latin alphabet.



See, understand, learn, try, use efficient
© Dr.Plch
Ed Wagner
Ed Wagner
SSChampion
SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)

Group: General Forum Members
Points: 11803 Visits: 9795
That was a very interesting question on a topic I know almost nothing about. Thank you.


Tally Tables - Performance Personified
String Splitting with True Performance
Best practices on how to ask questions
Rune Bivrin
Rune Bivrin
Hall of Fame
Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)Hall of Fame (3.8K reputation)

Group: General Forum Members
Points: 3779 Visits: 1486
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.


Just because you're right doesn't mean everybody else is wrong.
steve.jacobs
steve.jacobs
SSChasing Mays
SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)SSChasing Mays (648 reputation)

Group: General Forum Members
Points: 648 Visits: 279
Rune Bivrin (11/26/2013)
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.


Rune, I could not agree more;-). Most people (business) ask for "fuzzy" lookups, what they are asking for is more for an "exact" fuzzy lookup. I cannot tell you how many times I have had to explain the differences between the two. Heck, I may as well use LIKE when performing my searches. This is why I use a true programming language and the steps (some of them) I outlined in my answer.
TomThomson
TomThomson
SSChampion
SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)

Group: General Forum Members
Points: 11541 Visits: 12098
Good question, and nice explanation.

Too many people still fall for the claims that Soundex is a phonetic algorithm for indexing by sound and that it's based on American English pronunciation, despite its being extremely flawed for that purpose nothing really based on AmEng pronunciation would return different codes for "sealing" and "ceiling" while delivering the same code for "ceiling" and "killing". That it was originally intened only to be used for names does mean that it's less erratic when used as originally intended, but we still have "Gene" and "Jean" getting different codes, while and "Jean" and "Jane" get the same code as each other, as do "Gene" and "Gawain". It seems quite clear that there was very little attention paid to phonetics in the design of this stuff.

So it's nice to see a QotD that illustrates some of Soundex's failings.

Tom

TomThomson
TomThomson
SSChampion
SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)SSChampion (11K reputation)

Group: General Forum Members
Points: 11541 Visits: 12098
Rune Bivrin (11/26/2013)
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.

Does that mean you think it's useful in English-speaking countries?

As long as 70 years ago the US census bureau found Soundex was useless for analysing their early (first 30 years) census records and changed the algorithm; now the US government publishes its own encoding rules, which may be the same as the original Soundex rules (but I doubt it). There are rather a lot of better phonetic fuzzy encoding algorithms, and the name is widely misused: most things that are called Soundex are not actually Soundex.

I don't know whether the Soundex in SQL Server is the original Soundex or not; if not it still shares most of the failings of the original - enough to make it not useful unless augmented by other matching and distinguishing techniques even for names in English-speaking countries.

Tom

Thomas Abraham
Thomas Abraham
SSCrazy
SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)SSCrazy (2.5K reputation)

Group: General Forum Members
Points: 2459 Visits: 2255
Thanks for the interesting question. I got caught out with "Phone". Don't really use this kind of thing in my work right now. But, as with many QotD topics, it keeps me looking at things I don't normally see, in case I ever need them.

Please don't go. The drones need you. They look up to you.
Connect to me on LinkedIn
Go


Permissions

You can't post new topics.
You can't post topic replies.
You can't post new polls.
You can't post replies to polls.
You can't edit your own topics.
You can't delete your own topics.
You can't edit other topics.
You can't delete other topics.
You can't edit your own posts.
You can't edit other posts.
You can't delete your own posts.
You can't delete other posts.
You can't post events.
You can't edit your own events.
You can't edit other events.
You can't delete your own events.
You can't delete other events.
You can't send private messages.
You can't send emails.
You can read topics.
You can't vote in polls.
You can't upload attachments.
You can download attachments.
You can't post HTML code.
You can't edit HTML code.
You can't post IFCode.
You can't post JavaScript.
You can post emoticons.
You can't post or upload images.

Select a forum

































































































































































SQLServerCentral


Search