SQL Clone
SQLServerCentral is supported by Redgate
 
Log in  ::  Register  ::  Not logged in
 
 
 


SOUNDEX


SOUNDEX

Author
Message
steve.jacobs
steve.jacobs
Ten Centuries
Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)

Group: General Forum Members
Points: 1076 Visits: 279
Comments posted to this topic are about the item SOUNDEX
Koen Verbeeck
Koen Verbeeck
SSC Guru
SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)

Group: General Forum Members
Points: 111602 Visits: 13338
Interesting question, thanks. Keep it up!


How to post forum questions.
Need an answer? No, you need a question.
What’s the deal with Excel & SSIS?
My blog at SQLKover.

MCSE Business Intelligence - Microsoft Data Platform MVP
Stewart "Arturius" Campbell
Stewart "Arturius" Campbell
One Orange Chip
One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)One Orange Chip (27K reputation)

Group: General Forum Members
Points: 27445 Visits: 7549
Koen Verbeeck (11/26/2013)
Interesting question, thanks. Keep it up!

+1
thanks...

____________________________________________
Space, the final frontier? not any more...
All limits henceforth are self-imposed.
“libera tute vulgaris ex”
honza.mf
honza.mf
Hall of Fame
Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)Hall of Fame (3.1K reputation)

Group: General Forum Members
Points: 3141 Visits: 1323
Great troubles are with slavic languages, especially with transcriptions from cyrilic to latin alphabet.



See, understand, learn, try, use efficient
© Dr.Plch
Ed Wagner
Ed Wagner
SSC Guru
SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)SSC Guru (111K reputation)

Group: General Forum Members
Points: 111445 Visits: 11263
That was a very interesting question on a topic I know almost nothing about. Thank you.


Tally Tables - Performance Personified
String Splitting with True Performance
Best practices on how to ask questions
Rune Bivrin
Rune Bivrin
SSCarpal Tunnel
SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)SSCarpal Tunnel (4.7K reputation)

Group: General Forum Members
Points: 4708 Visits: 1601
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.


Just because you're right doesn't mean everybody else is wrong.
steve.jacobs
steve.jacobs
Ten Centuries
Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)Ten Centuries (1.1K reputation)

Group: General Forum Members
Points: 1076 Visits: 279
Rune Bivrin (11/26/2013)
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.


Rune, I could not agree more;-). Most people (business) ask for "fuzzy" lookups, what they are asking for is more for an "exact" fuzzy lookup. I cannot tell you how many times I have had to explain the differences between the two. Heck, I may as well use LIKE when performing my searches. This is why I use a true programming language and the steps (some of them) I outlined in my answer.
Tom Thomson
Tom Thomson
SSC-Dedicated
SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)

Group: General Forum Members
Points: 39605 Visits: 12889
Good question, and nice explanation.

Too many people still fall for the claims that Soundex is a phonetic algorithm for indexing by sound and that it's based on American English pronunciation, despite its being extremely flawed for that purpose nothing really based on AmEng pronunciation would return different codes for "sealing" and "ceiling" while delivering the same code for "ceiling" and "killing". That it was originally intened only to be used for names does mean that it's less erratic when used as originally intended, but we still have "Gene" and "Jean" getting different codes, while and "Jean" and "Jane" get the same code as each other, as do "Gene" and "Gawain". It seems quite clear that there was very little attention paid to phonetics in the design of this stuff.

So it's nice to see a QotD that illustrates some of Soundex's failings.

Tom

Tom Thomson
Tom Thomson
SSC-Dedicated
SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)SSC-Dedicated (39K reputation)

Group: General Forum Members
Points: 39605 Visits: 12889
Rune Bivrin (11/26/2013)
If SOUNDEX() as implemented in SQL Server was remotely useful outside of English-speaking countries I might care one way or the other. As it stands it's one of the quaint features of the language.

Does that mean you think it's useful in English-speaking countries?

As long as 70 years ago the US census bureau found Soundex was useless for analysing their early (first 30 years) census records and changed the algorithm; now the US government publishes its own encoding rules, which may be the same as the original Soundex rules (but I doubt it). There are rather a lot of better phonetic fuzzy encoding algorithms, and the name is widely misused: most things that are called Soundex are not actually Soundex.

I don't know whether the Soundex in SQL Server is the original Soundex or not; if not it still shares most of the failings of the original - enough to make it not useful unless augmented by other matching and distinguishing techniques even for names in English-speaking countries.

Tom

Thomas Abraham
Thomas Abraham
SSCarpal Tunnel
SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)SSCarpal Tunnel (5K reputation)

Group: General Forum Members
Points: 4999 Visits: 2256
Thanks for the interesting question. I got caught out with "Phone". Don't really use this kind of thing in my work right now. But, as with many QotD topics, it keeps me looking at things I don't normally see, in case I ever need them.

Please don't go. The drones need you. They look up to you.
Connect to me on LinkedIn
Go


Permissions

You can't post new topics.
You can't post topic replies.
You can't post new polls.
You can't post replies to polls.
You can't edit your own topics.
You can't delete your own topics.
You can't edit other topics.
You can't delete other topics.
You can't edit your own posts.
You can't edit other posts.
You can't delete your own posts.
You can't delete other posts.
You can't post events.
You can't edit your own events.
You can't edit other events.
You can't delete your own events.
You can't delete other events.
You can't send private messages.
You can't send emails.
You can read topics.
You can't vote in polls.
You can't upload attachments.
You can download attachments.
You can't post HTML code.
You can't edit HTML code.
You can't post IFCode.
You can't post JavaScript.
You can post emoticons.
You can't post or upload images.

Select a forum







































































































































































SQLServerCentral


Search