You raise many valid points, but you miss a few key ones related to capitalization:
1. Lower case letters beginning a name, such as "d'Angelio"
2. Split surnames, such a "von Doenitz" (This is Latin-ized. The correct spelling is "D o(with umlaut) n i t z")
3. Unusual punctuation characters, such a "Zu!tic" - when pronounced, the "!" is a verbal click
While some of these are not storage issues, per se, the way they are stored affects later processing using the names. Comparing names from disparate systems often raises various issues, such as:
1. Is "d'Angelio" the same as "DANGELIO"?
2. Is "de la Fayette" the same as "LAFAYETTE"?
3. Is "von Helsing" the same as "VONHELSING" or just "HELSING"? In the latter case, the originating system sliced off the "von" as a middle name!
The answer to all of these examples is "Probably."
Middle names can often be an important discriminator. Consider VICTOR IGNATIUS SMITH and his son VICTOR IGOR SMITH. Their legal names are distinct; thus, there is no suffix. But abbreviating the middle name to a single letter yields VICTOR I SMITH and, if they reside at the same address, may cause infinite mischief during later processing.
With these thoughts in mind, it is important to know how these names will be used later, before defining the structure that will hold them.