Column-level collation to store multi-language characters

  • Morning all.

    We receive data from a 3rd party. The flat file is correct, in that it contains the right characters.

    Flat file data

    Emplea la calculadora como instrumento de exploración y verificación de resultados

    Database data

    Emplea la calculadora como instrumento de exploración y verificación de resultados

    I changed the column collation to Latin1_General_BIN and to Modern_Spanish_BIN and reimported the data, but the database still shows the wrong characters in place of ó (the same happens for other characters).

    The data that we receive is a mixture of English and Spanish (Mexican).

    What would be the best collation for the import so that ó is stored correctly? I would have thought a binary collation would do it, but that doesn't seem to be the case.

    Thanks in advance.

    Edit: the column is already NVARCHAR, but the file is not Unicode. If I try to change it to Unicode in the SSIS package, it no longer recognises the columns.

    Edit 2: it has to be the file not being Unicode. I inserted the value manually under a number of collations and it came back as it should for every collation tried. Next I will try to convert the flat file to Unicode before the import, since the import already performs a data conversion and still does not generate the right output.
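
The behaviour described above (correct characters when inserted manually, wrong ones after the file import) is what an encoding mismatch at read time would produce, independent of collation. A minimal sketch, assuming the file is actually UTF-8 and is being decoded with the Windows-1252 code page; neither encoding is confirmed in this thread:

```python
# Assumption (not confirmed in the thread): the flat file is UTF-8,
# but the import decodes it as Windows-1252 (cp1252).
original = "exploración y verificación"

# Bytes as they would sit on disk in a UTF-8 file.
raw_bytes = original.encode("utf-8")

# What a reader using cp1252 would turn those bytes into.
misread = raw_bytes.decode("cp1252")
print(misread)

# Undoing the wrong decode recovers the text, so the bytes were never damaged;
# only the interpretation of them was wrong.
repaired = misread.encode("cp1252").decode("utf-8")
print(repaired == original)
```

If this assumption holds, the fix belongs in how the file is read (or in re-encoding the file), not in the column collation.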

    How are you importing the data? For example, if you're using BULK INSERT, the probable way to fix this is to use CODEPAGE = 'RAW', which will bring the data in with no code page conversion.

    --Jeff Moden

    The data is loaded via SSIS. We receive around 10 files, which are loaded into holding tables, and a very large set of manipulations is then run on the data to create what we need for our apps.

    The process is run so infrequently that we have decided to download the files beforehand, open each one in Notepad (or similar), save it as Unicode, and then upload it to the servers.

    I am going to investigate a C# or VB script that will do the conversion for us before the file is passed to the data flow task.
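
The manual "open and save as Unicode" step could be automated along these lines. This is a sketch in Python rather than the C# or VB mentioned above, and it rests on assumptions: the function name `convert_to_utf16` is hypothetical, the source encoding defaults to UTF-8 (the thread does not say what the file really is), and "Unicode" is taken to mean UTF-16 little-endian with a byte order mark.

```python
from pathlib import Path


def convert_to_utf16(src: Path, dst: Path, src_encoding: str = "utf-8") -> None:
    """Re-encode a text file as UTF-16 LE with a BOM.

    src_encoding is an assumption; set it to whatever the supplier
    actually uses (for example "cp1252").
    """
    # newline="" keeps the original line endings untouched on read and write.
    with open(src, "r", encoding=src_encoding, newline="") as fin, \
         open(dst, "w", encoding="utf-16-le", newline="") as fout:
        fout.write("\ufeff")  # byte order mark, written once at the start
        for line in fin:      # stream line by line so large files stay cheap
            fout.write(line)
```

A call such as `convert_to_utf16(Path("in.txt"), Path("in_utf16.txt"))` would run before the data flow task, with the flat file connection then pointed at the converted file and marked as Unicode. If the source encoding is guessed wrongly, decoding may either fail or silently produce the same wrong characters, so it is worth checking one known line after conversion.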
