|
|
|
Hall of Fame
       
Group: General Forum Members
Last Login: 2 days ago @ 1:49 AM
Points: 3,123,
Visits: 4,310
|
|
Comments posted to this topic are about the item Bad data import
____________________________________________ Space, the final frontier? not any more... All limits henceforth are self-imposed. “libera tute vulgaris ex”
|
|
|
|
|
SSCrazy Eights
        
Group: General Forum Members
Last Login: Today @ 1:01 PM
Points: 9,367,
Visits: 6,463
|
|
|
|
|
|
SSCertifiable
       
Group: General Forum Members
Last Login: Today @ 11:51 AM
Points: 5,232,
Visits: 7,023
|
|
I got it right, because of elimination, but I don't understand the entire scenario.
Since the destination column is NVARCHAR(26), how can a value be outside the codepage range? I though codepages were relevant for non-Unicode data only?
Hugo Kornelis, SQL Server MVP Visit my SQL Server blog: http://sqlblog.com/blogs/hugo_kornelis
|
|
|
|
|
Hall of Fame
       
Group: General Forum Members
Last Login: 2 days ago @ 1:49 AM
Points: 3,123,
Visits: 4,310
|
|
Hugo Kornelis (4/29/2010) I got it right, because of elimination, but I don't understand the entire scenario.
Since the destination column is NVARCHAR(26), how can a value be outside the codepage range? I though codepages were relevant for non-Unicode data only?
in the situation that gave rise to this question, CODEPOINT identified the value of the character as 65533. As far as I am aware, there is no charater (in any character set, albeit unicode or non-unicode) with that value.
____________________________________________ Space, the final frontier? not any more... All limits henceforth are self-imposed. “libera tute vulgaris ex”
|
|
|
|
|
SSCertifiable
       
Group: General Forum Members
Last Login: Today @ 11:51 AM
Points: 5,232,
Visits: 7,023
|
|
stewartc-708166 (4/29/2010)
Hugo Kornelis (4/29/2010) I got it right, because of elimination, but I don't understand the entire scenario.
Since the destination column is NVARCHAR(26), how can a value be outside the codepage range? I though codepages were relevant for non-Unicode data only?in the situation that gave rise to this question, CODEPOINT identified the value of the character as 65533. As far as I am aware, there is no charater (in any character set, albeit unicode or non-unicode) with that value.
So that was a bug in the mainframe program that exportet the file, then?
Hugo Kornelis, SQL Server MVP Visit my SQL Server blog: http://sqlblog.com/blogs/hugo_kornelis
|
|
|
|
|
Hall of Fame
       
Group: General Forum Members
Last Login: 2 days ago @ 1:49 AM
Points: 3,123,
Visits: 4,310
|
|
Hugo Kornelis (4/29/2010)
stewartc-708166 (4/29/2010)
Hugo Kornelis (4/29/2010) I got it right, because of elimination, but I don't understand the entire scenario.
Since the destination column is NVARCHAR(26), how can a value be outside the codepage range? I though codepages were relevant for non-Unicode data only?in the situation that gave rise to this question, CODEPOINT identified the value of the character as 65533. As far as I am aware, there is no charater (in any character set, albeit unicode or non-unicode) with that value. So that was a bug in the mainframe program that exportet the file, then?
that is correct. identifying the record and referring it to the custodian facilitated the extract to be rectified.
____________________________________________ Space, the final frontier? not any more... All limits henceforth are self-imposed. “libera tute vulgaris ex”
|
|
|
|
|
SSCrazy
      
Group: General Forum Members
Last Login: 2 days ago @ 11:53 AM
Points: 2,672,
Visits: 2,416
|
|
stewartc-708166 (4/29/2010)
Hugo Kornelis (4/29/2010) I got it right, because of elimination, but I don't understand the entire scenario.
Since the destination column is NVARCHAR(26), how can a value be outside the codepage range? I though codepages were relevant for non-Unicode data only?in the situation that gave rise to this question, CODEPOINT identified the value of the character as 65533. As far as I am aware, there is no charater (in any character set, albeit unicode or non-unicode) with that value.
This is very interesting. I am wondering where to practically draw the line with regard to how high a number resulting from a CODEPOINT test to draw the line between alpha/numeric characters from any language and other symbols/characters. Looking at this reference: http://www.ssec.wisc.edu/~tomw/java/unicode.html I see there is a Unicode character(?) for the value 65533. Additional searching shows that Unicode goes up to 10FFFF or 1114111. I suspect the place to draw the line is either 65518 or 65276. I would certainly appreciate a link reference or a bit of feedback with regard to how to slice Unicode for the data flow in this question. Thanks.
|
|
|
|
|
Mr or Mrs. 500
      
Group: General Forum Members
Last Login: Wednesday, September 19, 2012 8:39 AM
Points: 595,
Visits: 1,226
|
|
Nice question.
Converting oxygen into carbon dioxide, since 1955.
|
|
|
|
|
SSCrazy
      
Group: General Forum Members
Last Login: 2 days ago @ 1:42 PM
Points: 2,012,
Visits: 2,839
|
|
65533 is a valid unicode number but represents a special replacement character. It is the highest value in the character set. dbowlin makes a good point about what values to check for. You would ahve to know the source and the target databases to know what's reasonable. I doubt that 65533 is reasonable under 99.9% of the cases but there may be rare instances.
Good question, though.
|
|
|
|
|
SSC-Enthusiastic
      
Group: General Forum Members
Last Login: Thursday, May 16, 2013 7:56 AM
Points: 109,
Visits: 564
|
|
| Yeah, sounds like a HIGH-VALUE declaration in the COBOL (or whatever) code.
|
|
|
|