skipping the second line in a spreadsheet being read by ssis

  • hi, my new user provided a new multi tab (sales, customer etc. not what i really wanted either) excel spreadsheet for ingestion by our sales warehouse etl.   Its second line is his erp's column names.   The first line is somewhat close to what i asked for in column names.   for the moment im looking the other way .

    Is there an elegant way to skip the second line but not the first?   my primary goal is to determine the quality of his data before picking on smaller issues.   the second line has down arrows which makes me think it would act like a filter if i was perusing manually.

  • Can I cheat and do this in PowerQuery?

  • If any column has a value in the second row that is unique and identifiable you can add a conditional split to the dataflow where the unmatched output is directed to a table and the row(s) that match are directed to nothing, or to a rowcount variable. You can check the variable < 2 to verify that you didn't drop any unexpected rows.

  • thx ed b but i'd like to leverage the fact that there are distinguishable data types in this file.   some are int etc.   not sure if doing that would generate an additional problem or a need to temporarily treat all data as (n)varchar.

Viewing 4 posts - 1 through 4 (of 4 total)

You must be logged in to reply to this topic. Login to reply