Remove Duplicate Rows in SSIS Data Flow.

  • I am pulling data from Teradata in SSIS and building a new table to be placed in SQL. Along the way I am building a key. Before inserting the data into the SQL table, I would like to remove all duplicate records that have the same key, keeping one record for each key.

    Any ideas on how to do this from within SSIS?

    Example:

    bobsmith 21 kansas city

    bobsmith 21 kansas city

    Two records with the key bobsmith. I want to remove the extras, however many there are, and keep just one.

    Thanks,

    Ad.

  • An Aggregate transformation in your data flow may be able to do what you are looking for, depending on your other columns.
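
    To make the "depending on your other columns" caveat concrete, here is a rough Python sketch of what grouping on every column amounts to. It is not SSIS code, and the rows and column layout (key, age, city) are made up for illustration. Identical rows collapse into one, but rows that share a key and differ in any other column all survive.

```python
# Hypothetical sample rows: (key, age, city). The layout is an assumption
# made for illustration, not something taken from the original package.
rows = [
    ("bobsmith", 21, "kansas city"),
    ("bobsmith", 21, "kansas city"),   # exact duplicate of the first row
    ("bobsmith", 22, "kansas city"),   # same key, different age
]


def group_by_all_columns(rows):
    """Keep one copy of each distinct row, as grouping on every column would."""
    seen = set()
    distinct = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            distinct.append(row)
    return distinct


distinct_rows = group_by_all_columns(rows)
for row in distinct_rows:
    print(row)
```

    In this sketch two bobsmith rows remain, because the ages differ. If that can happen in your data, grouping alone will not leave exactly one row per key.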

  • I cannot believe I missed this. I was able to do what I wanted with my data by using the "Sort" transformation, sorting on my key and removing duplicate rows. It works very well.

    Thanks for the reply.

    Ad.
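
    For anyone reading later, the effect of sorting on the key and dropping rows with duplicate sort values can be sketched in Python roughly as follows. This only illustrates the logic and is not SSIS code. The sample rows and the `key_index` parameter are hypothetical. The sketch keeps the first row seen for each key, whereas in SSIS you should not rely on which of the duplicate rows is kept.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows: (key, age, city). Illustrative data only.
rows = [
    ("bobsmith", 21, "kansas city"),
    ("janedoe", 34, "topeka"),
    ("bobsmith", 21, "kansas city"),
]


def sort_and_drop_duplicate_keys(rows, key_index=0):
    """Sort on the key column, then keep one row per distinct key value."""
    get_key = itemgetter(key_index)
    ordered = sorted(rows, key=get_key)  # stable sort: ties keep input order
    # groupby yields one group per run of equal keys; take its first row.
    return [next(group) for _, group in groupby(ordered, key=get_key)]


unique_rows = sort_and_drop_duplicate_keys(rows)
for row in unique_rows:
    print(row)
```

    Unlike grouping on every column, this leaves exactly one row per key even when the other columns differ, so it is worth checking that the discarded rows really are redundant.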
