Merge two big datasets

  • I have two datasets one of 15000+ records and other with 5000+ records.I need to merge these two sets based on Name which may not be in the same format for EX (VISTA HEALTH, FT. SMITH) and(VISTA HEALTH SERVICES, FORT SMITH) and finally create a file which should have both matching and non-matching records from both datasets.

    Is there any better way do this merge.Please suggest

    Thank you.

  • HaraPriya (3/19/2014)


    I have two datasets one of 15000+ records and other with 5000+ records.I need to merge these two sets based on Name which may not be in the same format for EX (VISTA HEALTH, FT. SMITH) and(VISTA HEALTH SERVICES, FORT SMITH) and finally create a file which should have both matching and non-matching records from both datasets.

    Is there any better way do this merge.Please suggest

    Thank you.

    What do you mean by "Merge": is that MERGE update, JOIN, UNION, INTERSECT ?

    In any event, that [Name] column is a poor excuse for a key. Perhaps what you need is a many-to-one bridge table that maps one or more names to an ID.

    http://jsimonbi.wordpress.com/2011/01/25/populating-a-many-to-many-bridge-table/

    "Do not seek to follow in the footsteps of the wise. Instead, seek what they sought." - Matsuo Basho

Viewing 2 posts - 1 through 1 (of 1 total)

You must be logged in to reply to this topic. Login to reply