Raghavendra Narayana


SQLServerCentral Article

Parallel Processing of Large Volume ETL Jobs

ETL processing, generally involves copying/moving, transforming, cleaning the records/transactions from one or multiple sources. Most of the batch processing or warehousing projects involve such data processing in millions on daily/weekly basis. Typically, there is a Staging area and production area. Records are cleaned, transformed, filtered and verified from staging to production area. This demands SQL Set theory based queries, parallel processing with multiple processors/CPU. The article focuses on need of SQL Set theory approach and parallel processing while processing large volume of ETL records using programming approach.

2.44 (9)

You rated this post out of 5. Change rating

2007-11-08

7,076 reads

Blogs

Checking Myself with GenAI

By

I had a suggestion from somone on a place where AI helps them and...

T-SQL Tuesday #186 Roundup - Managing Agent Jobs

By

Thank you to everyone who contributed to this month’s T-SQL Tuesday! Here’s a roundup of...

Speaking at the NYC Lunch and Learn DevOps Devour Hour

By

This Friday is the NYC DevOps Devour hour, which is actually 3 hours. Plus...

Read the latest Blogs

Forums

Deadlocks and Backups - Need help

By Mike-342950

Hey everyone, please excuse my ignorance, I'm just .NET developer and have used SQL...

Does Version Control Scare You

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Does Version Control Scare You

Unlimited Text

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Unlimited Text

Visit the forum

Question of the Day

Unlimited Text

If I want to get unlimited amounts of data back from a varchar(max)/nvarchar(max) column, what do I run?

See possible answers