Raghavendra Narayana


SQLServerCentral Article

Parallel Processing of Large Volume ETL Jobs

ETL processing generally involves copying or moving, transforming, and cleaning records or transactions from one or more sources. Most batch-processing and data-warehousing projects process millions of such records on a daily or weekly basis. Typically there is a staging area and a production area; records are cleaned, transformed, filtered, and verified as they move from staging to production. At that volume, this calls for set-based SQL queries and parallel processing across multiple processors/CPUs. This article focuses on the need for a set-based SQL approach and parallel processing when handling large volumes of ETL records programmatically.


2007-11-08

7,108 reads
