Raghavendra Narayana


SQLServerCentral Article

Parallel Processing of Large Volume ETL Jobs

ETL processing, generally involves copying/moving, transforming, cleaning the records/transactions from one or multiple sources. Most of the batch processing or warehousing projects involve such data processing in millions on daily/weekly basis. Typically, there is a Staging area and production area. Records are cleaned, transformed, filtered and verified from staging to production area. This demands SQL Set theory based queries, parallel processing with multiple processors/CPU. The article focuses on need of SQL Set theory approach and parallel processing while processing large volume of ETL records using programming approach.

2.44 (9)

You rated this post out of 5. Change rating

2007-11-08

7,091 reads

Blogs

Monday Monitor Tips: ServiceNow Integration

By

Earlier this year I visited a customer that was using the Redgate Monitor webhook...

Inspecting TSQL Abstract Syntax Trees

By

TSQL ScriptDOM is a useful library for parsing scripts into an abstract syntax tree....

Becoming a creator, my website experience

By

Over the past few months, I have debated starting a new blog to discuss...

Read the latest Blogs

Forums

Cannot get KB959420

By fabio@fullin.com.ar

Hi, we lost our sever sql2000 To restore database to a new one we...

Cannot get KB959420

By fabio@fullin.com.ar

Hi, since we lost our sql server 2000 server we are trying to restore...

Do You Really Need HA?

By Steve Jones - SSC Editor

Comments posted to this topic are about the item Do You Really Need HA?

Visit the forum

Question of the Day

The New Database CheckDB

I run this code:

create database experiment
go
use Experiment
go
select DATABASEPROPERTYEX('Experiment', 'LastGoodCheckDbTime')
What is returned?

See possible answers