  • The main problem with the "check the last day's inserts/updates/deletes" approach is always re-running the warehouse population. With that logic you can end up inserting duplicate rows if you need to run a population more than once during the day.

    That's why we stamp each row with a year/month column (format YYYYMM) and use that to drive the load logic.

    I.e. delete where YYYYMM = 200604, then insert where YYYYMM = 200604, etc.
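
    The delete-then-insert idea above can be sketched as follows. This is only a minimal illustration under stated assumptions: it uses Python's built-in sqlite3 module, and the table name fact_sales, its columns, and the helper reload_period are hypothetical names invented for the example, not anything from the article.

```python
import sqlite3


def reload_period(conn, yyyymm, rows):
    """Replace all warehouse rows for one YYYYMM period in a single transaction.

    fact_sales and its columns are hypothetical names used for illustration.
    """
    with conn:  # commits on success, rolls back if an exception is raised
        # Clear whatever an earlier run loaded for this period.
        conn.execute("DELETE FROM fact_sales WHERE yyyymm = ?", (yyyymm,))
        # Reload the full period from the source extract.
        conn.executemany(
            "INSERT INTO fact_sales (yyyymm, order_id, amount) VALUES (?, ?, ?)",
            [(yyyymm, order_id, amount) for order_id, amount in rows],
        )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE fact_sales (yyyymm INTEGER, order_id INTEGER, amount REAL)"
)

source_rows = [(1, 10.0), (2, 25.5)]
reload_period(conn, 200604, source_rows)
reload_period(conn, 200604, source_rows)  # second run on the same day

row_count = conn.execute(
    "SELECT COUNT(*) FROM fact_sales WHERE yyyymm = 200604"
).fetchone()[0]
```

    Because the delete and the insert share one transaction, running the load twice for the same period leaves the same rows as running it once, which is the point of keying the logic on YYYYMM.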

    Good article, by the way. Highlights a lot of potential strategies.

    Sigh. If only we could get the OLTP guys to set databases up properly in the first place, life would be a lot easier.

  • I have only read 10% of the article so far, but I plan to read all of it when I have time. This is the kind of article that I'm happy to put in my favorites list, because it brings some real value instead of talking about abstract things like the strategic advantage of BI for companies.

    And the language was just fine for me 🙂 (Italian is my mother tongue)


  • English is only my third language, but I can understand every piece of this article.  Probably because I have been working in data warehousing for more than 10 years.

  • Might be a typo, but you have it twice in the article. It's Ascential, with an l, not Ascentia. Also, Ascential is the name of the previous vendor (now IBM). Its ETL software was called DataStage, now branded by IBM as WebSphere DataStage.

  • Thanks for the correction, Djoni. You are right, Ascential is now WebSphere.

    And thanks for the criticism, Bill Edwards; I will try to improve in my next articles.

